Gene Phep_3865 details

Gene Information

Locus tag: Phep_3865
Symbol:
ID: 8254999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4637782
End bp: 4640538
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 45%
IMG OID: 644937529
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003094118
Protein GI: 255533746
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
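
The coordinate and length fields above agree with one another under the usual conventions. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a gene length that includes the stop codon (the function names are hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length = gene_length(4637782, 4640538)
print(length)                  # 2757
print(protein_length(length))  # 918
```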


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.160045
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTT TAGGGATGTT ATGCATCAGC TGCCTGTGTT TTGCAGCAGG CAGTGCCACG 
GCCCGGCAAC ATATCCCTCT TGAAGGAACC TGGCAGGTTA AACTGGATTC GGCAAATGTA
GGTGTACAGG AAAAATGGTA TCATCAGCAA TTCAGCCAGC GTATCCAACT ACCCGGAACT
TTGGATGACG CAGGGCTGGG CAGGTCAAAT AACCTCTCTG CAGATAAACT GGTTAAAGAT
GTTTTGATCA ACCTGATCAG AAAACATACA TATATTGGTG TGGCATGGTA TGCGAGGGAG
ATCCTCATTC CCAAAGACTG GAAAGATAAA GACATCAGCC TGTACCTGGA ACGGGTGATC
TGGAACACGA GGGTCTGGAT TGACGGACAG GAGGCTGGGG TGCAGGAAAG TCTGAGTGTT
CCCCATCGCT TTGAGCTGAG TGCTTTGGCA AAGCCAGGCC GGCACCGGCT TGTTATCCGC
ATAGATAACA GCAAACAATA CGACATGACC CATCTCAATA TGGCCCATGC CTATACTGAT
GGTACACAGA TTATCTGGAA TGGTGTAATA GGCAAAATGG AACTCATGGC AAAGGACAAA
ATAAACATTG CCACATTGCA AACCTATCCC CGTCTCAAAG ATAAATCTGT AAATGTAATT
GCTACATTGC AAAATGGCCT TAAACAAAGT AAAAAGGGGA TATTGCAGCT CCAGGTTATC
GGAAAAGACA AACGGATTGT CGCAAACCGT AGCATACCCG TCAATCTTGC CGCCGGTGAT
ACCCGGCAGG AAATCAATAT CCCTTTAGGC AAAGACGCCC TGCTTTGGGA CGAATTCAAC
AGCAATCTTT ATGTGCTAAA AGCACAGTTG ACCATTAGCG GAACTTCTTT TAAAGATGCA
AGTTCAACAA CGTTTGGCCT GCGGGAGATT ACAAACCAGG GCAGTACCCT GCAGGTTAAC
GGCCGCAGGG TATTTTTAAG GGGCACGCTT GAATGCAACA TTTTTCCATT AACCGGACAT
CCGCCTATGG ATAAAAAAGG CTGGGTCAAA GTGTTTGGTA CCGCTAAGGC GTATGGCTTA
AACCATCTTC GTTTTCATTC CTGGTGTCCG CCAAAAGCAG CTTTTGAAGT AGCTGATTCA
CTGGGTTTTT ACTTACAGGT AGAATTGCCG TTATGGAGCC TTAAAACCGG GGAAGACAAA
AACACCAATC GCTTTATTGA AGAAGAGGCC CAAAGGATCA GTTCGGAATA CGGAAATCAT
CCTTCTTTCT GTCTGTGGTC TTTGGGTAAT GAACTTCAGG GAGATTTTAG CTGGCTGGCA
CAACTGCTGC AAAAATTAAA AATGAAAGAT AAACGTCACC TTTATACCAC TACTACTTTT
ACATTTCAGA AAGACCATGG CCGCTGGCCA GAACCAGGAG ATGATTATTT CATTACGCAA
TACACTAAAA AAGGCTGGGT GCGCGGACAG GGTATATTCA ATACCTATGC GCCAAATTTC
TCTACAGATT ATACAAAAGC CATAGATAGC TTACCCGTAC CCTCCATTAC GCATGAGATC
GGGCAGTACT CCGTTTATCC AAACTTAAAA GAAGTACCAA AGTATACCGG TGTGCTGGAG
CCTGTAAATT TCAAGGCCAT CAGCAAAGAC CTGCAAAGAA AAAACATGCT GTCGCTAGCT
GGTCAGTTTA CCCTGGCCAG TGGTAAGTTC TCGGCCAGCC TTTACAAAGA GGAAATTGAA
AGAGCCCTTA AAACTAAAGG CTTAAGCGGC TTTCAACTGC TGGATCTTCA TGATTTCCCT
GGTCAGGGTA CTGCCTTGGT AGGCATCCTT GATGCTTTCT GGGACAGTAA AGGTTTAGTT
TCTCCGGCAG AGCACCGTAT GTATACTGCA GCTATAGTGC CGTTAATCCG GTTTTCGAAG
GCAGCTTATA CCAATGCCGA AATTTTTGAA GCAGATGCCG AGGTTGCCAA TTTCAGTAAT
AAGGCATTAC AGCAAGTTAC ACCGCTATGG ACTGTTAAAA ACGATAAAGG AGAGACACTG
TTCAGTGGAG CACTAGCCGC TAAAGATATC CCGCTGGGCA ATGGAATTGG CCTTGGTAAA
ATTAACTTTA GTTTAAAAGA CATAAAAAAA GCCACGCACC TCATAGTAGA GCTGCAGCTC
AAGGGTACAG TAAGCAAAAA TAAGTGGAGT ATTTGGGTAT ACCCAGAACA ACCTGGAACT
GCACCGAAAG ATATGGTGTT CGCCACTTCT TTATCTCAGG CACTTAAACA CCTGAATGAA
GGCAGGAAAG TATTGCTCAA TCCGGATACT ACTCATATAA ATGGCGTGCA GGGTCGTTTC
GCTCCTGTAT TCTGGAGCCC TGTCCATTTC CCTAACCAGC CAGGGACCAT GGGGCTGCTG
TGCGATCCGG CTCATCCGGC ACTGGCAGAT TTTCCAACAG ACTTTTACAG CAACTGGCAA
TGGTGGGACC TCATTACGGC ATCCAAAACT ATGATTCTGG ATTCCGTTCC GGCAGTAGAT
CCGATTGTCA GGATCATCGA TAATTTTTAC AAGAACAGAA AAATGGCCAA TATTGTAGAG
GCCAGAGTTG GAAAGGGGCA GCTCATCATC TGTTCTATGG ATATTACTAC CAACCTGGAA
AAAAGACCGG CGGCCAGGCA ATTAAGGTAC AGTCTGGAGC AATATATGGG CAGTAATAAA
TTTAACCCGG CAGTAACGCT GAGTACTGGC GATCTGGAGC AACTGATAAA AGAGTAA
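
The record reports a GC content of 45% for this gene. As a rough sketch of how such a figure could be recomputed from the sequence as displayed (blocks of ten bases separated by spaces), assuming GC content means the fraction of G and C among all bases; `gc_content` is a hypothetical helper name:

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()  # drop the block spacing and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input, not the gene itself: 4 of 8 bases are G or C.
print(gc_content("ATGC ATGC"))  # 0.5
```

Pasting the full gene sequence into `gc_content` and multiplying by 100 gives a percentage that can be compared with the reported value.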
 
Protein sequence
MKFLGMLCIS CLCFAAGSAT ARQHIPLEGT WQVKLDSANV GVQEKWYHQQ FSQRIQLPGT 
LDDAGLGRSN NLSADKLVKD VLINLIRKHT YIGVAWYARE ILIPKDWKDK DISLYLERVI
WNTRVWIDGQ EAGVQESLSV PHRFELSALA KPGRHRLVIR IDNSKQYDMT HLNMAHAYTD
GTQIIWNGVI GKMELMAKDK INIATLQTYP RLKDKSVNVI ATLQNGLKQS KKGILQLQVI
GKDKRIVANR SIPVNLAAGD TRQEINIPLG KDALLWDEFN SNLYVLKAQL TISGTSFKDA
SSTTFGLREI TNQGSTLQVN GRRVFLRGTL ECNIFPLTGH PPMDKKGWVK VFGTAKAYGL
NHLRFHSWCP PKAAFEVADS LGFYLQVELP LWSLKTGEDK NTNRFIEEEA QRISSEYGNH
PSFCLWSLGN ELQGDFSWLA QLLQKLKMKD KRHLYTTTTF TFQKDHGRWP EPGDDYFITQ
YTKKGWVRGQ GIFNTYAPNF STDYTKAIDS LPVPSITHEI GQYSVYPNLK EVPKYTGVLE
PVNFKAISKD LQRKNMLSLA GQFTLASGKF SASLYKEEIE RALKTKGLSG FQLLDLHDFP
GQGTALVGIL DAFWDSKGLV SPAEHRMYTA AIVPLIRFSK AAYTNAEIFE ADAEVANFSN
KALQQVTPLW TVKNDKGETL FSGALAAKDI PLGNGIGLGK INFSLKDIKK ATHLIVELQL
KGTVSKNKWS IWVYPEQPGT APKDMVFATS LSQALKHLNE GRKVLLNPDT THINGVQGRF
APVFWSPVHF PNQPGTMGLL CDPAHPALAD FPTDFYSNWQ WWDLITASKT MILDSVPAVD
PIVRIIDNFY KNRKMANIVE ARVGKGQLII CSMDITTNLE KRPAARQLRY SLEQYMGSNK
FNPAVTLSTG DLEQLIKE
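
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that step, assuming the codon-to-amino-acid assignments of table 11 match the standard code (they differ in permitted start codons, which this sketch does not handle; this gene begins with ATG, so it is unaffected). The names `CODON_TABLE` and `translate` are hypothetical:

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop the block spacing and newlines
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAATTTT TAGGGATGTT ATGCATCAGC TGCCTGTGTT TTGCAGCAGG CAGTGCCACG"
print(translate(first_line))  # MKFLGMLCISCLCFAAGSAT
```

The output matches the first twenty residues of the protein sequence listed above.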