Gene Phep_3871 details


Gene Information

Locus tag: Phep_3871
Symbol:
ID: 8255005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4648446
End bp: 4651220
Gene Length: 2775 bp
Protein Length: 924 aa
Translation table: 11
GC content: 46%
IMG OID: 644937535
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003094124
Protein GI: 255533752
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
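
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4648446
end_bp = 4651220

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2775
print(protein_length_aa)  # 924
```

Both results agree with the reported Gene Length (2775 bp) and Protein Length (924 aa).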


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAT TTAAATCCCT GTTGTTTTTT TTACTGATAA CTTTTTTATT TAGTCCAATA 
CTCCGTGCCC AGCAGCAGGG AATAAGTGAA AATGTGCTTC CTATAGAGGG CATCTGGCAT
TTTAAGCTGG ACCCCTTTGA AACAGGTATC AATAGCAATG GTGTTCAGCT GCTCCCATCA
CTTGCAGAGA CCATTACCTT GCCAGGTTCT ACAGACCAGG CAGGCAAAGG GTATCAGACA
CAGGCAATGA CTTCGATCAG GCTGACAAGG CCATTTGAAT ACAAGGGGAT AGCCTGGTAT
GAGAAAGAGA TTTTTGTGCC ACTGGAATGG AAGGACAAGG AGATCCAACT CTACTTAGAG
CGTGCGCATT GGGAAACCAG GGTATGGATT AACGATAAAC CCATCGGAAA AAGGGAAAGC
CTTTCGGTGC CCCATATTTA TTCCATCACT GCGCTTGTTC GGCCCGGAAA GAAGAATAAG
ATCCGGATAA GGGTAAACAA TGAAAAGATA TATGATCTGG AATATGCCCA TGCCATTAGT
GCGGAAACCC AAACAAACTG GAATGGGATT ATCGGGAAAA TGCAGTTGCA GGCTAATGAT
AAGATTTATC TTGCCGATGT ACAGATTTAT CCCCATGCAG AAAAGAAAAC GGCTACAGCC
AAAGTACTGA TCAGCAATGC GGCAAAGAAA CAAGTAGAAG GCGAACTTTT TTTTGTCTGC
AGCCTTAAAA AAGCAGGTGC CGAACCTATG CCTGTACACC GTATAAAATT TTCCGGTCAG
GATTCGGTCA TTGCCCTTAC AACGGAAATT CCATTGGGAG AGCAGATCCA ATTGTGGGAT
GAGTTTGATC CTAATTTATA CCAGTTGAAT GTAAGCTTAA ATGCCGGGGC AGAGGGGCAG
TTAAGCAGGG CTGCTAAAAC GCTTGATTTT GGTATAAGGA CATTGGCTAC CCGGGAAACA
CAATTCTTAT TCAATGGTAT TCCCACCTTT ATCAGGGGAA CGGTAAACTC TTCGGAGTTT
CCATTGACGG GCTATCCGCC TACCAGGCTG AAAGAATGGC TCCGCATTTT TAAGACCTGC
AAGGATTATG GATTAAATGC CATGCGCTTT CATAGCTGGT GTCCGCCCGA AGCAGCCTTT
GAAGCCGCAG ATCAGTTGGG TTTTTACCTG CAGGTAGAAA ATCCGGACTG GAGGTTTACT
GTAGGGAAAG ATGCGGCCGT GAACCGGTTC TTAAAAGAAG AAGCCGACAG GATATTGCAA
GCCTATGGCA ACCATCCTTC ATTTATTATG TTTTGTGAAG GAAATGAAAT GGTTGGGCCG
GCGGTAAAGG AGTTTCTGAC GGAACAGGTT AAACACTGGA AAGAGACCGA TCCAAGGCAT
TTATATACAG GGAGTGCGGC TTATCCCTTG ATTGCAGAAA ACCAGTTTCA TGTATTGTAT
GGGGCAAGAC CACACCGCTG GAAAGAAGGC CTGAAAAGCC GGTTTAATGT ACGTCCACTG
GATACAGAGT ATGATTATGG GGAGTATGTG AAGAAAAATA AGGAACCGAT GATTACCCAT
GAGATCGGCC AATGGTGTGC GTTCCCTGAT TTTGGTGAAA TTTCTAAGTA TACCGGGGTC
TTAAAACCCT ATAATTATGA ATTGTTCAGG GAGCTGTTGA GGGACCATCA GCTGATGGAT
CAGGCAGGGG ATTTTACCAG GGCTTCGGGG AAATTTCAGG TGATCATGAA AAAGGAGGAA
GTGGAATCTT ATTTACGTAC TCCGGGTTTT GGGGGCTATC ACATGCTCCA GTTAAATGAT
TTTCCGGGAC AGGGGACTGC CCCTGTGGGT GTGGTTGATA TTTTCTGGGA TCCGAAACCT
TATGTGACTG CCAAAGAATT TAGCAGGTTT CAGTCGGCCC GGGTGCCCTT GCTCAGAACG
GCCTCTTTTA CCTGGACGAA TGACCAGACT TTTAAGGCCA GGGCACAGTT TGCCAACTTT
GGGAAGTTAA GTATGGAAAA TGCGGCAGTA AGCTGGTCAT TAAAATATCC GGATGGGGGC
TTATATGCCG GAGGGCAATT TAACCGCTGC AATATTCCTG TAGGTAGTCC TTTTGAACTG
GGTGAGCTAT CTGTTCCATT GGATCGGGTA ACAGCGGCGA CGAAACTGGT GCTGACGATT
AGCGTGGATG GAACCACATA CAGCAACCAT TGGAACATAT GGGTATATCC TAAAACATTG
CCTTCACCTG AAAGGAAAGG GCTGATGGTT GCTGATCATT GGGATAGCAA AGTGAAGCAA
TACCTTGAAA AAGGGGGAAA GGTGCTTTTG CTGGCCGATA CCTCAAAAAT ACTTTCGGAT
GCCGATCCGG CATTTTCCGG GATTTCATGG AATACGGTAT GGTCTGGCAT GCCGCCAAAC
CTGCTGGGCA TTTTGTGTAA CCCGGAGCAT CCGGCACTGA AATACTTCCC TACAGCAGAA
CACTCTGACT GGCAGTGGTG GGATATTGTA CGCAATTCAA AGCCTATGGT ACTTGAACAG
ATGCCTTTTT CATTTAAGCC ACTGGTACAG ATGATCCCCG ACTGGAACAA TCCACGTAAG
ATAGCCCTGG TGTTTGAAGT TAAAATAGGA AAGGGGAGCC TGCTGGTATC GGCAGTAGAT
CTGAAAAACA ACCTGGACAA ACGCCCGGTG GCCCGGCAAC TTTTGTATAG CCTGAAGGCA
TACATGAACA GTGATAAATT TTTACCTTTA ACCGAAGTGC CAGCCCAGAT GATCGATATG
ATCTTTAAAA AATAA
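
The gene sequence is printed in space-separated groups of 10 bases, so the grouping has to be removed before any calculation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the last line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Last line of the gene sequence, exactly as printed in the record.
last_line = "ATCTTTAAAA AATAA"
tail = clean_sequence(last_line)

print(tail)       # ATCTTTAAAAAATAA
print(len(tail))  # 15
print(round(gc_fraction(tail), 3))
```

Passing the full gene sequence through the same two functions would give the overall GC fraction, which can then be compared with the GC content field of the record.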
 
Protein sequence
MMKFKSLLFF LLITFLFSPI LRAQQQGISE NVLPIEGIWH FKLDPFETGI NSNGVQLLPS 
LAETITLPGS TDQAGKGYQT QAMTSIRLTR PFEYKGIAWY EKEIFVPLEW KDKEIQLYLE
RAHWETRVWI NDKPIGKRES LSVPHIYSIT ALVRPGKKNK IRIRVNNEKI YDLEYAHAIS
AETQTNWNGI IGKMQLQAND KIYLADVQIY PHAEKKTATA KVLISNAAKK QVEGELFFVC
SLKKAGAEPM PVHRIKFSGQ DSVIALTTEI PLGEQIQLWD EFDPNLYQLN VSLNAGAEGQ
LSRAAKTLDF GIRTLATRET QFLFNGIPTF IRGTVNSSEF PLTGYPPTRL KEWLRIFKTC
KDYGLNAMRF HSWCPPEAAF EAADQLGFYL QVENPDWRFT VGKDAAVNRF LKEEADRILQ
AYGNHPSFIM FCEGNEMVGP AVKEFLTEQV KHWKETDPRH LYTGSAAYPL IAENQFHVLY
GARPHRWKEG LKSRFNVRPL DTEYDYGEYV KKNKEPMITH EIGQWCAFPD FGEISKYTGV
LKPYNYELFR ELLRDHQLMD QAGDFTRASG KFQVIMKKEE VESYLRTPGF GGYHMLQLND
FPGQGTAPVG VVDIFWDPKP YVTAKEFSRF QSARVPLLRT ASFTWTNDQT FKARAQFANF
GKLSMENAAV SWSLKYPDGG LYAGGQFNRC NIPVGSPFEL GELSVPLDRV TAATKLVLTI
SVDGTTYSNH WNIWVYPKTL PSPERKGLMV ADHWDSKVKQ YLEKGGKVLL LADTSKILSD
ADPAFSGISW NTVWSGMPPN LLGILCNPEH PALKYFPTAE HSDWQWWDIV RNSKPMVLEQ
MPFSFKPLVQ MIPDWNNPRK IALVFEVKIG KGSLLVSAVD LKNNLDKRPV ARQLLYSLKA
YMNSDKFLPL TEVPAQMIDM IFKK
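
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a simplified translator, not the tool that generated this record: it uses the standard codon assignments, which translation table 11 shares for every codon, and it ignores the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop 10-base grouping
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as printed in the record.
first_line = "ATGATGAAAT TTAAATCCCT GTTGTTTTTT TTACTGATAA CTTTTTTATT TAGTCCAATA"
last_line = "ATCTTTAAAA AATAA"

print(translate(first_line))  # MMKFKSLLFFLLITFLFSPI
print(translate(last_line))   # IFKK
```

The two outputs match the first 20 and the last 4 residues of the protein sequence above. The last line can be translated on its own only because it happens to start on a codon boundary.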