Gene Phep_2919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2919 
Symbol 
ID8254030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3481229 
End bp3482326 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content47% 
IMG OID644936567 
ProductQuinoprotein glucose dehydrogenase 
Protein accessionYP_003093179 
Protein GI255532807 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID[TIGR03606] dehydrogenase, PQQ-dependent, s-GDH family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000852444 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATCC TCGCTATAAG TATGTTTGCC CTTTTAATGG CTTTTTTTGC CTGTAAAAAG 
AACAAATCCA ATACCCCTGA CGGCGAACAG CTGCCCGATG TGGAATTAAA GACCAGAACG
GTAAGTTCCG GCCTGTCGCA TGTATGGGAA ATGGTTTATG GCCCCGATCA GCAGTTGTGG
ATCACCGAAA GGGCTGGTAA GATCAGTCGC GTTAACCCGC AGACAGGTGC AGTGAGCCTG
CTGCTGAATG TGCCCGATGT GGTGTCCAAT GGTGAAGGTG GCCTGCTGGG TATGGCTATA
AACCCACAGT TTAGCACCAA TCCATGGGTA TATGTGGTAT ACAATTACAA TTCTGCTTCG
GGCTATAAAG AAAAGGTAGT GCGCTATACC TATTCGGGTA CTGCCTTAAC CTCGCCATTA
ACCATTCTCG ACCTTATCCC CGCATCCTCC ATCCATAACG GGTCCAGGTT GCTGATCAGC
AATGACCAAA AGCTTTTCAT CAGCACTGGT GATGCCAGTG AAGCTGCAAA TGCGCAAAAT
ACCGCTTCTT TATCAGGTAA AATATTACGC GTAAATATGG ATGGCAGCAT TCCTGTGGAT
AATCCGATAG CGAACAACAG GATATGGACT TACGGGCACC GCAATCCGCA GGGACTGGTG
CAGGTGGGCG CTAAGCTCTA TGCATCGGAA CATGGGCCAA ACAATGATGA TGAAGTAAAC
CTGATCCTGA AGGGCCGTAA CTATGGCTGG CCAAATGTGG AAGGTTTTTG CGATAAACCT
GCCGAGCAGA CTTTTTGCAG CGCCAACAAT GTGGTAGAGC CGCTGATGGC CTGGACACCT
ACTATTGCCA CATCCGGATT AACCTATTAC AATTCCGATC TGATTCCCCA GTTTAAAAAT
TCATTGCTGC TGCTCAGTTT AAAGGCCTCT AAATTTACCC AGCTTAAACT GAATGAGGCG
GGTGATAAAA TATTGGGCAG CAAAGATTTT TTTGTCAATG AATTTGGCCG CTTAAGGGCC
ATATGCCAGT CGCCCGACGG CAAAATATAC ATAGGCAGCA GCAATGGCAG CAACGATAAG
ATCATAGAAA TCAACTAA
 
Protein sequence
MKILAISMFA LLMAFFACKK NKSNTPDGEQ LPDVELKTRT VSSGLSHVWE MVYGPDQQLW 
ITERAGKISR VNPQTGAVSL LLNVPDVVSN GEGGLLGMAI NPQFSTNPWV YVVYNYNSAS
GYKEKVVRYT YSGTALTSPL TILDLIPASS IHNGSRLLIS NDQKLFISTG DASEAANAQN
TASLSGKILR VNMDGSIPVD NPIANNRIWT YGHRNPQGLV QVGAKLYASE HGPNNDDEVN
LILKGRNYGW PNVEGFCDKP AEQTFCSANN VVEPLMAWTP TIATSGLTYY NSDLIPQFKN
SLLLLSLKAS KFTQLKLNEA GDKILGSKDF FVNEFGRLRA ICQSPDGKIY IGSSNGSNDK
IIEIN