Gene GM21_0899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0899 
Symbol 
ID8136220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1074229 
End bp1075296 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID644868515 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_003020724 
Protein GI253699535 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.4439400000000002e-32 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGGACG CATTCACCCC GACCTCGCTT CTGGTGACAG GAGGTGCCGG GTTCATCGGT 
TCCAATTTCA TCAACCACTT CATGGCCGGT AATCCCGGCT GCCGGGTCAT CAACCTGGAC
CTTTTGACCT ACGCCGGGAA CCTGAAAAAC CTCGCTGCCG TTGAGGGAAA CCCCGCCTAC
CGCTTCGTGA AAGGGGACAT CTGCGACGCC GGCCTCGTGG CCGGACTCCT CGCCGAGGAA
AAGGTGGACG CCGTGGTGCA TTTCGCCGCC GAATCCCACG TGGACCGTTC CATCACCGGC
CCCGACATCT TCGTGAGGAC CAACGTCCTC GGGACCCAGA CGCTGCTCGA AGCAAGCCGC
CTGCACGCGG AGCGCGTTGC CGGTTTCCGG TTCCTGCAGG TATCCACGGA CGAGGTGTAC
GGCAGCCTCG GCGCCCAAGG GTACTTCACC GAGGAGACGC CGCTGGCGCC CAACTCCCCC
TACTCGGCCA GCAAGGCGGG GGCGGACCTT TTGGTGCGCG CCTACTCCGA GACCTTTGGC
CTTGCCACCC TGAACACGCG CTGCTCCAAC AACTACGGCC CGTACCACTT CCCGGAGAAG
CTGATTCCCC TCATGATCCA CAACATCCTC AAGAAGAAGC CGCTGCCGGT GTACGGAGAC
GGGCTGAACG TGAGGGACTG GCTGCACGTG AAGGACCATT CCGCCGCCAT CGAGCGGGTG
CTCAAAAAGG CAAAGCCGGG GGAGATCTTC AACGTCGGCG GCAACAACGA GTGGAAGAAC
ATAGACATCG TGAACCTGGT CTGCGACCTG ATGGACCAAA GGCTCGGGCG GCGCCCCGGC
GAGAGCAGGG GACTGATCGC CTTCGTCCAG GACCGCAAGG GGCACGACCG CCGCTACGCC
ATAGACGCCT CCAAGCTGAA GCGGGAGCTT TCCTGGGAAC CGAGCTACAC CTTCGAGCGC
GGCATCGCCG AGACCATCGA CTGGTACCTG GCCAACCAGG GGTGGGTCGA GGAGGTCGCT
TCCGGCGCCT ACCGCGAATA CTACGAAAAG CAGTACGGGC AGCAGTAA
 
Protein sequence
MQDAFTPTSL LVTGGAGFIG SNFINHFMAG NPGCRVINLD LLTYAGNLKN LAAVEGNPAY 
RFVKGDICDA GLVAGLLAEE KVDAVVHFAA ESHVDRSITG PDIFVRTNVL GTQTLLEASR
LHAERVAGFR FLQVSTDEVY GSLGAQGYFT EETPLAPNSP YSASKAGADL LVRAYSETFG
LATLNTRCSN NYGPYHFPEK LIPLMIHNIL KKKPLPVYGD GLNVRDWLHV KDHSAAIERV
LKKAKPGEIF NVGGNNEWKN IDIVNLVCDL MDQRLGRRPG ESRGLIAFVQ DRKGHDRRYA
IDASKLKREL SWEPSYTFER GIAETIDWYL ANQGWVEEVA SGAYREYYEK QYGQQ