Gene GM21_1936 details

Gene Information

Locus tag: GM21_1936
Symbol:
ID: 8137270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2246922
End bp: 2248274
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 61%
IMG OID: 644869550
Product: nucleotide sugar dehydrogenase
Protein accession: YP_003021747
Protein GI: 253700558
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
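
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Coordinates copied from the record above.
START_BP = 2246922
END_BP = 2248274


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1353 450
```

Both results agree with the Gene Length (1353 bp) and Protein Length (450 aa) fields.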


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 104
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAT GTGTGGTCGG ATCAGGCTAC GTGGGTCTCG TAGCCGGCAC CTGTTTTGCC 
GAAAGCGGTA ACGACGTCAT CTGTGTCGAC GTCGACAAGG ACAAGATAGA CGGTCTGAAG
CGCGGCGTCA TTCCCATCTA CGAGCCCGGC CTGAAGGAAA TGGTCTTAAG GAACTGCGAG
GAGGGGAGGC TCAACTTCAC CACCGACCTC GACCTGGCCG TCAAGGAGTC GCTGGTTTGC
TTCATCGCGG TCGGCACCCC CCCCGGCGCC GACGGCTCCG CCGACCTGCA GTACGTCCTC
TCCGTCGCCC GCTCCATCGG CCGCGCCATG GAGAGCTTCA AGATCATCGT CGACAAGTCC
ACCGTCCCGG TCGGGACAGC CGACAAGGTG CGCGCCGCCG TGAACGAGGA GCTCGCCAAG
CGCGGGACGC ATATAGAATT CGACGTGGTG TCCAACCCCG AGTTTTTAAA GGAAGGGGCC
GCCATCGACG ACTTCATGAA ACCCGACCGC GTCGTCATCG GTACCGACAA CGTGAGGACC
GCCGAGATTA TGAAGGAGCT CTACTCGGCC TTCATGCGCA AGTCCAACCG CCTGCTGGTG
ATGGACATCA GAAGCGCCGA GATGACCAAG TACGCCGCCA ACGCCATGCT CGCCACCCGC
ATCACGTTCA TGAACCAGAT CGCGAACCTC TGCGAGATGG TGGGCGCGGA CGTCATGGCG
GTTCGGGAGG GGATCGGCTC CGACTCCCGC ATCGGTTACG ACTTCCTCTT CCCCGGCGTC
GGCTACGGCG GCTCCTGCTT CCCCAAGGAC GTCAAGGCCC TGGTGAAGAC GGCGGACGAG
TGCAGCTACG ACTTCGTCCT TTTGAAGGCG GTGGAGACCG CCAATGAACG GCAAAAGGCG
ATCCTCTCCG ACAAGATACT GCGCCGTCTG GGAAGCGCAG GCGACAAGCC TCTGGCCGGC
AAGCGCTTCG CCATCTGGGG ATTGTCCTTC AAGCCCCGCA CCGACGACAT GAGAGACGCC
CCTTCGCTCA CCATCATCAA CAGGCTTTTG GAAATGGGAG CGAGCGTGCA CGCCCACGAC
CCCGAGGCGA TGAACGAGGC GAAGAAGCAT TTCGGCGACC GCATCAGCTA CAGCGTGAAC
AAGTACGACC TGATGAGAGG GGCCGATGCG CTCGTCGTCA TCACCGAGTG GAACGAGTAC
AGGAACCCCG ATTTCGACCG CATCAAGGAG CTCCTGATCA ACCCGATCAT CTTCGACGGC
CGGAACCTCT ACCACCCTGG CCGCATGAAG GAGGCCGGGT TCGAGTACCT CCCCATCGGC
CGAAACGGCG AGGCCGTCTG CGAAATGGAC TAA
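
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. A minimal sketch of that cleanup and of a GC-fraction calculation is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so its GC fraction need not equal the 61% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAAAGTAT GTGTGGTCGG ATCAGGCTAC GTGGGTCTCG TAGCCGGCAC CTGTTTTGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text (all lines joined) to the same functions gives the gene length and whole-gene GC fraction.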
 
Protein sequence
MKVCVVGSGY VGLVAGTCFA ESGNDVICVD VDKDKIDGLK RGVIPIYEPG LKEMVLRNCE 
EGRLNFTTDL DLAVKESLVC FIAVGTPPGA DGSADLQYVL SVARSIGRAM ESFKIIVDKS
TVPVGTADKV RAAVNEELAK RGTHIEFDVV SNPEFLKEGA AIDDFMKPDR VVIGTDNVRT
AEIMKELYSA FMRKSNRLLV MDIRSAEMTK YAANAMLATR ITFMNQIANL CEMVGADVMA
VREGIGSDSR IGYDFLFPGV GYGGSCFPKD VKALVKTADE CSYDFVLLKA VETANERQKA
ILSDKILRRL GSAGDKPLAG KRFAIWGLSF KPRTDDMRDA PSLTIINRLL EMGASVHAHD
PEAMNEAKKH FGDRISYSVN KYDLMRGADA LVVITEWNEY RNPDFDRIKE LLINPIIFDG
RNLYHPGRMK EAGFEYLPIG RNGEAVCEMD
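
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI base ordering (TCAG). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not handle; that is harmless here because the gene starts with ATG. The function name `translate` is a hypothetical helper, not an API of any particular library.

```python
from itertools import product

# Codon-to-residue assignments shared by the standard code and table 11,
# listed in the conventional NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGAAAGTAT GTGTGGTCGG ATCAGGCTAC GTGGGTCTCG TAGCCGGCAC CTGTTTTGCC"
print(translate(first_line))  # MKVCVVGSGYVGLVAGTCFA
```

The output matches the first two ten-residue blocks of the protein sequence above.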