Gene GM21_2537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2537 
Symbol 
ID8137879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2964590 
End bp2966647 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content65% 
IMG OID644870146 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003022336 
Protein GI253701147 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00000121125 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGGGG CGACCAGTGA AGCTGTACCG GGGCACTTTG GCATAGTGGA GTGGTTTAGG 
CCGGGCGAGC GGGAACGGGT GGAACGGGTC CTCTCCGACA TGAAGGCGAT CGGCGCCAAG
AGCCTCAGGA CCGGGATCTC CTGGGCGGAC TGGTACACGA GCGAAGGGAA GAAGTGGTAC
GACTGGCTCA TCCCGCGGCT GGCGCGGGAG GTGGAGCTCC TTCCCTGCGT GCTTTACACC
CCCCCCTCCA TCGGGATCGA GGCGAAGACC TCCTCCCCCC CGCGTCGTCC CAAGGATTAC
GCCGACTTCA TCGACCTCTT CATCACCGCC TTCGGGGAGC ATTTCGAGTA CCTGGAACTC
TGGAACGAGC CGAACAACCT GAGCGAGTGG GACTGGACCC TCGACCCGCA CTGGACCTCC
TTCGGCGAGA TGATCGGGGG GGCGGCCTAC TGGGCCAGAA AGCGCGGCAA GAAAACGGTC
CTCGGGGGGA TGAGCCCCAT CGACGGCCAC TGGCTCTGCC GGATGTTCGA GCTGGGGGTG
ATGGATTACA TCGACGTGGT GGGGATCCAC GGCTTCCCGG ACATCTTCGA TTACACCTGG
AAGGGGTGGC AGCGCAACAT CGCCATGGTG CGGGAAATCC TGGACGAGAG GGAGTGCGCC
TGCGAGATCT GGGTCACCGA GGCGGGGTTC TCCACCTGGC AGCACGACGA GTTCAAGCAG
GCAAAGGTCT TCCTCGATTT CCTCGCCGCC CCGGCGCAGC GCGTCTACTG GTACGGCGTG
GACGATCTCG ATCCGTCGCT TTCCGCGGTG GACCGCTACC ATCTGGACGA GCGCGAGTAC
TTTTTCGGCT TGAAGAAGGC GGACGGAGCG CCCAAGCTCC TCTACCGCCT GCTCCGGGAG
GGGACGCTTT CCTCCCTCAA GCGCGTGGTG GCCGCGGGGA GCGCCGCGAG GGTGGACGGC
GGCGGGGCGG AAAAGGCGGT GCTTGTCACC GGGGGGGCCG GGTTCATAGG GACCAACCTG
GTGCAGCACC TGGTGGCGCA GGGGGAGAGG GTGATCCTCT ACGACAACCT CTCCCGCGCA
GGGGTCGAGA AGAACCTCCT CTGGCTCATG GACAACTGCG GCGAAAGGCT GCAGGTGGTG
ATAGGTGACA CCCGCAACTC TCTCCTTCTG GAGCAGGCGG TAAGCGAGGC GAAGCAGGTC
TTCCACTTCG CGGCCCAGGT AGCGGTCACC ACCAGTATCG ACAACCCCGG GAACGACTTC
TCCGTCAACG CCCAAGGGAC CTTCTCGCTC CTGGAGGCGA TCCGCAAGGC GAAGACCCCT
CCTTCGCTTC TCTACACCTC CACCAACAAG GTGTACGGGG CCATCGAGGG GTGCGGCGTC
CGGAAAAACG GGGTGCGCTA CGAGCCGCTC GACCCGCAGC TCCGCTCCCA CGGACTGGGA
GAGGAGACCC CGCTCGATTT CCTGAGCCCC TATGGCTGCT CCAAGGGGTG CGCCGACCAG
TACGTCCTGG ACTATGCCCG CAGCTTCGGC ATCGCGGCGG CGGTCTTCCG GATGAGCTGC
ATATACGGTC CGCACCAGTA CGGCACCGAG GACCAGGGGT GGGTGGCGCA CTTCGCCATA
CAGACCATGA AGGGGGAGCC CATCACCCTC TACGGGGACG GCTGCCAGAT CCGGGATCTT
CTCTTCGTGG AGGACCTGGT GGACGCCATG TGCCGGGCGC GGGACATCAT GCCCCGCATA
GCCGGGCAGG CTTTCAACAT CGGAGGCGGC CCCGCCCGCA CCATAAGCCT CTTGGAGCTT
TTGGATCTTT TGCGCGAGCT GCACGGCGGA CTTCCCACCA TACTGCGCGA CGACTGGCGC
ACGGGGGACC AGAGATACTA CGTCTCGGAC ACCAGGAAGT TCTGCAAGGC GACCGGATGG
ACGCCGCGGC ACTCGGTGGC CGAGGGGGTG CGCAGGCTGT ACGACTGGCT CCTGGAAACA
ATGCACTCCC CGGCGCGCGG CGCCGGGAGC TTCGACAAGC AAAGGTACCC GGCGACGGGA
GCGGAGGCGA TCGGATGA
 
Protein sequence
MKGATSEAVP GHFGIVEWFR PGERERVERV LSDMKAIGAK SLRTGISWAD WYTSEGKKWY 
DWLIPRLARE VELLPCVLYT PPSIGIEAKT SSPPRRPKDY ADFIDLFITA FGEHFEYLEL
WNEPNNLSEW DWTLDPHWTS FGEMIGGAAY WARKRGKKTV LGGMSPIDGH WLCRMFELGV
MDYIDVVGIH GFPDIFDYTW KGWQRNIAMV REILDERECA CEIWVTEAGF STWQHDEFKQ
AKVFLDFLAA PAQRVYWYGV DDLDPSLSAV DRYHLDEREY FFGLKKADGA PKLLYRLLRE
GTLSSLKRVV AAGSAARVDG GGAEKAVLVT GGAGFIGTNL VQHLVAQGER VILYDNLSRA
GVEKNLLWLM DNCGERLQVV IGDTRNSLLL EQAVSEAKQV FHFAAQVAVT TSIDNPGNDF
SVNAQGTFSL LEAIRKAKTP PSLLYTSTNK VYGAIEGCGV RKNGVRYEPL DPQLRSHGLG
EETPLDFLSP YGCSKGCADQ YVLDYARSFG IAAAVFRMSC IYGPHQYGTE DQGWVAHFAI
QTMKGEPITL YGDGCQIRDL LFVEDLVDAM CRARDIMPRI AGQAFNIGGG PARTISLLEL
LDLLRELHGG LPTILRDDWR TGDQRYYVSD TRKFCKATGW TPRHSVAEGV RRLYDWLLET
MHSPARGAGS FDKQRYPATG AEAIG