Gene GM21_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3371 
Symbol 
ID8138738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3904101 
End bp3905726 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content64% 
IMG OID644870989 
Producthydro-lyase, Fe-S type, tartrate/fumarate subfamily, beta subunit 
Protein accessionYP_003023154 
Protein GI253701965 
COG category[C] Energy production and conversion 
COG ID[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
[TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACTC CAGAGTTCAA GTACCAGGAA GCCTTCCCGC TCGGACCGGA CAGCACGAAG 
TACCGCCTGA TCCCCGATTC CAAGCAGTAC GTATCGGTGG CCAACTTCGA GGGGCAGGAG
ATCCTCAGGG TGGCGCCCGA GGCGCTCACC ATCGCGGCCA ACGCCGCAAT GAAGGACCTC
TCCTTCATGC TGCGCCCGGA GCACAACGAG CAGGTTGCCA AGATCCTTGC CGACCCGGAA
GCATCGCAAA ACGACAAGGG TGTCGCCCTC GCTTTCCTGA GAAACGCGGA GGTCGCAGCG
CAGCTCGAAC TTCCCATCTG CCAGGACACC GGCACCGCCA TCGTCATGGG GAAGAAAGGG
CAGCAGGTCT GGACCGGCGT CAACGACGAG GAGTACCTCT CCAAGGGGGT CTACAAGACC
TACACCGAGG AGAACCTGCG CTACTCCCAG ACCGTGGCTC TCGACATGTA CAAGGAGATC
AACACCGGCA CCAACCTCCC GGCCCAGATC GACGTCTACG CCACCAAAGG GGCCGAGTAC
AAGTTCCTCT TCATGGCCAA GGGGGGCGGC TCCGCCAACA AGACCTACCT CTACCAGGAG
ACCAAGGCGC TCCTTAACCC GGACAGCCTG ATCAAATTCC TGGTCGCCAA GATGAAGACC
CTGGGGACCG CGGCCTGCCC GCCGTACCAC TTGGCCTTCG CCATCGGCGG CACCTCAGCC
GAGGCGTGCC TTAAGACCGT GAAGCTCGCC AGCGCCAAGT ACCTCGACGC GCTCCCCACC
TCCGGCAACG ACGGCGGCCA GGCGTTCCGC GACGTCGAGC TCGAGCAGCA GCTCCTCAAG
GCGGCCCAGG AATCCGGCAT CGGCGCGCAG TTCGGCGGCA AGTACTTCGC CCACGACGTG
CGCATCGTCC GCCTGCCCCG CCACGGCGCC TCCTGCCCGG TCGGCATGGG GGTTTCCTGT
TCCGCCGACC GCAACATCAA GGCTAAGATC AACAAGGAAG GGATCTGGGT CGAGCAGATG
GACGACAACC CGGGCCGCCT GATCCCCGAG CGTTTCCGCG GCAAGCACGA GCACGGCGTC
AGGATCAACC TGAACCAGCC GCTCAAGGAC GTACTGGCCG AGCTGACCAA GCACCCGGTC
TCCACTCCGC TGCTCCTCTC CGGCACCATC GTGGTCGGCC GCGACATCGC CCACGCCAAG
TTCAAGGAGA TCCTCGACTC CGGCAAGCCG CTTCCCGAGT ACCTGAAGAA TCATCCGATC
TACTACGCGG GCCCGGCGAA AACCCCGAAA GGGAAACCCT CCGGCTCCTT CGGCCCCACC
ACCGCCGGCC GCATGGACTC CTACGTCGAC CTCTTGCAGG AAAACGGCGG CTCCATGATC
ATGATCGCCA AGGGCAACCG CTCCCAGCAG GTCACCGACG CCTGCAAGAA GCACGGCGGC
TTCTACCTGG GCTCCATCGG CGGCCCGGCT GCCATCCTGG CCGAGGAGAA CATCAAGAAG
GTCGAGTGCA TCGACTTCCC CGAGCTCGGC ATGGAAGCGG TCTGGAAGAT AGAGGTTGAG
GACTTCCCGG CGTTCATCCT GGTGGACGAC AAGGGGAACG ACTTCTTCAA GCAGCTGGGA
ATCTAG
 
Protein sequence
MATPEFKYQE AFPLGPDSTK YRLIPDSKQY VSVANFEGQE ILRVAPEALT IAANAAMKDL 
SFMLRPEHNE QVAKILADPE ASQNDKGVAL AFLRNAEVAA QLELPICQDT GTAIVMGKKG
QQVWTGVNDE EYLSKGVYKT YTEENLRYSQ TVALDMYKEI NTGTNLPAQI DVYATKGAEY
KFLFMAKGGG SANKTYLYQE TKALLNPDSL IKFLVAKMKT LGTAACPPYH LAFAIGGTSA
EACLKTVKLA SAKYLDALPT SGNDGGQAFR DVELEQQLLK AAQESGIGAQ FGGKYFAHDV
RIVRLPRHGA SCPVGMGVSC SADRNIKAKI NKEGIWVEQM DDNPGRLIPE RFRGKHEHGV
RINLNQPLKD VLAELTKHPV STPLLLSGTI VVGRDIAHAK FKEILDSGKP LPEYLKNHPI
YYAGPAKTPK GKPSGSFGPT TAGRMDSYVD LLQENGGSMI MIAKGNRSQQ VTDACKKHGG
FYLGSIGGPA AILAEENIKK VECIDFPELG MEAVWKIEVE DFPAFILVDD KGNDFFKQLG
I