Gene GM21_3784 details


Gene Information

Locus tag: GM21_3784
Symbol:
ID: 8139158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4356992
End bp: 4358200
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 64%
IMG OID: 644871403
Product: threonine dehydratase
Protein accession: YP_003023561
Protein GI: 253702372
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
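
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 4356992
end_bp = 4358200
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The last codon is
# assumed to be the stop codon, which does not appear in the protein.
codons, remainder = divmod(span_bp, 3)

print(span_bp == gene_length_bp)         # True
print(remainder == 0)                    # True
print(codons - 1 == protein_length_aa)   # True
```

Under those assumptions the three fields agree: 1209 bp is 403 codons, and 403 codons minus the stop codon is 402 amino acids.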


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 109
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGATT ACAACCTGAT CGTCGACGCA GCCCAGAGAC TCAAGAAGAG GGTGCGCCGT 
ACGGAACTGA TCCACGCTCA GCATTTCAGC GAGCGGCTCG GCCTCCCCCT TTACTTCAAA
TGCGAGAACC TGCAGCGTAC CGGCGCCTTC AAGATCCGCG GCGCGCTCAA TTTCATGACG
GCGCAGCCGC GCCAGGCTCT GTCGGGTGGC GTGATCACCG CCTCGGCGGG GAACCACGCC
CAGGGTGTCG CCTTCTCCGC TGATCTCCTC GGGGTGAAGG CGGTGGTCTA CATGCCGGAG
AGCACCCCGC CGCAGAAGGT GTTCGCCACC CGGGACTACG GTGCGGAGGT CGTCCTGGAG
GGAAAGAATT TCGACGAGGC TTGCGCCGCC GCCCTGAGGC AGGCCGAGGC TACAGGAGCT
CTTTTCGTGC ACCCCTTCAA CGATCCCTTG GTGATGGCTG GACAGGGGAC CATAGGACTT
GAACTGCTCG AGGACCTTCC GGATCTGTCC AACGTGCTGG TCCCGATCGG GGGGGGAGGG
CTGATAGCCG GCATCGCCCG CGCCATCAAG GAGACCCACC CGCAGGTGAG GGTCATCGGC
GTCGAGGCCG CAGCGGCCCC ATCGATGCGC CAGGCGCTGC ACAAAGGGGA GGTCGTCACC
GTGCCGATAC GGGCGAGTCT CGCCGACGGT ATCGCGGTCA AGACCGCCGG GAGCAACACC
TTTCCCGTGG TAAAGGAATA CGTGGACGAG ATCGTACTGG TGGATGAGGA GGAGATCGCC
CTCGCCATCG TCTCGCTCAT GGAGCGGAAC AAGCTGATGG TGGAGGGGGC CGGCGCCGTG
GTGCTCGCGG CGCTTTTGAA CGGGAAGGTG AAGCGGATCT CCGGGAAGAC CGTGGCCCTT
CTTTCAGGCG GGAACATCGA CGTGAAGACC ATAGCGGTGG TCGTGGAGCG GGGGCTTTTG
GCCGCGGGGC GCTACCTGAA GCTGAAGATC GAGTTGGACG ACGTTCCCGG CGCGCTGGCG
CGGCTTGCCG CCGAAATCGC CGAGGCGCGG GCCAACATAT CCATCATCAC CCATGACCGC
CGCTCCGAGT CGCTTCCCAT CGGCAAGACC GAGGTGCTGG TCGAATTGGA GACCAGGGGT
CCTGAGCACA TCCAGGACGT CATCAGGCAC CTGAGCAAGC GGGAGTACCT GCTGGAAGTC
ATCAAATAA
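
The gene sequence is printed in blocks of 10 bases, 6 blocks per line, so the whitespace has to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. The sketch below shows one way to do that. The helper names join_blocks and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input; the full sequence would be passed in the same way.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTTGATT ACAACCTGAT CGTCGACGCA GCCCAGAGAC TCAAGAAGAG GGTGCGCCGT"

seq = join_blocks(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this one line only
```

Applied to the whole gene sequence (20 lines of 60 bases plus a final line of 9), len(seq) should match the 1209 bp reported under Gene Length.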
 
Protein sequence
MLDYNLIVDA AQRLKKRVRR TELIHAQHFS ERLGLPLYFK CENLQRTGAF KIRGALNFMT 
AQPRQALSGG VITASAGNHA QGVAFSADLL GVKAVVYMPE STPPQKVFAT RDYGAEVVLE
GKNFDEACAA ALRQAEATGA LFVHPFNDPL VMAGQGTIGL ELLEDLPDLS NVLVPIGGGG
LIAGIARAIK ETHPQVRVIG VEAAAAPSMR QALHKGEVVT VPIRASLADG IAVKTAGSNT
FPVVKEYVDE IVLVDEEEIA LAIVSLMERN KLMVEGAGAV VLAALLNGKV KRISGKTVAL
LSGGNIDVKT IAVVVERGLL AAGRYLKLKI ELDDVPGALA RLAAEIAEAR ANISIITHDR
RSESLPIGKT EVLVELETRG PEHIQDVIRH LSKREYLLEV IK