Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3784 |
Symbol | |
ID | 8139158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4356992 |
End bp | 4358200 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871403 |
Product | threonine dehydratase |
Protein accession | YP_003023561 |
Protein GI | 253702372 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 109 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGATT ACAACCTGAT CGTCGACGCA GCCCAGAGAC TCAAGAAGAG GGTGCGCCGT ACGGAACTGA TCCACGCTCA GCATTTCAGC GAGCGGCTCG GCCTCCCCCT TTACTTCAAA TGCGAGAACC TGCAGCGTAC CGGCGCCTTC AAGATCCGCG GCGCGCTCAA TTTCATGACG GCGCAGCCGC GCCAGGCTCT GTCGGGTGGC GTGATCACCG CCTCGGCGGG GAACCACGCC CAGGGTGTCG CCTTCTCCGC TGATCTCCTC GGGGTGAAGG CGGTGGTCTA CATGCCGGAG AGCACCCCGC CGCAGAAGGT GTTCGCCACC CGGGACTACG GTGCGGAGGT CGTCCTGGAG GGAAAGAATT TCGACGAGGC TTGCGCCGCC GCCCTGAGGC AGGCCGAGGC TACAGGAGCT CTTTTCGTGC ACCCCTTCAA CGATCCCTTG GTGATGGCTG GACAGGGGAC CATAGGACTT GAACTGCTCG AGGACCTTCC GGATCTGTCC AACGTGCTGG TCCCGATCGG GGGGGGAGGG CTGATAGCCG GCATCGCCCG CGCCATCAAG GAGACCCACC CGCAGGTGAG GGTCATCGGC GTCGAGGCCG CAGCGGCCCC ATCGATGCGC CAGGCGCTGC ACAAAGGGGA GGTCGTCACC GTGCCGATAC GGGCGAGTCT CGCCGACGGT ATCGCGGTCA AGACCGCCGG GAGCAACACC TTTCCCGTGG TAAAGGAATA CGTGGACGAG ATCGTACTGG TGGATGAGGA GGAGATCGCC CTCGCCATCG TCTCGCTCAT GGAGCGGAAC AAGCTGATGG TGGAGGGGGC CGGCGCCGTG GTGCTCGCGG CGCTTTTGAA CGGGAAGGTG AAGCGGATCT CCGGGAAGAC CGTGGCCCTT CTTTCAGGCG GGAACATCGA CGTGAAGACC ATAGCGGTGG TCGTGGAGCG GGGGCTTTTG GCCGCGGGGC GCTACCTGAA GCTGAAGATC GAGTTGGACG ACGTTCCCGG CGCGCTGGCG CGGCTTGCCG CCGAAATCGC CGAGGCGCGG GCCAACATAT CCATCATCAC CCATGACCGC CGCTCCGAGT CGCTTCCCAT CGGCAAGACC GAGGTGCTGG TCGAATTGGA GACCAGGGGT CCTGAGCACA TCCAGGACGT CATCAGGCAC CTGAGCAAGC GGGAGTACCT GCTGGAAGTC ATCAAATAA
|
Protein sequence | MLDYNLIVDA AQRLKKRVRR TELIHAQHFS ERLGLPLYFK CENLQRTGAF KIRGALNFMT AQPRQALSGG VITASAGNHA QGVAFSADLL GVKAVVYMPE STPPQKVFAT RDYGAEVVLE GKNFDEACAA ALRQAEATGA LFVHPFNDPL VMAGQGTIGL ELLEDLPDLS NVLVPIGGGG LIAGIARAIK ETHPQVRVIG VEAAAAPSMR QALHKGEVVT VPIRASLADG IAVKTAGSNT FPVVKEYVDE IVLVDEEEIA LAIVSLMERN KLMVEGAGAV VLAALLNGKV KRISGKTVAL LSGGNIDVKT IAVVVERGLL AAGRYLKLKI ELDDVPGALA RLAAEIAEAR ANISIITHDR RSESLPIGKT EVLVELETRG PEHIQDVIRH LSKREYLLEV IK
|
| |