Gene Information |
Locus tag | Tmz1t_1839 |
Symbol | |
ID | 7084262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2060650 |
End bp | 2062101 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698862 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002355487 |
Protein GI | 217970253 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
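The coordinate and length fields above are related by simple arithmetic, sketched below as a consistency check. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon; both assumptions agree with the reported lengths but are not stated in the record. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(2060650, 2062101)
print(gene_len)                  # 1452, matches Gene Length
print(protein_length(gene_len))  # 483, matches Protein Length
```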
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.613277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAGA TCCACCTGCT GATCGGCGGC GAACGTCGCC GCGCCACGGA CGGCGCCAGC TTCGAACGCC GCAACCCGCT CGACCACGGC GTCGCCACGC GCGCTCCCGC GGCCACCGCT GCGGACGCGG TGGCCGCGGT CGAGGCCGCC GCCGCGGCCT TCCCCGCGTG GGCCGCCACC GGCCCCGGCG AGCGCCGCGC GCTGCTGATG AAGGCGGCGC ACGCGCTCGA GGCGCGCGCC GAGGCCTTTA CCGCGGCGAT GGCCGCCGAG ACCGGCGCCT CGGCGATCTG GGCCGGCTTC AACGTGCACC TGGCGGCCGG CATGCTGCTC GAGGCGGCGG CGCTGACCAC CCGCATCGAG GGCAGCATCC TGCCCTCGGA CGTGCCCGGC TCGGTGGCGA TGGCGGTGCG CCAGCCCGCT GGCGTGGTGC TCGGCATCGC GCCCTGGAAC GCCCCGGTGA TCCTGGGCGT GCGCGCGATC GCCACCCCGC TCGCCTGCGG CAACACCGTG GTGCTCAAGG GCTCGGAGCT GTGCCCGGCC ACCCACGGCC TGATCATCGA GGCGCTGCAG GACGCCGGGC TGCCGGCGGG CGTGGTGAAC TTCGTCACCA ACGCCCCGGC CGACGCCGGC GCGGTGGTCG AGGCCATGGT CGCGCACCCG GCGCTGCGCC GGGTGAACTT CACCGGCTCG ACCCACGTCG GCCGCCTGAT CGCGCAGACC TGCGCCCGGT ACCTCAAGCC GGCGGTGCTC GAGCTCGGCG GCAAGGCGCC TTTCGTCGTG CTCGACGACG CCGACCTCGA CGCCGCGGTG GCGGCGGCCA CCTTCGGCGC CTTCGCCAAC TCGGGCCAGA TCTGCATGTC CACCGAGCGC ATCGTCGTCG ATGCGGCGGT GGCCGACGAC TTCGTCGCCC GCCTGGCGGC GCGCGCCCGC GCCCTGCCCC TGGGCGACCC GCGCAAGGGC CCGGTGGTGC TCGGCTCGGT GGTCGACCAG CGCACGGTCG AGCGCTGCAA CGCGCTGATC GACGATGCGC TCGCCAAGGG GGCGACGCTG GTGTGCGGCG GCAAGGCCGA CAGCACGCTG ATGCCCGCCA CGCTGCTCGA CCACGTCAGC GCGCAGATGC GCATCTACCA CGAGGAGACC TTCGCCCCGG TGAAGGCGAT CGTGCGCGTC CAGGGCACCG AGGCCGCCAT CGCCTGCGCC AACGACAACC CCTTCGGCCT CGCCGCCGCA GTGTTCGGGC GCGACCTCGC GCGCGCCTGG CAGGTGGCCG GGCGCATCCA GTCGGGCATC TGCCACATCA ACGGGCCCAC GGTGCACGAC GAAGCGCAGA TGCCCTTCGG CGGCGTGAAG GATTCGGGCT GGGGACGCTT CGGCGGCCAG GCCGGGATCG AAGCCTTCAC CGAGCTGCGC TGGATCACCC TGCAGACGAG CCCGCGCCAC TACCCGTTCT GA
|
Protein sequence | MNEIHLLIGG ERRRATDGAS FERRNPLDHG VATRAPAATA ADAVAAVEAA AAAFPAWAAT GPGERRALLM KAAHALEARA EAFTAAMAAE TGASAIWAGF NVHLAAGMLL EAAALTTRIE GSILPSDVPG SVAMAVRQPA GVVLGIAPWN APVILGVRAI ATPLACGNTV VLKGSELCPA THGLIIEALQ DAGLPAGVVN FVTNAPADAG AVVEAMVAHP ALRRVNFTGS THVGRLIAQT CARYLKPAVL ELGGKAPFVV LDDADLDAAV AAATFGAFAN SGQICMSTER IVVDAAVADD FVARLAARAR ALPLGDPRKG PVVLGSVVDQ RTVERCNALI DDALAKGATL VCGGKADSTL MPATLLDHVS AQMRIYHEET FAPVKAIVRV QGTEAAIACA NDNPFGLAAA VFGRDLARAW QVAGRIQSGI CHINGPTVHD EAQMPFGGVK DSGWGRFGGQ AGIEAFTELR WITLQTSPRH YPF
|
| |
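The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the spacing used in the record, computes the G+C fraction, and translates codon by codon until the first stop. Translation table 11 assigns the same amino acids as the standard genetic code (it differs in the start codons it permits), so the standard codon layout is used here. The helper names are hypothetical.

```python
from itertools import product

# Standard NCBI codon ordering: first base varies slowest, order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAACGAGA TCCACCTGCT GATCGGCGGC"))  # MNEIHLLIGG
```

Although the gene lies on the minus strand, the gene sequence in the record begins with ATG, so it is already given in coding orientation and no reverse complement is applied in this sketch.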