Gene Information |
Locus tag | Tmz1t_2578 |
Symbol | |
ID | 7873319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2782699 |
End bp | 2784006 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699501 |
Product | homoserine dehydrogenase |
Protein accession | YP_002889557 |
Protein GI | 237653243 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
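The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions match the reported 1308 bp and 435 aa but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2782699
END_BP = 2784006


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1308
print(protein_length(1308))           # 435
```

Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.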
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCTA TCAACGTTGG CCTCCTTGGC ATCGGTACCG TCGGTGGCGG CACCTACACC GTCCTCAAAC GCAACGCCGA GGAGATCACC CGGCGCGCCG GCCGTCCGAT CCGCATCGTC ACCGTGGCCG ACAAGAACCT CGAGCTCGCG CGCAAGGTGA CCGGCGGCGA GGTCAAGCTC ACCGACGACG CCTTCTCGGT GGTCACCGAT CCGGGCATCG ACATCGTCGT CGAGCTGATC GGCGGCTACG GCGTGGCCCG GGAGCTGGTG CTGAAGGCGA TCGAGAACGG CAAGCACGTG GTCACCGCCA ACAAGGCGCT GCTCGCGGTG CATGGCAACG AGATCTTCGC CGCGGCGCAG AAGAAGGGCG TGATGGTGGC CTTCGAGGCC GCGGTCGCGG GCGGCATCCC GATCATCAAG GCGCTGCGCG AAGGCCTCAC CGCCAACCGC ATCGAGTGGC TGGCCGGCAT CATCAACGGC ACCACCAACT TCATCCTGTC CGAGATGCGC GACAAGGGCC TGCCCTTCGC CGAGGTGCTC AAGGAAGCGC AGGCGCTCGG CTACGCCGAG GCCGATCCGA CCTTCGACGT CGAGGGGGTC GACGCCGCGC ACAAGGCGAC GATCATGAGC GCGATCGCCT TCGGCATCCC GATGCAGTTC GACAAGGCCT ACATCGAGGG CATCAGCAAG CTCGACTCGG TGGACATCGG CTACGCCGAA CAGCTCGGCT ATCGCATCAA GCTGCTCGGC ATCGCCCGCC GCCGCGAGAA CGGCGTCGAG CTGCGCGTGC ACCCCACGCT GATTCCGGCG AAGCGCCTCA TCGCCAACGT CGAGGGTGCG ATGAATGCGG TGCTGGTGCA GGGCGACGCC GTCGGCGCGA CGCTCTACTA CGGCAAGGGC GCGGGCGCCG AGCCCACCGC TTCGGCGGTG ATCGCCGACC TGGTCGACGT CACCCGCCTG CACACCTCCG ACCCCGAGCA CCGCGTGCCC CACCTGGCCT TCCAGCCCGA CCAGGTGCAC GACGTTCCGG TGCTGCCGAT CGAGGAGGTC GTGACCTCCT ACTACCTGCG CATGCGGGTC GAGGACAAGC CGGGCGTGCT CGCCGACATC ACCCGCATCC TGGCCGACAG CGGGATCTCG ATCGAGGCGC TGATCCAGAA GCAGGCGGCC GAGGGCGAGG CGCACACCGA CATCATCATG CTGACCCACC AGACCGCGGA AAAGAACGCC AATGCCGCCA TCGTGCGCAT CGAGGCGCTG CCCGTAGTGC AGGGCAAGGT CGTGAAGCTG CGTATGGAAG CTCTCTGA
|
Protein sequence | MKPINVGLLG IGTVGGGTYT VLKRNAEEIT RRAGRPIRIV TVADKNLELA RKVTGGEVKL TDDAFSVVTD PGIDIVVELI GGYGVARELV LKAIENGKHV VTANKALLAV HGNEIFAAAQ KKGVMVAFEA AVAGGIPIIK ALREGLTANR IEWLAGIING TTNFILSEMR DKGLPFAEVL KEAQALGYAE ADPTFDVEGV DAAHKATIMS AIAFGIPMQF DKAYIEGISK LDSVDIGYAE QLGYRIKLLG IARRRENGVE LRVHPTLIPA KRLIANVEGA MNAVLVQGDA VGATLYYGKG AGAEPTASAV IADLVDVTRL HTSDPEHRVP HLAFQPDQVH DVPVLPIEEV VTSYYLRMRV EDKPGVLADI TRILADSGIS IEALIQKQAA EGEAHTDIIM LTHQTAEKNA NAAIVRIEAL PVVQGKVVKL RMEAL
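The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be ungrouped, translated, and checked for GC content. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code and differs only in which start codons are permitted; since this gene begins with ATG, no special start handling is needed. The helper names `ungroup`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Sense codon assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def ungroup(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and last two blocks of the gene sequence in this record.
print(translate(ungroup("ATGAAACCTA TCAACGTTGG")))  # MKPINV
print(translate(ungroup("CGTATGGAAG CTCTCTGA")))    # RMEAL
```

Passing the full gene sequence through `ungroup` and then `gc_fraction` or `translate` gives values that can be compared against the GC content and protein sequence fields of the record.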