Gene Tmz1t_2578 details


Gene Information

Locus tag: Tmz1t_2578
Symbol:
ID: 7873319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2782699
End bp: 2784006
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 67%
IMG OID: 643699501
Product: homoserine dehydrogenase
Protein accession: YP_002889557
Protein GI: 237653243
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
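
The coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2782699
end_bp = 2784006

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the values come out to 1308 bp and 435 aa, in agreement with the Gene Length and Protein Length fields.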


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCTA TCAACGTTGG CCTCCTTGGC ATCGGTACCG TCGGTGGCGG CACCTACACC 
GTCCTCAAAC GCAACGCCGA GGAGATCACC CGGCGCGCCG GCCGTCCGAT CCGCATCGTC
ACCGTGGCCG ACAAGAACCT CGAGCTCGCG CGCAAGGTGA CCGGCGGCGA GGTCAAGCTC
ACCGACGACG CCTTCTCGGT GGTCACCGAT CCGGGCATCG ACATCGTCGT CGAGCTGATC
GGCGGCTACG GCGTGGCCCG GGAGCTGGTG CTGAAGGCGA TCGAGAACGG CAAGCACGTG
GTCACCGCCA ACAAGGCGCT GCTCGCGGTG CATGGCAACG AGATCTTCGC CGCGGCGCAG
AAGAAGGGCG TGATGGTGGC CTTCGAGGCC GCGGTCGCGG GCGGCATCCC GATCATCAAG
GCGCTGCGCG AAGGCCTCAC CGCCAACCGC ATCGAGTGGC TGGCCGGCAT CATCAACGGC
ACCACCAACT TCATCCTGTC CGAGATGCGC GACAAGGGCC TGCCCTTCGC CGAGGTGCTC
AAGGAAGCGC AGGCGCTCGG CTACGCCGAG GCCGATCCGA CCTTCGACGT CGAGGGGGTC
GACGCCGCGC ACAAGGCGAC GATCATGAGC GCGATCGCCT TCGGCATCCC GATGCAGTTC
GACAAGGCCT ACATCGAGGG CATCAGCAAG CTCGACTCGG TGGACATCGG CTACGCCGAA
CAGCTCGGCT ATCGCATCAA GCTGCTCGGC ATCGCCCGCC GCCGCGAGAA CGGCGTCGAG
CTGCGCGTGC ACCCCACGCT GATTCCGGCG AAGCGCCTCA TCGCCAACGT CGAGGGTGCG
ATGAATGCGG TGCTGGTGCA GGGCGACGCC GTCGGCGCGA CGCTCTACTA CGGCAAGGGC
GCGGGCGCCG AGCCCACCGC TTCGGCGGTG ATCGCCGACC TGGTCGACGT CACCCGCCTG
CACACCTCCG ACCCCGAGCA CCGCGTGCCC CACCTGGCCT TCCAGCCCGA CCAGGTGCAC
GACGTTCCGG TGCTGCCGAT CGAGGAGGTC GTGACCTCCT ACTACCTGCG CATGCGGGTC
GAGGACAAGC CGGGCGTGCT CGCCGACATC ACCCGCATCC TGGCCGACAG CGGGATCTCG
ATCGAGGCGC TGATCCAGAA GCAGGCGGCC GAGGGCGAGG CGCACACCGA CATCATCATG
CTGACCCACC AGACCGCGGA AAAGAACGCC AATGCCGCCA TCGTGCGCAT CGAGGCGCTG
CCCGTAGTGC AGGGCAAGGT CGTGAAGCTG CGTATGGAAG CTCTCTGA
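
The gene sequence is listed in blocks of ten bases, so spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first row of the listing rather than the full sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, used here as a small example input.
first_row = "ATGAAACCTA TCAACGTTGG CCTCCTTGGC ATCGGTACCG TCGGTGGCGG CACCTACACC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the complete listing gives a value that can be compared with the GC content field of the record.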
 
Protein sequence
MKPINVGLLG IGTVGGGTYT VLKRNAEEIT RRAGRPIRIV TVADKNLELA RKVTGGEVKL 
TDDAFSVVTD PGIDIVVELI GGYGVARELV LKAIENGKHV VTANKALLAV HGNEIFAAAQ
KKGVMVAFEA AVAGGIPIIK ALREGLTANR IEWLAGIING TTNFILSEMR DKGLPFAEVL
KEAQALGYAE ADPTFDVEGV DAAHKATIMS AIAFGIPMQF DKAYIEGISK LDSVDIGYAE
QLGYRIKLLG IARRRENGVE LRVHPTLIPA KRLIANVEGA MNAVLVQGDA VGATLYYGKG
AGAEPTASAV IADLVDVTRL HTSDPEHRVP HLAFQPDQVH DVPVLPIEEV VTSYYLRMRV
EDKPGVLADI TRILADSGIS IEALIQKQAA EGEAHTDIIM LTHQTAEKNA NAAIVRIEAL
PVVQGKVVKL RMEAL
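
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the forward-strand sequence as listed and the amino acid assignments of NCBI translation table 11, which are the same as in the standard code. It does not handle the alternative start codons that table 11 allows, and the function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11;
# the assignments are identical to the standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listing (both start in frame).
first_row = "ATGAAACCTA TCAACGTTGG CCTCCTTGGC ATCGGTACCG TCGGTGGCGG CACCTACACC"
last_row = "CCCGTAGTGC AGGGCAAGGT CGTGAAGCTG CGTATGGAAG CTCTCTGA"
print(translate(first_row))  # MKPINVGLLGIGTVGGGTYT
print(translate(last_row))   # PVVQGKVVKLRMEAL
```

The two outputs match the first twenty and last fifteen residues of the protein sequence listed above.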