Gene Tmz1t_1508 details


Gene Information

Locus tag: Tmz1t_1508
Symbol:
ID: 7083590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1684809
End bp: 1686014
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID: 643698525
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_002355162
Protein GI: 217969928
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02430] beta-ketoadipyl CoA thiolase
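
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1684809, 1686014)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Under these assumptions the computed values agree with the listed Gene Length (1206 bp) and Protein Length (401 aa).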


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0554104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAAG CATTCATCTG CGACGCCATC CGCACCCCCA TCGGCCGCTA CGGCGGCTCC 
CTGGCCTCCG TGCGTGCCGA CGACCTCGGC GCCGTGCCGC TCAAGGCCCT GATGACCCGC
AACCCCCAGG TCGACTGGAC CGCGGTCGAA GACATCATCT ACGGCTGCGC CAACCAGGCC
GGCGAGGACA ACCGCAACGT CGCGCGCATG TCCGGCCTGC TCGCCGGGCT GCCGATCGAG
GTGCCCGGCA CCACGGTCAA CCGCCTGTGC GGCTCGGGCA TGGACGCCAT CGGCCTGGCC
GCGCGCTCGA TCAAGTCGGG CGAGACCGAG CTGATGATCG CCGGCGGCGT CGAGAGCATG
TCGCGCGCGC CCTTCGTGAT GGGCAAGGCC GAGTCGGCCT TCTCGCGCAG CGCCGCGATC
TACGACACCA CCATCGGCTG GCGCTTCATC AACCCGCTGA TGAAGAAGCT GTACGAGACG
CACTCGATGC CGCAGACCGC GGACAACGTC GCCGCCGACT TCGACATCTC GCGCGCCGAC
CAGGACGCCT TCGCGCTGCG CTCGCAGCAG CGCTGGGCCG CCGCCCACGC CGCCGGTCGC
TTCAAGGACG AGCTGGTGCC GGTGGTGATC CCGCGCAAGA AGGGCGACCC GATCGTCTTC
GACACCGACG AGCATCCGCG CCCCGAAACC ACGCTGGAGA TGCTGGCCAA GCTCAAGGGC
GTCAATGGCC CCGAGCTCAG CGTCACCGCC GGCAACGCCT CGGGCGTCAA TGACGGCGCC
TGCGCGCTGC TGCTGGCCTC CGACGCCGCC GCCGCGAAGC ATGGCCTGAC CCCGCGCGCC
CGCGTCGTCG CCATGGCCAC CGCCGGCGTG GCGCCGCGCA TCATGGGCTT CGGCCCCGCA
CCCGCGGTGC GCAAGGTGCT CGCCAAGGCG GGCCTCACGC TCGATCAGAT GGATGTGATC
GAACTCAACG AGGCTTTCGC GGCACAAGGC CTCGCCGTGC TGCGCGACCT CGGCCTCGCC
GACGACGAAG AACGCGTGAA CCCCAATGGC GGCGCCATCG CCCTGGGCCA CCCGCTGGGC
ATGAGCGGCG CCCGCCTGGT CACCACGGCG GCCTACGAGC TGCAGCGCCG CAATGGCCGC
TACGCGCTGT GCACGATGTG CATCGGCGTC GGCCAGGGCA TCGCGATGAT CATCGAGCGC
GTCTGA
 
Protein sequence
MTQAFICDAI RTPIGRYGGS LASVRADDLG AVPLKALMTR NPQVDWTAVE DIIYGCANQA 
GEDNRNVARM SGLLAGLPIE VPGTTVNRLC GSGMDAIGLA ARSIKSGETE LMIAGGVESM
SRAPFVMGKA ESAFSRSAAI YDTTIGWRFI NPLMKKLYET HSMPQTADNV AADFDISRAD
QDAFALRSQQ RWAAAHAAGR FKDELVPVVI PRKKGDPIVF DTDEHPRPET TLEMLAKLKG
VNGPELSVTA GNASGVNDGA CALLLASDAA AAKHGLTPRA RVVAMATAGV APRIMGFGPA
PAVRKVLAKA GLTLDQMDVI ELNEAFAAQG LAVLRDLGLA DDEERVNPNG GAIALGHPLG
MSGARLVTTA AYELQRRNGR YALCTMCIGV GQGIAMIIER V
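
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be recomputed from the gene sequence. The helper names are hypothetical. The codon table is the standard one in NCBI's TCAG ordering; translation table 11 uses the same amino acid assignments and differs only in the permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACTCAAG CATTCATCTG CGACGCCATC"
print(translate(first_blocks))  # MTQAFICDAI
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check, and `gc_percent` on the same input can be compared with the listed GC content.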