Gene Tmz1t_3654 details

Gene Information

Locus tag: Tmz1t_3654
Symbol:
ID: 7873159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4011269
End bp: 4012633
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 70%
IMG OID: 643700595
Product: tryptophan synthase subunit beta
Protein accession: YP_002890624
Protein GI: 237654310
COG category: [R] General function prediction only
COG ID: [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
TIGRFAM ID: [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme
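
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both assumptions agree with the numbers in this record, but the record itself does not state them. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the gene record.
START_BP = 4011269
END_BP = 4012633


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = expected_protein_length(length_bp)
print(length_bp, "bp")  # 1365 bp, matching "Gene Length"
print(length_aa, "aa")  # 454 aa, matching "Protein Length"
```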


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.142399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGCCA CCCGCATCCT GCTCGACGCC GCCGACATCC CGACGCACTG GTACAACGTC 
GCCGCCGATC TGCCGACGCC GCTCGCGCCG CCGCTCGGCC CCGACGGCCA TCCGGCCACG
CCGGAGCAGA TGGGGGTCAT CTTCCCGCCG GCGATCCTCG AACAGGAGAT GAGCACCGAG
CGCTGGATCG CCATCCCGCA GGAGGTGCGC GAGATCTACC GCCTGTGGCG GCCGAGCCCG
CTGTGTCGCG CGGTGCGCCT GGAGCAGGCG CTCGGCACCC CGGCGAAGAT CTTCTTCAAG
TACGAGGGCG TCTCGCCCGC CGGCTCGCAC AAGCCCAACT CCGCGGTGCC GCAGGCCTTC
TACAACAAGC AGGCCGGCAT CACCCGCCTG ACCACCGAGA CCGGCGCCGG GCAGTGGGGC
TCGTCGATCG CCTTCGCCGG GCAGATGTTC GGGCTGGAGG TGCGCATCTA CATGGTCAAG
GTCAGCTACG ACCAGAAGCC CTACCGCCGC CTGATGATGC AGACCTGGGG CGGCGAGGTC
TTCGCCAGCC CCTCGCCGCA CACCGAGACC GGCCGCCGCC TGCTCGCCGA GAACCCGGAC
AACCCCGGCT CGCTCGGCAT CGCCATCTCC GAGGCGGTGG AAGAGGCCGC CGGCCGCGCC
GACACCAACT ACACCCTGGG TTCGGTGCTC AACCACGTGG TGCTGCACCA GAGCATCATC
GGCCTGGAGG CGAAGAAGCA GCTCGACAAG GTCGGCCTCT ATCCCGACGT CGTCATCGGC
CCCTGCGGCG GCGGCTCGAG CTTCGCCGGC ATCGCCTTCC CCTTCCTCGC CGACAAGGCC
GCGGGCGACA AGCGCGCCGC CACGCTGCGC TGCGTGGCGG TGGAGCCGAC CTCCTGCCCG
ACCCTGACCA AGGGCCAGTA CGCCTACGAC TTCGGCGACG CCTCCGGCTT CACCCCGCTG
ATGAAGATGT ATACGCTGGG CCACGATTTC ATGCCGCCCG GAATCCATGC CGGCGGCCTG
CGCTACCATG GCGACTCGCC GCTGGTGTCG AACCTGCTGC ACGCCGGCCT CATCGAGGCC
GCCGCGGTGC CGCAGCTGGC GACCTTCGAG GCGGGGGTGC AGTTCGCACG CGCCGAGGGC
ATCATCCCGG CGCCCGAGTC CTGCCACGCC ATCCGTCAGG CCATCGACGA GGCGCTCGCC
TGCAAGGCGA CCGGCGAGGC GAAGACCATT CTGTTCAACC TGACCGGCCA CGGCCACTTC
GACATGAGCT CGTACGAGCG CTACTTCTCG GGCAAGCTCG AGGACTTCGA CTACCCGGCC
GAGGCGGTGG CAACTTCGCT CGCCCACCTG CCCAAGGTCG GCTGA
 
Protein sequence
MEATRILLDA ADIPTHWYNV AADLPTPLAP PLGPDGHPAT PEQMGVIFPP AILEQEMSTE 
RWIAIPQEVR EIYRLWRPSP LCRAVRLEQA LGTPAKIFFK YEGVSPAGSH KPNSAVPQAF
YNKQAGITRL TTETGAGQWG SSIAFAGQMF GLEVRIYMVK VSYDQKPYRR LMMQTWGGEV
FASPSPHTET GRRLLAENPD NPGSLGIAIS EAVEEAAGRA DTNYTLGSVL NHVVLHQSII
GLEAKKQLDK VGLYPDVVIG PCGGGSSFAG IAFPFLADKA AGDKRAATLR CVAVEPTSCP
TLTKGQYAYD FGDASGFTPL MKMYTLGHDF MPPGIHAGGL RYHGDSPLVS NLLHAGLIEA
AAVPQLATFE AGVQFARAEG IIPAPESCHA IRQAIDEALA CKATGEAKTI LFNLTGHGHF
DMSSYERYFS GKLEDFDYPA EAVATSLAHL PKVG
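
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that in plain Python, then compute GC content and translate the coding sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard genetic code; translation table 11 uses the same amino acid assignments and differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG. The sketch assumes an uppercase sequence containing only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(text.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGGAAGCCA CC")
last_blocks = clean_sequence("CCCAAGGTCG GCTGA")
print(translate(first_blocks))  # MEAT, the first four residues of the protein
print(translate(last_blocks))   # PKVG, the last four residues of the protein
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` gives values that can be compared against the "GC content", "Gene Length" and "Protein sequence" fields of the record.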