Gene Tmz1t_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3038 
Symbol 
ID7874508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3288901 
End bp3289998 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content70% 
IMG OID643699961 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_002890013 
Protein GI237653699 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.141855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTG CCAGTCTTGC CCCCGACTAC ATCCGCGCGA TCATGGCCTA CCAGCCGGGC 
AAGCCGATCT CGGAGCTCGC CCGCGAGATG GGAATCCCCG AGGAGAGCAT CGTCAAGCTC
GCCTCCAACG AGAACCCGCT CGGCATGAGC GCGCGCGCGC GCGATGCCGC GATCGCCGCG
ATCGGCGAGG TCTCGCGCTA TCCGGACGGC GGCGCGTTCG CGCTCAAGAA GGCCTTGTGC
GAACGCTTCG GCGTCAAGCC CGAGCAGCTC GTGATCGGCA ACGGCTCGAA CGACATCCTC
GAGCTGGCCT CGCAGGCCTT CCTCGCGCCG GGGCTGTCGG CGGTGTATTC GCGTCACGCC
TTCGCGGTCT ATCCGCTCGC CACCAACGCG CGCGGCGCGC GCGGCATCGA GGTGGCGGCG
AAGAACTTCG GCCACGACCT CGACGCCATG GCGGCGGCGA TCGAGCCGCA GACCCGTGTC
GTCTTCATCG CCAACCCGAA CAACCCCACC GGCACCTTCG TCCCGGGCGC CGAGCTCGAG
GCCTTCCTCG CCAAGGTCCC GCGCCACGTG CTGGTGGTGC TCGACGAGGC CTACACCGAA
TACCTCGCCC CCGAGCAGCG CTACGACTCG ATCGCCTGGC TGGCGCGCTT CCCCAACCTG
CTGGTGTCGC GCACCTTCTC CAAGGCCTAC GGCCTGGCCG GCCTGCGCGT GGGCTACGGC
ATCGCCCACC CCGAGGTGGC CGACCTGATG AACCGGGTGC GCCAGCCCTT CAACGTGTCC
TCGGTGGCGC TCGCCGCGGC CGAGGCGGCG CTCGGCGACG ACGAATTCCT CGCCCGCAGC
GCCGAGCTCA ACCGCCGCGG CATGACGCAG CTCGTCGCCG CTTTCCGCGA ACTCGGCCTC
GAATGGATCC CCTCGGCCGG CAACTTCGTC ACCTTCAAGG TGGGCGACGC GATCGGGGTG
AACCAGGCGC TGCTGCGCCA GGGCGTGATC GTGCGTCCGA TCGCCGCCTA CGGCATGCCG
CACTGGCTGC GCGTGTCGAT CGGTCTGCCC GAGGAAAACG CGCGCTTCAT CGAGGCGCTG
CGCCAGGCGC TCGCCTGA
 
Protein sequence
MSIASLAPDY IRAIMAYQPG KPISELAREM GIPEESIVKL ASNENPLGMS ARARDAAIAA 
IGEVSRYPDG GAFALKKALC ERFGVKPEQL VIGNGSNDIL ELASQAFLAP GLSAVYSRHA
FAVYPLATNA RGARGIEVAA KNFGHDLDAM AAAIEPQTRV VFIANPNNPT GTFVPGAELE
AFLAKVPRHV LVVLDEAYTE YLAPEQRYDS IAWLARFPNL LVSRTFSKAY GLAGLRVGYG
IAHPEVADLM NRVRQPFNVS SVALAAAEAA LGDDEFLARS AELNRRGMTQ LVAAFRELGL
EWIPSAGNFV TFKVGDAIGV NQALLRQGVI VRPIAAYGMP HWLRVSIGLP EENARFIEAL
RQALA