Gene Information |
Locus tag | Tmz1t_3038 |
Symbol | |
ID | 7874508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3288901 |
End bp | 3289998 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699961 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_002890013 |
Protein GI | 237653699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
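
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3288901
end_bp = 3289998
reported_gene_length = 1098    # bp
reported_protein_length = 365  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (the record says "No") and ends with
# one stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder == 0)                             # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the record is internally consistent: 1098 bp gives 366 codons, of which 365 encode amino acids.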
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.141855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTG CCAGTCTTGC CCCCGACTAC ATCCGCGCGA TCATGGCCTA CCAGCCGGGC AAGCCGATCT CGGAGCTCGC CCGCGAGATG GGAATCCCCG AGGAGAGCAT CGTCAAGCTC GCCTCCAACG AGAACCCGCT CGGCATGAGC GCGCGCGCGC GCGATGCCGC GATCGCCGCG ATCGGCGAGG TCTCGCGCTA TCCGGACGGC GGCGCGTTCG CGCTCAAGAA GGCCTTGTGC GAACGCTTCG GCGTCAAGCC CGAGCAGCTC GTGATCGGCA ACGGCTCGAA CGACATCCTC GAGCTGGCCT CGCAGGCCTT CCTCGCGCCG GGGCTGTCGG CGGTGTATTC GCGTCACGCC TTCGCGGTCT ATCCGCTCGC CACCAACGCG CGCGGCGCGC GCGGCATCGA GGTGGCGGCG AAGAACTTCG GCCACGACCT CGACGCCATG GCGGCGGCGA TCGAGCCGCA GACCCGTGTC GTCTTCATCG CCAACCCGAA CAACCCCACC GGCACCTTCG TCCCGGGCGC CGAGCTCGAG GCCTTCCTCG CCAAGGTCCC GCGCCACGTG CTGGTGGTGC TCGACGAGGC CTACACCGAA TACCTCGCCC CCGAGCAGCG CTACGACTCG ATCGCCTGGC TGGCGCGCTT CCCCAACCTG CTGGTGTCGC GCACCTTCTC CAAGGCCTAC GGCCTGGCCG GCCTGCGCGT GGGCTACGGC ATCGCCCACC CCGAGGTGGC CGACCTGATG AACCGGGTGC GCCAGCCCTT CAACGTGTCC TCGGTGGCGC TCGCCGCGGC CGAGGCGGCG CTCGGCGACG ACGAATTCCT CGCCCGCAGC GCCGAGCTCA ACCGCCGCGG CATGACGCAG CTCGTCGCCG CTTTCCGCGA ACTCGGCCTC GAATGGATCC CCTCGGCCGG CAACTTCGTC ACCTTCAAGG TGGGCGACGC GATCGGGGTG AACCAGGCGC TGCTGCGCCA GGGCGTGATC GTGCGTCCGA TCGCCGCCTA CGGCATGCCG CACTGGCTGC GCGTGTCGAT CGGTCTGCCC GAGGAAAACG CGCGCTTCAT CGAGGCGCTG CGCCAGGCGC TCGCCTGA
|
Protein sequence | MSIASLAPDY IRAIMAYQPG KPISELAREM GIPEESIVKL ASNENPLGMS ARARDAAIAA IGEVSRYPDG GAFALKKALC ERFGVKPEQL VIGNGSNDIL ELASQAFLAP GLSAVYSRHA FAVYPLATNA RGARGIEVAA KNFGHDLDAM AAAIEPQTRV VFIANPNNPT GTFVPGAELE AFLAKVPRHV LVVLDEAYTE YLAPEQRYDS IAWLARFPNL LVSRTFSKAY GLAGLRVGYG IAHPEVADLM NRVRQPFNVS SVALAAAEAA LGDDEFLARS AELNRRGMTQ LVAAFRELGL EWIPSAGNFV TFKVGDAIGV NQALLRQGVI VRPIAAYGMP HWLRVSIGLP EENARFIEAL RQALA
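
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal illustration that uses only the first 30 bases of the gene sequence and the first 10 residues of the protein sequence from this record. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene is on the minus strand) and that the amino acid assignments of translation table 11 match the standard code. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. Translation table 11 uses the
# same amino acid assignments as the standard code; it differs only in the
# set of permitted start codons, which does not matter here because the
# gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence and the first
# 10-residue group of the protein sequence, as listed in the record.
gene_prefix = clean("ATGAGCATTG CCAGTCTTGC CCCCGACTAC")
protein_prefix = clean("MSIASLAPDY")

print(translate(gene_prefix))                    # MSIASLAPDY
print(translate(gene_prefix) == protein_prefix)  # True
```

Passing the full cleaned gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.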