Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0515 |
Symbol | |
ID | 3831817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 534374 |
End bp | 535534 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637828449 |
Product | histidinol phosphate aminotransferase |
Protein accession | YP_429388 |
Protein GI | 83589379 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.839874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATGGTGA GAAAATCTAC CGCCTCTAAC CGGCGGTTGC AGGACAAAGG AGATGAAGAA CCAGTGGCTA CCGTTCCCAA GTGTCGTGAA GCCATCCTGA GCATTAAACC ATACGTTCCG GGGAAACCCA TTGAAGAGGT CCAGCGTGAG CTGGGAGTTA AGGATGTCAT TAAGCTGGCT TCCAATGAAA ATCCGCTGGG ACCTTCCCCG GATGCCGTCC AGGCCTTGCA AGAGGCCAGC GATAGGATCT TCCTCTACCC TGACGGCAAC TGCTATTATC TTAAAGAAGC CCTGGCGGCC AAACTGGGGG TAAAACAGGA GAACCTGATT ATCGGCAATG GTACTGATGA GATCCTGAAG ATGCTGGCCG AGACCTACAT TAACCCTGGC GATGAAATCG TCGTCGCCGA CCCGACCTTT TCCGAATACG AATTCGCCGC CCAGGTAATG GGGGGACGGG CCATAAAGGT TCCCACTCGT AATTTCCGCC ACGACCTGGC AGCCATGGCG GCAGCTATTA CTCCCAGGAC GCGGCTGGTC TTCGTCTGCA ACCCCAACAA TCCCACCGGG ACGATTGTCG GCCAGGCCGC CCTGGACGGT TTCTTAAAGC AGGTGCCTCC TTCCGTACTA GTCGTCCTGG ACGAGGCCTA TTCCGATTAT GTTACCGCCG AACACTATCC CAACAGCCTG GCCTATGTCC GGGCCGGCAG GGCCAATGTC ATAATCCTGC GCACCTTCTC TAAAATTTAC GGCCTGGCCG GGTTGCGGGT CGGTTACGGG GTCGCTGTTC CCGAGATTAT CAGGGACTTG AACCGGGTAC GGGAGCCTTT TAACGTCAAC CTTCTCGCCC AGGTTGCCGC CGTTGCTGCC CTCAAGGACG AAGCCCACGT AGGTAAGAGC AGGGAAGTTA ACAGCGAGGG CAAGGACTAT CTCTACAGCC AATTTGAATC CCTGGGACTA AAGTACGTTC CCACCGAAGC CAATTTCATC TTTGTGGATA TCCAGCGGGA CAGCCGGGAG GTTTTCCGTC AACTGTTGCA AAAGGGAGTT ATCGTCCGGA CTGGCGATAT CTTTGGCTAT GATACCTTCC TGCGGGTAAC CATTGGTACC CGGCGCCAGA ACGAAACCTT CATTCGCGCT CTGAGGGAAA TTTTGGCTTA G
|
Protein sequence | MMVRKSTASN RRLQDKGDEE PVATVPKCRE AILSIKPYVP GKPIEEVQRE LGVKDVIKLA SNENPLGPSP DAVQALQEAS DRIFLYPDGN CYYLKEALAA KLGVKQENLI IGNGTDEILK MLAETYINPG DEIVVADPTF SEYEFAAQVM GGRAIKVPTR NFRHDLAAMA AAITPRTRLV FVCNPNNPTG TIVGQAALDG FLKQVPPSVL VVLDEAYSDY VTAEHYPNSL AYVRAGRANV IILRTFSKIY GLAGLRVGYG VAVPEIIRDL NRVREPFNVN LLAQVAAVAA LKDEAHVGKS REVNSEGKDY LYSQFESLGL KYVPTEANFI FVDIQRDSRE VFRQLLQKGV IVRTGDIFGY DTFLRVTIGT RRQNETFIRA LREILA
|
| |