Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2883 |
Symbol | |
ID | 4809090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3409985 |
End bp | 3411061 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108302 |
Product | histidinol phosphate aminotransferase |
Protein accession | YP_001039274 |
Protein GI | 125975364 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.017474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAGG ATTTGGTAAG ACCTGAGCTG CAAAAATTAG TACCGTATGT TTCCCATCAG GTCCCGTACA GAATAAAGCT TGACGCAAAT GAAAGTCCCT TCGAGCTTCC TGAAAGTATC AGGAAGAAAC TGGCGGACTA CTTTTTAAAA GGGCCGGGTT TAAACATTTA TCCAGATAAT GAGTCTGTGG AGCTTAGAAA GACCATCGCA AAATACTGGA ATGTTGATGC GGATGAGGTT ATCGTGGGTA CCGGTTCCAA CCAGCTTATT CAGCTGATAA TAACAGTGTT TGTGGGGAAA GGGGAGAAGG TGCTGTATCC CTGGCCCACG TTTTCGATGT ATAAAATAAA CACTCTGATA GCGGGCGGTG AACCGGTGGC ATTTCCCCTT GACAAGGAAA AAGACTTTGT TTTGGATACC GACAAGTTTA TTGAGGCTGT AAAAACGGAA AATGCCAAGG TGGTATTCCT TTGCAATCCC AACAATCCGA CAGGAGGGCT TGTTCCTCTG GAGCATATTG AAAAAATAGT GAAAGAGTGC AAAAGTTCAA TTATTGTTGT TGATGAAGCG TATGCCGAAT TTTGCCCGCA GAGTGTGATA CCTCTTGTAA AGAAGTATGA AAACCTTGTG GTGCTGCGCA CTTTCTCCAA AGCGTATCTT CTTGCCGGAG CAAGGTGCGG ATATTCCATA AGCGGAATTG AGATTGCAAA TGAGATAAAC AAAGTAAGAC CGACATACAA TGTAAGTTCA TTGACCCAGC TTATAGCAAA AATGGTGTTT GAGGAACAGG AAGAGATGCA GAAAATGATA CGGTATTTGA TTGAGCAAAG AGGAGAGCTG GAGAAATCTT TGAAAAAGAT AAAGGATGTC TGCATTTATC CTTCCCATGC AAATTATATA CTTGTCAAAG TGCCTGAAGC TGAGATGATA AGCAAGGAAC TTCAAAAGCG GGGAATACTC ATCCGAAGCT TTCCCAATGA TCCGGTGCTT TACGACTGTA TCAGGATAAC TGTAGGAACC AAAGAACAAA ATGATATTTT CTTAGAGGAG TTTTCCGACA TAATTAATAA TTTTTAA
|
Protein sequence | MIKDLVRPEL QKLVPYVSHQ VPYRIKLDAN ESPFELPESI RKKLADYFLK GPGLNIYPDN ESVELRKTIA KYWNVDADEV IVGTGSNQLI QLIITVFVGK GEKVLYPWPT FSMYKINTLI AGGEPVAFPL DKEKDFVLDT DKFIEAVKTE NAKVVFLCNP NNPTGGLVPL EHIEKIVKEC KSSIIVVDEA YAEFCPQSVI PLVKKYENLV VLRTFSKAYL LAGARCGYSI SGIEIANEIN KVRPTYNVSS LTQLIAKMVF EEQEEMQKMI RYLIEQRGEL EKSLKKIKDV CIYPSHANYI LVKVPEAEMI SKELQKRGIL IRSFPNDPVL YDCIRITVGT KEQNDIFLEE FSDIINNF
|
| |