Gene Information |
Locus tag | Cthe_0610 |
Symbol | |
ID | 4808212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 747663 |
End bp | 748730 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106024 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001037038 |
Protein GI | 125973128 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
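The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends with a single stop codon. The record does not state either assumption explicitly, and the function names are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 747663
END_BP = 748730
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the derived values with the values reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported Gene Length and Protein Length. The same check would not apply directly to spliced genes or pseudogenes, which this record reports as "No" for both.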
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.413209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAT ATTGGAGTAA TATAGTTAAA AAAATAAGTC CATATGTTCC GGGAGAGCAG CCAAAGGACA AAAAGTACAT AAAACTGAAT ACCAATGAAA ATCCGTATCC GCCTTCGGAA AAAGTTTTAA AGGCAATTTC AGCGGCGGTA AATGAAAGCC TGAGGTTATA CCCGGATCCG GCTTGTGAAA GCTTGAGGAA TACTTTGGCT AAGTATTACG GGATTAAAGC TTCGGAAGTT TTTGTGGGTA ACGGCTCTGA TGAACTTTTG GCTTTTTCGT TTATGGCGTT TTTCAATCCC GGAGACACTA TTATTTTTCC GGATATAACC TATAGCTTTT ATGAGGTTTA TTCCTCGATG TTTTCCGTAA ACTACAGGTT GATTCCCTTG GATGATGAAT TTAACGTCCC TGTGGAAGAG TTTTTCACCG AAAACGACGG AATAATACTG GCAAACCCGA ATGCCCCGAC CGGCAAAGCT CTTCCGCTTC AAAGCATAAG AAAAATACTT GAAAAGAATG ATGACAAAGT CGTTATTATT GACGAAGCAT ATGTTGATTT CGGAGCCCGG TCATCTGTAC CGTTGATAAA GGAATTTGAA AATCTTTTGG TTATTCAGAC ACTGTCAAAA TCCAGGGCCC TGGCAGGTCT TCGTGTGGGT TTTGCTTTGG GAAGCGAACA GTTGATAGAG GGTTTGGATC GTGTAAAGAA TTCCATAAAC TCATATACTC TGGACAGACT TGCCCTTATT GGTGCGGAAG AAGCCATAAA GGATCATGAG TATTTTTGTG AAATCAGAGA TAAGATAATC AACACCAGGG AGTGGGTTTC AAAGAAGCTG TCTTCCATGG GTTTTAAAGT GATTGAGTCA AAGGCCAACT TCATTTTTAT AAGTCATCCA AAAATAAACG GCAGGCTGTT GTTTGAGAAG TATAAAGAAA ACAATATCCT GGTTCGGCAT TTTAACAGCC CGAGAATTGA CAATTTCCTT CGTGTCAGTA TCGGTTCTGA TGAAGAAATG AATATCTTTT GTGAGAAAAC AAAAGAAATT ATTGAATCGT TAAATTAA
|
Protein sequence | MSKYWSNIVK KISPYVPGEQ PKDKKYIKLN TNENPYPPSE KVLKAISAAV NESLRLYPDP ACESLRNTLA KYYGIKASEV FVGNGSDELL AFSFMAFFNP GDTIIFPDIT YSFYEVYSSM FSVNYRLIPL DDEFNVPVEE FFTENDGIIL ANPNAPTGKA LPLQSIRKIL EKNDDKVVII DEAYVDFGAR SSVPLIKEFE NLLVIQTLSK SRALAGLRVG FALGSEQLIE GLDRVKNSIN SYTLDRLALI GAEEAIKDHE YFCEIRDKII NTREWVSKKL SSMGFKVIES KANFIFISHP KINGRLLFEK YKENNILVRH FNSPRIDNFL RVSIGSDEEM NIFCEKTKEI IESLN
|
| |