Gene Cthe_0610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0610 
Symbol 
ID4808212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp747663 
End bp748730 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content39% 
IMG OID640106024 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001037038 
Protein GI125973128 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.413209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAT ATTGGAGTAA TATAGTTAAA AAAATAAGTC CATATGTTCC GGGAGAGCAG 
CCAAAGGACA AAAAGTACAT AAAACTGAAT ACCAATGAAA ATCCGTATCC GCCTTCGGAA
AAAGTTTTAA AGGCAATTTC AGCGGCGGTA AATGAAAGCC TGAGGTTATA CCCGGATCCG
GCTTGTGAAA GCTTGAGGAA TACTTTGGCT AAGTATTACG GGATTAAAGC TTCGGAAGTT
TTTGTGGGTA ACGGCTCTGA TGAACTTTTG GCTTTTTCGT TTATGGCGTT TTTCAATCCC
GGAGACACTA TTATTTTTCC GGATATAACC TATAGCTTTT ATGAGGTTTA TTCCTCGATG
TTTTCCGTAA ACTACAGGTT GATTCCCTTG GATGATGAAT TTAACGTCCC TGTGGAAGAG
TTTTTCACCG AAAACGACGG AATAATACTG GCAAACCCGA ATGCCCCGAC CGGCAAAGCT
CTTCCGCTTC AAAGCATAAG AAAAATACTT GAAAAGAATG ATGACAAAGT CGTTATTATT
GACGAAGCAT ATGTTGATTT CGGAGCCCGG TCATCTGTAC CGTTGATAAA GGAATTTGAA
AATCTTTTGG TTATTCAGAC ACTGTCAAAA TCCAGGGCCC TGGCAGGTCT TCGTGTGGGT
TTTGCTTTGG GAAGCGAACA GTTGATAGAG GGTTTGGATC GTGTAAAGAA TTCCATAAAC
TCATATACTC TGGACAGACT TGCCCTTATT GGTGCGGAAG AAGCCATAAA GGATCATGAG
TATTTTTGTG AAATCAGAGA TAAGATAATC AACACCAGGG AGTGGGTTTC AAAGAAGCTG
TCTTCCATGG GTTTTAAAGT GATTGAGTCA AAGGCCAACT TCATTTTTAT AAGTCATCCA
AAAATAAACG GCAGGCTGTT GTTTGAGAAG TATAAAGAAA ACAATATCCT GGTTCGGCAT
TTTAACAGCC CGAGAATTGA CAATTTCCTT CGTGTCAGTA TCGGTTCTGA TGAAGAAATG
AATATCTTTT GTGAGAAAAC AAAAGAAATT ATTGAATCGT TAAATTAA
 
Protein sequence
MSKYWSNIVK KISPYVPGEQ PKDKKYIKLN TNENPYPPSE KVLKAISAAV NESLRLYPDP 
ACESLRNTLA KYYGIKASEV FVGNGSDELL AFSFMAFFNP GDTIIFPDIT YSFYEVYSSM
FSVNYRLIPL DDEFNVPVEE FFTENDGIIL ANPNAPTGKA LPLQSIRKIL EKNDDKVVII
DEAYVDFGAR SSVPLIKEFE NLLVIQTLSK SRALAGLRVG FALGSEQLIE GLDRVKNSIN
SYTLDRLALI GAEEAIKDHE YFCEIRDKII NTREWVSKKL SSMGFKVIES KANFIFISHP
KINGRLLFEK YKENNILVRH FNSPRIDNFL RVSIGSDEEM NIFCEKTKEI IESLN