Gene Cthe_1332 details

Gene Information

Locus tag: Cthe_1332
Symbol:
ID: 4809472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1621165
End bp: 1622418
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 44%
IMG OID: 640106756
Product: histidyl-tRNA synthetase
Protein accession: YP_001037757
Protein GI: 125973847
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
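
As a quick consistency check on the coordinates and lengths listed above, the following Python sketch recomputes the gene length from the start and end positions and the protein length from the gene length. The function names are illustrative, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1621165, 1622418)
print(length, protein_length(length))  # 1254 417
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.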


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000373388
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAACAC AGGCACCTAA AGGAACAAAG GACATTTTGC CGTCCGAGGT TTACAAATGG 
CACTACATAG AAAAGGAAAT AGCAAAGCTT TGTCATGACT TTGGCTATAA GGAAATCCGC
ATTCCCGTAT TTGAACATAC TGAACTTTTC CAACGGGGAG TCGGGGATAC TACGGATATT
GTACAAAAGG AAATGTACAC TTTTCTGGAT AAAGGGCAGA GAAGCATTAC ATTGAGGCCT
GAGGGAACTG CGGGAGTTGT AAGGAGCTAT ATTGAAAACG GCATGGCATC GCTTCCCCAG
CCTGTAAAAC TGTATTACAA CATTACGGCT TACAGGTATG AAAATGTGCA AAAAGGCCGC
TACAGGGAGT TTCACCAGTT TGGGGTCGAG GCTTTTGGAG CCCCGGGTCC TTCAATCGAT
GTTGAAATAA TAAGCATGGT GAAGTTGTTT TTTGACCGCC TGGGAATAAA AGAAATCAGC
CTCAATATAA ACAGCATAGG TTGTCCCGTC TGCAGGGCTG AATACAATAA GAAGCTGATG
GATTATTTAA GGCCGAACCT AAGTAAATTG TGCGCTACAT GCAATACACG TTTTGAAAGG
AATCCTCTGA GAATTATCGA CTGCAAGGAG GAATCCTGTA AAAAAATTAC GGCTAATGCG
CCGGCTCTTG TGGAGAATTT GTGCGATGAC TGCAAGAACC ATTTTGAAGG ACTTAAAGCC
GGGCTTGAAA ACTTGGGCAT TGATTACAAG ATTGATAAAA ACATAGTAAG AGGCCTTGAT
TATTATACCA AAACCGTATT TGAATTTGTG TCGGACAATA TCGGCGCACA GGGTACGGTA
TGTGGCGGAG GAAGATATGA CGGATTGGTT GAGGCATGCG GCGGAAAGCC CACACCGGGA
ATAGGTTTTG CAATGGGACT GGAAAGGTTA TTAATGGTGA TGGAAAATCA AGGTATTAAA
TTCCCCGAGT CTAAAAAACC GGATATTTTT ATTGCGGCGA TAGGCGACAA AGCCAACAGT
TACGCGGAAA AGATGGTTTA TGAATTGCGG AAGGAAGGTC TTAGCGCAGA GAAAGATTTG
ATGGGCAAAA GCCTTAAGGC GCAGATGAAA TATGCCGACA AGCTGGGAGC AAAGTACAGC
ATTGCCCTCG GCGATGATGA GATTGAGTCG GGCAAGGCGG TACTTAAGAA TATGGAGACG
GGAGAACAAA AAGAAATAAG TCTTGACACC TTAATAAGCA GGTTGAAGAT GTAA
 
Protein sequence
MLTQAPKGTK DILPSEVYKW HYIEKEIAKL CHDFGYKEIR IPVFEHTELF QRGVGDTTDI 
VQKEMYTFLD KGQRSITLRP EGTAGVVRSY IENGMASLPQ PVKLYYNITA YRYENVQKGR
YREFHQFGVE AFGAPGPSID VEIISMVKLF FDRLGIKEIS LNINSIGCPV CRAEYNKKLM
DYLRPNLSKL CATCNTRFER NPLRIIDCKE ESCKKITANA PALVENLCDD CKNHFEGLKA
GLENLGIDYK IDKNIVRGLD YYTKTVFEFV SDNIGAQGTV CGGGRYDGLV EACGGKPTPG
IGFAMGLERL LMVMENQGIK FPESKKPDIF IAAIGDKANS YAEKMVYELR KEGLSAEKDL
MGKSLKAQMK YADKLGAKYS IALGDDEIES GKAVLKNMET GEQKEISLDT LISRLKM
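
The sequences above are printed in space-separated blocks of ten residues, so whitespace has to be removed before measuring length or composition. A minimal Python sketch, with hypothetical helper names, that normalizes such a block and computes a GC fraction:

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as printed in this record.
first_line = "ATGTTAACAC AGGCACCTAA AGGAACAAAG GACATTTTGC CGTCCGAGGT TTACAAATGG "
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.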