Gene Cthe_0648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0648 
Symbol 
ID4808177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp801866 
End bp803506 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content41% 
IMG OID640106062 
Productglutamyl-tRNA synthetase 
Protein accessionYP_001037076 
Protein GI125973166 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0008] Glutamyl- and glutaminyl-tRNA synthetases 
TIGRFAM ID[TIGR00464] glutamyl-tRNA synthetase, bacterial family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTATA AAAAGTTGGC GGACATGCTT TTCCCGCACA TAACCAAATC GGTTTCCTAT 
TACGAAGACG TTGTATTTCC TGCCAGAAAT TTAAGCCCCG GAGCAAAAGT TACGCGGCTG
GCGCCAAGTC CGACAGGTTT TATCCATCTT GGCAATCTGT ACGGGGCTTT CGTGGATGAA
CGTCTGGCTC ATCAGAGCAA CGGAGTGTTT ATTCTCCGCA TAGAGGATAC TGATGACAAG
CGTAAAGTTG AAGGAGCAGT GGAAACCATT ATATCCTCTT TGGAGTTCTT CAACCTCAAA
TTTGACGAAG GAGCAGGCAT CAACGGAGAA ACAGGCAATT ACGGCCCGTA TTTTCAGAGC
AATCGTGCTG AAATTTATCA AACCGTGGCC AAACATTTGG TTGAAATGGG CAGAGCCTAT
CCTTGTTTCT GCTCGGAAGA AGAACTTGAG GAAATAAGAA AACAGCAACT GGCTGAAAAC
GTTAATACAG GTTACTACGG CAAATGGGCT GTTCACAGAA ATTTGACCTT GGAAGAAGTT
CAAAAGCATC TGGAAAACAA CGAAAGCTTC GTTATTCGTT TTAAATCCAT GGGTAATCCT
GAAGAAACCT TTGAAATTGA TGATGCCATA AGAGGCCGTC TTTCCATGCA GGTTAACTTT
CAGGACATAG TGCTTCTAAA AGCAAACGGT ATACCAACCT ACCATTTTGC CCATGTGGTG
GATGACCATT TGATGAGAGT TACCCATGTG GTAAGAGGTG AGGAATGGCT TTCAACCCTT
CCCATCCATT ATGAATTGTT TACGACACTG GGATGGGATT TGCCGGTATA CTGTCATACA
GCCCATTTGA TGAAAATAGA CAACGGAGTT AAAAGAAAGC TTTCAAAAAG AAAGGATCCG
GAATTGGGAT TGGAATATTA CATGCAACTG GGGTATCACC CCGCTGCAGT AAGAGAGTAT
CTTATGACAA TATTAAACTC CAACTTTGAA GAGTGGAGAA TTCAAAATCC GGACAGCGAT
ATAAATGATT TTCCTTTCTC TCTGAATAAA ATGAGTAATT CCGGTGCTTT GTTTGACTTG
GACAAACTTA ACGATGTAAG CAAAAATGTT TTGGCTAAAA TTCCGGCGGA AGAAATTTAT
GAATTTTTGC TGAAATGGGC AAAGGAATAC AAAAAAGAGA TTGTTAATCT GCTCGAAGAA
CATAAGGATT CGGTAATAAA ACTTTTGTCT GTGGGCAGAA ATTCTGAAAA ACCGCGCAAA
GACTTAATCT ATTGTGAACA AATATTTGAG TTCATTAAAT ATTTCTTTGA TGAATACTTT
GCAATAGTTG ACAAATATCC GGACAACGTA GACGAAGAGG AAGCCAAAAA AATACTTAAG
GCATACCTTG AAACCTACGA CCACAACGAT GATCAAACAC AGTGGTTTGA AAAAATCAAG
GTTATTGCCA CAGAAAACGG ATATGCGGCA AAACCAAAGG ACTACAAGAA AAATCCCGAC
ATGTACAAAG GACATGTGGG TGATGTAAGC ACTGTTATAA GAATCGCCAT AGTAGGCCGC
AGCAGTTCAC CGGACCTTTG GGAGATACAG CAGATTATGG GCGAAGAAAA AGTAAGAGAA
AGAATTCAAA GATTGCTATA A
 
Protein sequence
MDYKKLADML FPHITKSVSY YEDVVFPARN LSPGAKVTRL APSPTGFIHL GNLYGAFVDE 
RLAHQSNGVF ILRIEDTDDK RKVEGAVETI ISSLEFFNLK FDEGAGINGE TGNYGPYFQS
NRAEIYQTVA KHLVEMGRAY PCFCSEEELE EIRKQQLAEN VNTGYYGKWA VHRNLTLEEV
QKHLENNESF VIRFKSMGNP EETFEIDDAI RGRLSMQVNF QDIVLLKANG IPTYHFAHVV
DDHLMRVTHV VRGEEWLSTL PIHYELFTTL GWDLPVYCHT AHLMKIDNGV KRKLSKRKDP
ELGLEYYMQL GYHPAAVREY LMTILNSNFE EWRIQNPDSD INDFPFSLNK MSNSGALFDL
DKLNDVSKNV LAKIPAEEIY EFLLKWAKEY KKEIVNLLEE HKDSVIKLLS VGRNSEKPRK
DLIYCEQIFE FIKYFFDEYF AIVDKYPDNV DEEEAKKILK AYLETYDHND DQTQWFEKIK
VIATENGYAA KPKDYKKNPD MYKGHVGDVS TVIRIAIVGR SSSPDLWEIQ QIMGEEKVRE
RIQRLL