Gene Cthe_0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0917 
Symbol 
ID4811210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1099458 
End bp1101200 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content44% 
IMG OID640106336 
Productglutaminyl-tRNA synthetase 
Protein accessionYP_001037344 
Protein GI125973434 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0008] Glutamyl- and glutaminyl-tRNA synthetases 
TIGRFAM ID[TIGR00440] glutaminyl-tRNA synthetase
[TIGR00463] glutamyl-tRNA synthetase, archaeal and eukaryotic family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000234027 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCTC AGGTTGATGG CAAAGCAAAT GAATTAAATT CAACAGAAGA AAATGAAAAA 
GTATATTCAA ATTTTATTCA GGATATTATT GATGAGGATA ATAGAACAAA TAGATATGGC
GGAAGAGTGC ATACGAGGTT TCCGCCCGAG CCCAACGGCT ACCTTCATAT AGGACATGCC
AAGTCAATTT GCCTTAATTT CGGAATTGCA GAACAGAACG GCGGTTTGTG CAATCTTAGA
TTTGACGATA CAAATCCTTC GAAGGAAGAC ACTGAATATG TTGAGTCCAT TAAAGCCGAT
GTTAAATGGC TTGGTTTTGA CTGGGAAGAC AGATTGTACT ATGCATCGGA TTATTTTGAC
AAAATGTTTG AATACGCCGT TAAGCTGATT AAAATGGGAA AGGCTTATGT TTGCGATTTG
AGTGCCGATG AGATAAGGGA ATACAGGGGA ACTTTGACCG AGCCCGGAAA GGAAAGCCCG
TATCGCAACA GAAGTGTTGA AGAGAATCTG GATTTGTTTA TGAGAATGAA AAACGGTGAA
TTCGAGGAAG GTTCGAGGGT TTTGCGGGCA AAAATAGATA TGGCATCCCC CAATCTCAAT
ATGAGGGACC CGGTGATTTA CAGAATTATA AAGGCAAGTC ATCACAGGTC CGGTGACAAG
TGGTGTATTT ACCCGATGTA TGATTTTGCC CATCCTATAT CTGACTCTCT TGAGGGTATA
ACCCATTCCA TCTGTACGCT GGAGTTTGAA GACCACAGGC CTTTGTACAA CTGGGTTTTG
GAGACCCTTG ACATGGAGTG CAAGACCCGT CAGATAGAGT TTGCCCGTTT GAATTTGACC
TATACGGTAA TGAGCAAACG AAAATTGCTA AGGCTTGTGC AGGAAGGGCA TGTGAGGGGA
TGGGACGACC CAAGAATGCC TACCATATCA GGACTTAGAA GACGTGGATA TACTCCTTCG
GCCATAAGAA ACTTTTGTGC CCGCATAGGG GTTGCCAAGA GCAACAGCAC CGTTGATATA
TCGCTCCTTG AGCATTGTAT CAGAGAGGAA CTTAATCAGA AGGCTCAAAG GGTAATGGCG
GTACTGAGAC CTCTCAAACT TGTCATAGAC AACTATCCTG AGGGAATGGT TGAAGAGTTT
GAGGTTGAGA ACAATCCCGA GGATCCGAAT GCGGGAACCA GGAAAGTACC TTTTTCAAAA
GTTCTTTATA TTGAGAAGGA CGATTTTTGC GAAAATCCTC CGAAGAAATA TTTCCGCCTG
GCGCCCGGTC AGGAGGTTAG ACTCAAAGGT GCTTATATAG TAAAATGCGT GGATGTTGTA
AAGGATGACA AAACCGGAGA GATAACAGAA GTACATTGCA CCTATGACCC CCAATCCCGT
GGAGGAAATG CTTCTGACGG GCGAAAAGTA AAAGGTACCA TTCACTGGGT TTCGGCAGCT
CATGCCATTG ATGCCGAGGT ACGCCTGTAT GATCATTTGT TTAGCGTGCC CAATCCCGGT
GCTGATGAAA ATGTTGACTT TATTGAACAG CTAAATCCCA ATTCTCTCGA AGTGTTAAAA
TCGTGCAAGC TCGAGCCCAG CCTTGCTGGT GCAAAACCGG GAGATGCGTT CCAGTTCCTC
CGTTTGGGTT ATTTCTGTGT TGATTTGGTG GATTCAAAGG AAGGCTCTTT GGTATTTAAC
AGAACGGTGA CACTCAAGGA TACCTGGGCA AAGATAGCAA GTAAAGAGAA AGAAGATAAG
TAA
 
Protein sequence
MDAQVDGKAN ELNSTEENEK VYSNFIQDII DEDNRTNRYG GRVHTRFPPE PNGYLHIGHA 
KSICLNFGIA EQNGGLCNLR FDDTNPSKED TEYVESIKAD VKWLGFDWED RLYYASDYFD
KMFEYAVKLI KMGKAYVCDL SADEIREYRG TLTEPGKESP YRNRSVEENL DLFMRMKNGE
FEEGSRVLRA KIDMASPNLN MRDPVIYRII KASHHRSGDK WCIYPMYDFA HPISDSLEGI
THSICTLEFE DHRPLYNWVL ETLDMECKTR QIEFARLNLT YTVMSKRKLL RLVQEGHVRG
WDDPRMPTIS GLRRRGYTPS AIRNFCARIG VAKSNSTVDI SLLEHCIREE LNQKAQRVMA
VLRPLKLVID NYPEGMVEEF EVENNPEDPN AGTRKVPFSK VLYIEKDDFC ENPPKKYFRL
APGQEVRLKG AYIVKCVDVV KDDKTGEITE VHCTYDPQSR GGNASDGRKV KGTIHWVSAA
HAIDAEVRLY DHLFSVPNPG ADENVDFIEQ LNPNSLEVLK SCKLEPSLAG AKPGDAFQFL
RLGYFCVDLV DSKEGSLVFN RTVTLKDTWA KIASKEKEDK