Gene Cthe_0723 details

Gene Information

Locus tag: Cthe_0723
Symbol:
ID: 4810341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 877816
End bp: 879042
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 41%
IMG OID: 640106140
Product: tyrosyl-tRNA synthetase
Protein accession: YP_001037151
Protein GI: 125973241
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
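
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reported gene length is consistent with that) and that the last codon of the gene is a stop codon. The function names are illustrative and are not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length_bp(877816, 879042))  # 1227
print(protein_length_aa(1227))         # 408
```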


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.154159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAGTG TGTTTGAAAC TTTGAAAGAA AGGGGATATA TTGCGCAACT TACCCATGAA 
GAAAAAATAA AAGAGCTTCT AGAAAAGGAA AAAATTACAT TTTATATCGG ATTTGATCCC
ACTGCCGACA GTCTTCATGT GGGACATTTC CTGCAGATGA TGGTCATGGC CCACATGCAA
AAAGCGGGAC ACAGACCCAT AGCGTTAATC GGCGGCGGTA CAGCCATGGT CGGCGACCCC
ACAGGGAGAA CTGATATGAG AAAAATGATG ACCCGGGAGG AAATCAAGCA TAATGCCGAT
TGCTTTAAAA AGCAGCTTTC CAAATTCATT GAATTTGGAG AAGGAAAAGC CATAATGGTG
GATAACGCCG ATTGGCTGCT TGATTTAAAT TATATTGAAT TTTTGAGAGA TATAGGAGTG
CATTTTTCCG TCAACCGAAT GTTGACGGCA GAATGTTTTA AATCCAGACT CGAAAGAGGT
CTTTCCTTTA TCGAGTTCAA CTACATGCTC ATGCAAAGCT ATGACTTCCT GAAACTGTAT
AAGGAATACG GCTGTATCAT GCAGCTCGGC GGTGACGACC AGTGGTCAAA CATTCTCGGA
GGAATAGATC TTATTAGAAG AAAAGAAGGA AAAGAAGTCT ATGGTATGAC CTTTACACTT
CTTACCACCA GTGAAGGCAA AAAGATGGGT AAGACGGAAA AAGGTGCATT ATGGCTGGAT
GCCAACAAGA CTTCTCCTTA TGAGTTCTAT CAGTACTGGA GAAATATTCA TGACGCAGAT
GTGATAAAAT GCTTAAAACT CTTAACCTTT GTTCCTATGG AAGAAATCGA AGAGTATGCA
AAACTTAAAG ATCAGGAAAT TAACATTGCT AAAAAACGTC TGGCCTTCGA AGTTACAAAG
CTTATTCACG GTGAGGAAGA AGCATTAAAT GCCCAAAAAA CTGCAGAGGC CTTGTTTGAA
AAAGGTGCAA GCACTGACAA CATGCCTACT ACTGAAGTAG CCTCCGGTGA GCTTTCCAAC
GGTATAAACA TAATTGACCT GCTTTTAAAA ACAAAGCTGA TTCCTTCAAA GGGTGAAGGC
CGCCGACTTA TCGAACAGGG CGGAATTTCT GTAAACGACG TTAGAGTTGA AGGTTTTGAC
AGATTAGTCA CCATGGATGA TTTCAACAAC GGAGAGTTAA TTATTAAAAA GGGTAAAAAG
ACATACCACA GGGTAAAACT TGTATAA
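
The gene sequence is displayed in space-separated blocks of ten bases. The following is a minimal sketch, in plain Python with hypothetical helper names (`join_blocks`, `gc_fraction`), of how such a listing could be collapsed into one contiguous string and how a GC fraction comparable to the GC content field could be computed from it. Only the first line of the listing is used here for illustration.

```python
def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (six blocks of ten bases).
first_line = "ATGTCCAGTG TGTTTGAAAC TTTGAAAGAA AGGGGATATA TTGCGCAACT TACCCATGAA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full listing to `join_blocks` and then to `gc_fraction` would give the value for the whole gene.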
 
Protein sequence
MSSVFETLKE RGYIAQLTHE EKIKELLEKE KITFYIGFDP TADSLHVGHF LQMMVMAHMQ 
KAGHRPIALI GGGTAMVGDP TGRTDMRKMM TREEIKHNAD CFKKQLSKFI EFGEGKAIMV
DNADWLLDLN YIEFLRDIGV HFSVNRMLTA ECFKSRLERG LSFIEFNYML MQSYDFLKLY
KEYGCIMQLG GDDQWSNILG GIDLIRRKEG KEVYGMTFTL LTTSEGKKMG KTEKGALWLD
ANKTSPYEFY QYWRNIHDAD VIKCLKLLTF VPMEEIEEYA KLKDQEINIA KKRLAFEVTK
LIHGEEEALN AQKTAEALFE KGASTDNMPT TEVASGELSN GINIIDLLLK TKLIPSKGEG
RRLIEQGGIS VNDVRVEGFD RLVTMDDFNN GELIIKKGKK TYHRVKLV
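
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch, assuming the widely used compact encoding of the standard codon assignments (translation table 11 uses the same amino-acid assignments and differs in its permitted start codons). The `translate` helper is hypothetical and is shown on the first 30 bases of the gene only.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGTCCAGTG TGTTTGAAAC TTTGAAAGAA"))  # MSSVFETLKE
```

The output matches the first ten residues of the protein sequence listed in this record.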