Gene Cthe_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1237 
SymbolleuS 
ID4809929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1500066 
End bp1502543 
Gene Length2478 bp 
Protein Length825 aa 
Translation table11 
GC content43% 
IMG OID640106660 
Productleucyl-tRNA synthetase 
Protein accessionYP_001037662 
Protein GI125973752 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0495] Leucyl-tRNA synthetase 
TIGRFAM ID[TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTACA ATTTTGTGGA TATTGAAAAG AAGTGGCAAA AGAAATGGCT GGAGGAAAAA 
GCTTTTGCCG TCCGGGAAGA CGAAAGCAAA AAGAAATATT ATGTGCTTGA AATGTTTCCT
TATCCTTCCG GAAATCTTCA CATGGGACAT GTGAGGAATT ATTCCATAGG AGATGTTGTG
GCAAGATTTA AAAGAATGAA TGGTTTTAAT GTTCTTCACC CGATGGGATG GGATGCTTTC
GGTTTGCCTG CCGAAAATGC TGCCATAAAA AGAGGGGTTC ATCCCAATGA CTGGACATGG
TCCAATATCG ACAACATGAG AAGGCAGTTA AAACAACTTG GCATCAGTTA TGATTGGGAC
AGAGAGGTTG CCACCTGTCA TCCCGATTAC TACAAGTGGA CTCAATGGAT GTTCCTTCAA
CTTTATAAAA ACGGTCTTGC GTACAAGAAA AAGGCCTATG TAAACTGGTG CCCATCCTGT
GCGACGGTTC TTGCCAACGA GCAGGTTGTA AACGGAGTTT GTGAGCGCTG CAAGTCGGTA
GTCGGAAAAA AAGATTTGGA ACAGTGGTTC TTTAAGATAA CCGACTATGC TCAAAGGCTT
CTGGATGATA TTGAAAAGCT CAAAGGATGG CCTGACAAGG TTAAAGTTAT GCAGCAAAAC
TGGATTGGAC GGAGCGAAGG TGTCGAAGTG GATTTTAAAG TGGACGGAAT GGACAAAGCC
GTCAGGGTAT ATACCACAAG ACCTGATACC ATATATGGCG TGACCTACGT GGTGATTGCT
CCCGAGCATC CTGTGGTGAA AGAATTGATT AAGGGAACGG AACAGGAACA GGTATGTAAC
GAGTTTATAA ACAAGATGAT GTTTTTGAAT GAAATAGACA GAACTGCAAC CGATGTTGAG
AAAGAAGGAG TTTTCACAGG AAGGTATGTC ATTAATCCGT TAAACGGAGA CAGAGTTCCG
TTGTACCTTG CCAATTATGT TCTTGCAGAG TATGGAACCG GTGTTGTCAT GGCTGTTCCT
GCTCATGACC AGAGAGACTT TGAGTTTGCA AAGAAGTATA ATTTGCCGAT TAAAGTTGTA
ATTCAGCCGG AAGGCCAGGA GCTTGATGCG TCCAGGATGA CGGAAGCCTT CGTTGAGGTG
GGTTATCTTG TCAATTCCGC AGAATTTGAC GGTGTAAGAA GCGATGAGGC TATAGGAAAG
ATAATTGATT ATATCGAGCA AAAAGGCTAT GGAAAAAGAA AGATAAATTA CAGGCTCAGA
GACTGGCTGA TTTCCAGACA GAGATACTGG GGTGCGCCCA TACCGATAAT CTACTGTGAT
GATTGTGGAG CTGTGCCTGT TCCGGAAGAG GATTTGCCGG TTATTTTGCC GACGGACATC
AAATTTTCCG GTGTGGGAGA GTCGCCGCTT TCGACAAGTG AGACATTCAT ATCAGCCCCT
TGTCCTAAAT GCGGTAAGAT GGGAAGAAGA GAATTGGATA CCATGGATAC CTTTGTATGT
TCTTCATGGT ATTATCTTAG ATATTGTGAC CCATGTAACG ACAAGGCTCC TTTTGACAAG
GAAAGGATAA GATACTGGTT GCCGGTTGAC CAGTATATAG GCGGTGTTGA ACATGCAATT
CTGCACCTGT TGTATTCCAG ATTCCTTATG AAGGTTCTCT ATGATTTGGG ATATGTGGAT
TATGACGAAC CGTTTACAAA CCTCCTTACT CAGGGAATGG TGCTGAAAGA CGGAGCCAAA
ATGTCAAAAT CCCTGGGGAA TGTTGTAAGT CCTGAAGAGA TAATTGAAAA ATACGGTGCC
GATACGGCAA GGTTGTTCAT ACTGTTTGCA TCTCCACCGG AAAAAGACCT TGAATGGAGT
GACCAGGGAG TTGAAGGCTG CTATAGGTTT ATCAACAGAG TATGGAGAAT AGTCAATGAG
TTTGCCGATG CGGTAAAGGA AGGCGGAAAT ATTGACACTT CCACATTTAC CAAGGCTGAT
AAAGAGCTGT GGTACATGCT GAACAACACA TTGAAGCGCG TTACGGATGA TATCAGCCAG
AGGTTCAACT TCAACACTGC AATCAGTGCG GTTATGGAGC TGGTTAATTC CTTGTATTAT
TACAAGGATA AAGTGGCTGA TGACAGCAAG AACAAGGCTC TTGTCAGGGA AGTAATTGAA
AAGTTGATAA TAATGCTGGC TCCCTTTATT CCTCATGCAA CAGAAGAGCT TTGGTCCGCC
ATAGGAAAGG AAGGCAGCGT GCACGAACAG AAGTGGCCTT CATTTGACCC GGCTGCTCTT
GTGAAGGATG AAATTGAAAT AGTGGTTCAG ATAAACGGAA AGGTGAGAGA CAAGATTGTT
GTACCGTCGG ATCTTACAAA AGAGCAGGTT GAGGAGCGGG CTTTGAATAG TGAAAAGATT
AAAGCTGAAA CGGCCGGGAA AAATGTTGTA AAGGTTATTT CCGTTCCGGG CAAGCTGGTG
AATATTGTTG TGAAATAA
 
Protein sequence
MYYNFVDIEK KWQKKWLEEK AFAVREDESK KKYYVLEMFP YPSGNLHMGH VRNYSIGDVV 
ARFKRMNGFN VLHPMGWDAF GLPAENAAIK RGVHPNDWTW SNIDNMRRQL KQLGISYDWD
REVATCHPDY YKWTQWMFLQ LYKNGLAYKK KAYVNWCPSC ATVLANEQVV NGVCERCKSV
VGKKDLEQWF FKITDYAQRL LDDIEKLKGW PDKVKVMQQN WIGRSEGVEV DFKVDGMDKA
VRVYTTRPDT IYGVTYVVIA PEHPVVKELI KGTEQEQVCN EFINKMMFLN EIDRTATDVE
KEGVFTGRYV INPLNGDRVP LYLANYVLAE YGTGVVMAVP AHDQRDFEFA KKYNLPIKVV
IQPEGQELDA SRMTEAFVEV GYLVNSAEFD GVRSDEAIGK IIDYIEQKGY GKRKINYRLR
DWLISRQRYW GAPIPIIYCD DCGAVPVPEE DLPVILPTDI KFSGVGESPL STSETFISAP
CPKCGKMGRR ELDTMDTFVC SSWYYLRYCD PCNDKAPFDK ERIRYWLPVD QYIGGVEHAI
LHLLYSRFLM KVLYDLGYVD YDEPFTNLLT QGMVLKDGAK MSKSLGNVVS PEEIIEKYGA
DTARLFILFA SPPEKDLEWS DQGVEGCYRF INRVWRIVNE FADAVKEGGN IDTSTFTKAD
KELWYMLNNT LKRVTDDISQ RFNFNTAISA VMELVNSLYY YKDKVADDSK NKALVREVIE
KLIIMLAPFI PHATEELWSA IGKEGSVHEQ KWPSFDPAAL VKDEIEIVVQ INGKVRDKIV
VPSDLTKEQV EERALNSEKI KAETAGKNVV KVISVPGKLV NIVVK