Gene Cthe_1312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1312 
Symbol 
ID4809452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1595498 
End bp1596886 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content42% 
IMG OID640106736 
Productglycyl-tRNA synthetase 
Protein accessionYP_001037737 
Protein GI125973827 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0423] Glycyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00389] glycyl-tRNA synthetase, dimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00852036 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTAA AAAAGACAAT GGAGAAGATT GTAGCCCTGG CTAAAAACCG AGGATTTATT 
TATCCCGGCT CTGAAATATA CGGCGGTTTG GCAAATTCAT GGGATTACGG ACCTCTTGGA
GTGGAGCTTA AAAACAATAT AAAAAAGGCA TGGTGGAAGA AATTTGTTCA GGAAAACCCT
TACAATGTGG GTGTTGACTG TGCAATACTC ATGAATCCTC AGGTGTGGGT TGCATCGGGA
CATGTAGGCG GTTTCAGCGA CCCCCTGATT GACTGTAAAG AATGTAAAAC ACGTCACAGG
GCGGACAAAA TGATAGAGGA ATGGAATCTT AAAAACAATG AAAATGTCAA GGTTGACGGC
TGGTCCAATG AAATGCTTAT GAATTATATC AGGGAAAAGG GTGTAACCTG TCCTGAGTGT
GGCGGAAAAA ACTTTACCGA TATCAGGAAG TTTAACCTTA TGTTTAAAAC TTTCCAGGGA
GTGACTGAGG ATTCCAAATC CGAGATATAT TTAAGGCCGG AAACAGCCCA GGGTATATTT
GTGAACTTTA AAAATGTTCA GAGAACAACA AGAAAAAAGA TACCCTTTGG TATTGGACAG
ATAGGAAAGT CTTTCAGAAA CGAAATAACT CCCGGAAACT TTATTTTCAG AACCCGTGAG
TTTGAACAAA TGGAGCTGGA GTTTTTCTGT GAGCCGGGAA CAGACCTTGA GTGGTTTGAA
TACTGGAAGA ATTTCTGCTT CAACTGGTTA TTGAGCCTAA ACATTAAAAA GGAAAACCTG
AGGATGCGTG ACCATTCAAA GGAGGAACTG TCCCACTACA GCAATGCCAC AACCGATATT
GAATACCTGT TCCCGTTTGG CTGGGGAGAG CTGTGGGGAA TTGCAGACAG AACCGACTTT
GACTTAAGAC AGCATGCAGA GCATTCGAAA GAGGATTTGT CCTACTTTGA CCCGAACACC
AATAGAAAAT ACATACCGTA CTGTATTGAA CCGTCTCTCG GTGCAGACAG AGTTGCTTTG
GTTTTCCTCT GCGATGCGTA TGACGAGGAA GAAGTGGAAG AAGGGGATAT AAGGGTTGTG
CTGCGCTTCC ATCCTGCCAT AGCGCCGGTA AAAATAGCTG TGCTTCCTCT TTCTAAAAAG
CTTGGAGATG AGGCATATAA GATTTATGAA ATGCTCATTA AAAAATACAA CTGTGAATAT
GATGAGACAG GAAGTATAGG AAAGAGATAC AGAAGACAGG ATGAGATAGG CACACCTTAT
TGCGTAACCT TTGACTTTGA TTCCCTGAAC GACAGGTGTG TTACCGTAAG AGACAGAGAC
TCCATGCAGC AGGTTAGGAT TAAAATTGAC GAACTACTTT CGTATTTTGA AGGGAAATTT
GATTTCTAA
 
Protein sequence
MEVKKTMEKI VALAKNRGFI YPGSEIYGGL ANSWDYGPLG VELKNNIKKA WWKKFVQENP 
YNVGVDCAIL MNPQVWVASG HVGGFSDPLI DCKECKTRHR ADKMIEEWNL KNNENVKVDG
WSNEMLMNYI REKGVTCPEC GGKNFTDIRK FNLMFKTFQG VTEDSKSEIY LRPETAQGIF
VNFKNVQRTT RKKIPFGIGQ IGKSFRNEIT PGNFIFRTRE FEQMELEFFC EPGTDLEWFE
YWKNFCFNWL LSLNIKKENL RMRDHSKEEL SHYSNATTDI EYLFPFGWGE LWGIADRTDF
DLRQHAEHSK EDLSYFDPNT NRKYIPYCIE PSLGADRVAL VFLCDAYDEE EVEEGDIRVV
LRFHPAIAPV KIAVLPLSKK LGDEAYKIYE MLIKKYNCEY DETGSIGKRY RRQDEIGTPY
CVTFDFDSLN DRCVTVRDRD SMQQVRIKID ELLSYFEGKF DF