Gene Ccel_1372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1372 
SymbolthrS 
ID7312222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1669008 
End bp1670915 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content41% 
IMG OID643608294 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002505706 
Protein GI220928797 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000383369 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGTA TTACCTTGAA AGACGGTACC GTAAAAGAAT ATCAGAAAGG TGTTACAGTC 
CTTGAAGTGG CAGAGAGTAT AAGTGCAGGA CTGGCAAGAG TATCTTTGGT TGGAGAAGTT
GACGGCAAGG TAACGGAGCT TGATTTCAAA CTGGAAAGGG ATTGTAGCTT AAATCTTTTA
ACCTTTGAGG ATGAAGGAGG AAGGCTTGCT TACAGACACA CTGCTTCCCA TGTGTTGGCT
CAGGCGGTGA AAAGACTGTA CCCTGATGCC AAGCTGGCAA TAGGGCCGGC AATAGATACA
GGATTTTATT ATGATTTTGA AAGGGAAAAG CCATTTTCAA TAGAGGAACT TGATAGTATT
GAAAAGGAAA TGGAAAAAAT AGTAAAAGAG GACTTAAAGC TGGAGAGGTT CGTTCTCCCA
AGGAATGAGG CAATAAAGCT TATGGAAGAA AAAGGAGAAC CGTATAAGGC AGAACTTATA
ATGGACCTTC CCCAAGGAGA AGAAATATCC TTTTACAAAC AAGGGGAGTT TACAGACCTT
TGTGCAGGGC CACATTTGAC AGGGACAGGA AGATTAAAAG CTGTAAAGTT GCTTTCTGTT
GCGGGTGCTT ATTGGAGAGG CAACGAGAAG AATAAAATGC TTCAGAGAAT ATATGGTACA
GCTTTTCCCA AGAAAAGCCA GCTGGATGAA TACCTTTTCA GAATTGAAGA GGCAAAAAAG
CGTGACCACA GAAAGCTGGG AAGGGAGCTG GATTTATTCG ATATATTGGA CGAAGGCCCG
GGTTTCCCGT TCTTTATGCC AAAAGGAATG GTACTTCGCA ATCTTCTTGA GGATTACTGG
AGGTCGGAAC ATAAGAAAGC AGGCTATCAG GAGATTAAGA CACCTGTTAT ACTGAACAAG
GAACTCTGGC TGAGATCGGG GCATTGGGAT AATTACAAGA ATAATATGTA CACAGTTGCA
ATAGATGAGC AGGATTGTGC AATAAAACCA ATGAATTGTC CGGGCGGAAT ACTCGTATTC
AAAAGGAAAC TACATTCATA CAGGGATCTG CCTCAAAGAA TGGGAGAATT GGGGTTGGTG
CACAGACATG AGCTTTCAGG TGCACTCCAT GGGCTGATGA GAGTAAGGTG CTTTACTCAG
GACGATGCAC ATATTTTTAT GACTCCTGAA CAGATTACTG ATGAGGTAAC CGGGGTAATA
AACCTGATAG ATGATTTTTA TAGTGTATTT GGGTTTAAAT ACAATGTTGA ATTATCTACA
AGGCCGGAAA AGTCCATAGG ATCAGATGAA ATGTGGGAAT TGTCTACTGC GGGGCTAAAA
AAAGCACTTG ACGAAAAAGG TATCAAGTAT ACAATCAATA AGGGAGACGG TGCTTTCTAT
GGGCCTAAGA TAGATTTTCA TCTAGAGGAT TCTATAGGTC GTACATGGCA GTGCGGTACA
ATTCAGCTTG ACATGAACCT GCCGGAGAGA TTTGACCTTA GCTATATTGG CCCAGATGGA
GAAAAGCACA GGCCGGTAAT GGTTCACAGG GTTGTTTTTG GAAGCATAGA GAGATTCATA
GCCATATTAA CCGAACATTT TGCAGGTGCT TTTCCAACAT GGTTGAGTCC TGTTCAGGTT
AAAATTCTTC CTCTTGTTGA CAAGCACTAC GATTATGCTT ATGAAGTAAA AAAGCTTCTG
GAGGCTGACG ATATTAGGGT AGAGGTAGAT ACAAGAAACG AAAAGATAGG TTACAAAATC
CGTGAAGCCC AGATGGATAA GACTCCTTAT ATGCTTGTAA TTGGTGACAA GGAGCTGGAG
GGCAGGCTTG TTTCTGTCAG ATCCAGAAAA GACGGTGATT TAGGGACTAT TACACCGGAA
CAGTTTGCAG AGAAAATATC GAATGAAATA AAAAATAAAC TGAGATAA
 
Protein sequence
MISITLKDGT VKEYQKGVTV LEVAESISAG LARVSLVGEV DGKVTELDFK LERDCSLNLL 
TFEDEGGRLA YRHTASHVLA QAVKRLYPDA KLAIGPAIDT GFYYDFEREK PFSIEELDSI
EKEMEKIVKE DLKLERFVLP RNEAIKLMEE KGEPYKAELI MDLPQGEEIS FYKQGEFTDL
CAGPHLTGTG RLKAVKLLSV AGAYWRGNEK NKMLQRIYGT AFPKKSQLDE YLFRIEEAKK
RDHRKLGREL DLFDILDEGP GFPFFMPKGM VLRNLLEDYW RSEHKKAGYQ EIKTPVILNK
ELWLRSGHWD NYKNNMYTVA IDEQDCAIKP MNCPGGILVF KRKLHSYRDL PQRMGELGLV
HRHELSGALH GLMRVRCFTQ DDAHIFMTPE QITDEVTGVI NLIDDFYSVF GFKYNVELST
RPEKSIGSDE MWELSTAGLK KALDEKGIKY TINKGDGAFY GPKIDFHLED SIGRTWQCGT
IQLDMNLPER FDLSYIGPDG EKHRPVMVHR VVFGSIERFI AILTEHFAGA FPTWLSPVQV
KILPLVDKHY DYAYEVKKLL EADDIRVEVD TRNEKIGYKI REAQMDKTPY MLVIGDKELE
GRLVSVRSRK DGDLGTITPE QFAEKISNEI KNKLR