Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1372 |
Symbol | thrS |
ID | 7312222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1669008 |
End bp | 1670915 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608294 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_002505706 |
Protein GI | 220928797 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000383369 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGTA TTACCTTGAA AGACGGTACC GTAAAAGAAT ATCAGAAAGG TGTTACAGTC CTTGAAGTGG CAGAGAGTAT AAGTGCAGGA CTGGCAAGAG TATCTTTGGT TGGAGAAGTT GACGGCAAGG TAACGGAGCT TGATTTCAAA CTGGAAAGGG ATTGTAGCTT AAATCTTTTA ACCTTTGAGG ATGAAGGAGG AAGGCTTGCT TACAGACACA CTGCTTCCCA TGTGTTGGCT CAGGCGGTGA AAAGACTGTA CCCTGATGCC AAGCTGGCAA TAGGGCCGGC AATAGATACA GGATTTTATT ATGATTTTGA AAGGGAAAAG CCATTTTCAA TAGAGGAACT TGATAGTATT GAAAAGGAAA TGGAAAAAAT AGTAAAAGAG GACTTAAAGC TGGAGAGGTT CGTTCTCCCA AGGAATGAGG CAATAAAGCT TATGGAAGAA AAAGGAGAAC CGTATAAGGC AGAACTTATA ATGGACCTTC CCCAAGGAGA AGAAATATCC TTTTACAAAC AAGGGGAGTT TACAGACCTT TGTGCAGGGC CACATTTGAC AGGGACAGGA AGATTAAAAG CTGTAAAGTT GCTTTCTGTT GCGGGTGCTT ATTGGAGAGG CAACGAGAAG AATAAAATGC TTCAGAGAAT ATATGGTACA GCTTTTCCCA AGAAAAGCCA GCTGGATGAA TACCTTTTCA GAATTGAAGA GGCAAAAAAG CGTGACCACA GAAAGCTGGG AAGGGAGCTG GATTTATTCG ATATATTGGA CGAAGGCCCG GGTTTCCCGT TCTTTATGCC AAAAGGAATG GTACTTCGCA ATCTTCTTGA GGATTACTGG AGGTCGGAAC ATAAGAAAGC AGGCTATCAG GAGATTAAGA CACCTGTTAT ACTGAACAAG GAACTCTGGC TGAGATCGGG GCATTGGGAT AATTACAAGA ATAATATGTA CACAGTTGCA ATAGATGAGC AGGATTGTGC AATAAAACCA ATGAATTGTC CGGGCGGAAT ACTCGTATTC AAAAGGAAAC TACATTCATA CAGGGATCTG CCTCAAAGAA TGGGAGAATT GGGGTTGGTG CACAGACATG AGCTTTCAGG TGCACTCCAT GGGCTGATGA GAGTAAGGTG CTTTACTCAG GACGATGCAC ATATTTTTAT GACTCCTGAA CAGATTACTG ATGAGGTAAC CGGGGTAATA AACCTGATAG ATGATTTTTA TAGTGTATTT GGGTTTAAAT ACAATGTTGA ATTATCTACA AGGCCGGAAA AGTCCATAGG ATCAGATGAA ATGTGGGAAT TGTCTACTGC GGGGCTAAAA AAAGCACTTG ACGAAAAAGG TATCAAGTAT ACAATCAATA AGGGAGACGG TGCTTTCTAT GGGCCTAAGA TAGATTTTCA TCTAGAGGAT TCTATAGGTC GTACATGGCA GTGCGGTACA ATTCAGCTTG ACATGAACCT GCCGGAGAGA TTTGACCTTA GCTATATTGG CCCAGATGGA GAAAAGCACA GGCCGGTAAT GGTTCACAGG GTTGTTTTTG GAAGCATAGA GAGATTCATA GCCATATTAA CCGAACATTT TGCAGGTGCT TTTCCAACAT GGTTGAGTCC TGTTCAGGTT AAAATTCTTC CTCTTGTTGA CAAGCACTAC GATTATGCTT ATGAAGTAAA AAAGCTTCTG GAGGCTGACG ATATTAGGGT AGAGGTAGAT ACAAGAAACG AAAAGATAGG TTACAAAATC CGTGAAGCCC AGATGGATAA GACTCCTTAT ATGCTTGTAA TTGGTGACAA GGAGCTGGAG GGCAGGCTTG TTTCTGTCAG ATCCAGAAAA GACGGTGATT TAGGGACTAT TACACCGGAA CAGTTTGCAG AGAAAATATC GAATGAAATA AAAAATAAAC TGAGATAA
|
Protein sequence | MISITLKDGT VKEYQKGVTV LEVAESISAG LARVSLVGEV DGKVTELDFK LERDCSLNLL TFEDEGGRLA YRHTASHVLA QAVKRLYPDA KLAIGPAIDT GFYYDFEREK PFSIEELDSI EKEMEKIVKE DLKLERFVLP RNEAIKLMEE KGEPYKAELI MDLPQGEEIS FYKQGEFTDL CAGPHLTGTG RLKAVKLLSV AGAYWRGNEK NKMLQRIYGT AFPKKSQLDE YLFRIEEAKK RDHRKLGREL DLFDILDEGP GFPFFMPKGM VLRNLLEDYW RSEHKKAGYQ EIKTPVILNK ELWLRSGHWD NYKNNMYTVA IDEQDCAIKP MNCPGGILVF KRKLHSYRDL PQRMGELGLV HRHELSGALH GLMRVRCFTQ DDAHIFMTPE QITDEVTGVI NLIDDFYSVF GFKYNVELST RPEKSIGSDE MWELSTAGLK KALDEKGIKY TINKGDGAFY GPKIDFHLED SIGRTWQCGT IQLDMNLPER FDLSYIGPDG EKHRPVMVHR VVFGSIERFI AILTEHFAGA FPTWLSPVQV KILPLVDKHY DYAYEVKKLL EADDIRVEVD TRNEKIGYKI REAQMDKTPY MLVIGDKELE GRLVSVRSRK DGDLGTITPE QFAEKISNEI KNKLR
|
| |