Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0090 |
Symbol | cysS |
ID | 7978541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 116668 |
End bp | 118068 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644797064 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_002948296 |
Protein GI | 239825672 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000484516 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCA TTCGACTTTA TAACACGTTA ACAAGAAAAA AGGAACTGTT TGAACCGCTA GAACCGAATA AAGTAAAAAT GTATGTATGC GGTCCAACCG TATATAACTA TATTCATATT GGGAACGCCC GCGCCGCGAT TGTGTTTGAT ACGATTCGCC GTTACCTAGA GTTTCGCGGT TATGAAGTGA AATACGTCTC CAATTTTACG GATGTGGATG ACAAGTTAAT TAAGGCGGCG CGCGAATTAG GAGAAGATGT GCCGACAATC GCGGAGCGTT TTATTCAAGC GTATTTTGAA GATATTACAG CGCTTGGATG CAAAAAAGCG GATGTTCATC CACGCGTAAC GGAAAATATA GATACGATTA TTGAATTTAT TCAAACATTA ATCGATAGAG GTTATGCATA CGAAGTAGAC GGGGATGTGT ACTATCGTAC AAGGAAATTT AAAGAATATG GAAAACTCTC GCATCAATCT ATTGATGAAC TAAAAGCGGG AGCGCGCATT GAAGTAGGGG AAAAGAAAGA AGATCCGCTT GATTTTGCGT TATGGAAAGC GGCAAAAGAA GGCGAAATTT GCTGGGATAG CCCGTGGGGA AAAGGGCGTC CGGGATGGCA TATCGAGTGC TCGGCAATGG CGCGTAAATA TTTAGGAGAT ACGATTGATA TTCATGCCGG TGGTCAAGAT TTGACGTTTC CTCATCATGA AAACGAAATT GCACAATCAG AAGCGTTAAC AGGAAAACCG TTTGCGAAAT ATTGGCTTCA TAACGGATAT TTAAATATTA ATAACGAAAA AATGTCAAAA TCGCTAGGTA ATTTTGTGCT CGTTCACGAT ATTATCCAGC AAATCGATCC GCAAGTATTA CGGTTCTTTA TGCTTTCTGT TCATTACCGA CATCCGATTA ACTATAGTGA AGAATTATTA GAAAGCGCGA AAAAAGGATT GGAACGGTTA AAAACGTCCT ATTTTAATTT AAAACATCGA TTGCAAAGCA GCACGAACTT AACCGATGAT GATGATCAAT GGCTTGCGCG CATTCAAGAG CAACATGAGG CATTTATTCG GGAGATGGAC GATGATTTTA ACACAGCCAA TGGGATTGCC GTGTTATTTG AGCTATCGAA ACAAGCGAAC TTATATTTGT TGGAAAAAAA TACGTCCGAA CGCGTCATTC ATGCATTTTT GCGTGAATTT GAACAGCTGT TAGATGTACT GGGCATTACG TTGCAAGAAG AGGAATTATT GGACGAGGAA ATCGAAGCGC TTATTCAAAA GCGGAACGAA GCGAGAAAGA ATCGGAATTT TGCTTTAGCG GATCAAATTC GTGATGAATT AAAGGCGAAA AACATTATTT TAGAAGATAC GCCGCAAGGT ACGAGATGGA AAAGAGGATA A
|
Protein sequence | MSSIRLYNTL TRKKELFEPL EPNKVKMYVC GPTVYNYIHI GNARAAIVFD TIRRYLEFRG YEVKYVSNFT DVDDKLIKAA RELGEDVPTI AERFIQAYFE DITALGCKKA DVHPRVTENI DTIIEFIQTL IDRGYAYEVD GDVYYRTRKF KEYGKLSHQS IDELKAGARI EVGEKKEDPL DFALWKAAKE GEICWDSPWG KGRPGWHIEC SAMARKYLGD TIDIHAGGQD LTFPHHENEI AQSEALTGKP FAKYWLHNGY LNINNEKMSK SLGNFVLVHD IIQQIDPQVL RFFMLSVHYR HPINYSEELL ESAKKGLERL KTSYFNLKHR LQSSTNLTDD DDQWLARIQE QHEAFIREMD DDFNTANGIA VLFELSKQAN LYLLEKNTSE RVIHAFLREF EQLLDVLGIT LQEEELLDEE IEALIQKRNE ARKNRNFALA DQIRDELKAK NIILEDTPQG TRWKRG
|
| |