Gene Information |
Locus tag | EcSMS35_0571 |
Symbol | cysS |
ID | 6143051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 578516 |
End bp | 579901 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615463 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_001742670 |
Protein GI | 170683319 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
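The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 578516
end_bp = 579901

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields.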
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000388833 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC GGGCGTACCT TTGTTGCCTT TGACGTGGTC GCGCGCTACC TGCGTTTCCT CGGCTATAAG CTGAAGTATG TGCGCAACAT TACTGATATC GACGACAAAA TCATCAAACG CGCCAATGAA AATGGCGAAA GCTTTGTGGC GCTGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT GATGCTTTGA ACATTTTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC GGACAACGGC GACGTGATGT TCGACGTCCC GACCGATCCA ACTTATGGCG TGCTGTCGCG TCAGGATCTC GACCAGCTGC AGGCAGGCGC GCGCGTTGAC GTGGTCGACG ACAAACGCAA CCCGATGGAC TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG GGCCGTCCTG GCTGGCACAT TGAGTGTTCG GCAATGAACT GCAAGCAGCT GGGTAACCAC TTTGATATCC ACGGCGGCGG TTCTGACCTG ATGTTCCCGC ACCACGAGAA CGAAATTGCC CAGTCCACCT GTGCGCACGA TGGCCAGTAT GTGAATTACT GGATGCATTC CGGTATGGTG ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTCACCGT GCGTGACGTG CTGAAATACT ACGATGCGGA AACTGTGCGT TACTTCCTGA TGTCGGGTCA CTATCGCAGT CAGTTGAACT ACAGTGAAGA GAACCTGAAA CAGGCGCGTG CGGCGCTGGA ACGTCTCTAC ACTGCGCTGC GCGGCACAGA CAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG CGCTTTATCG AGGCGATGAA CGACGATTTC AACACCCCGG AAGCCTATTC CGTACTGTTT GATATGGCGC GTGAAGTAAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG GCGTCTCACC TGCGTAAACT TTCTGCCGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG TTCCTGCAAA GCGGTGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCAGATGC GGCGCGTGAC CGTCTTAACG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT AAGTAA
|
Protein sequence | MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK LKYVRNITDI DDKIIKRANE NGESFVALVD RMIAEMHKDF DALNILRPDM EPRATHHIAE IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMNDDF NTPEAYSVLF DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K
|
| |
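The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how they might be checked against each other, the code below strips the block spacing, computes a GC percentage, and translates codons with the standard codon assignments, which translation table 11 shares for elongation. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, and special handling of the alternative start codons allowed by table 11 is deliberately left out because this gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments for elongation; it differs
# from table 1 only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
head = "ATGCTAAAAA TCTTCAATAC TCTGACACGC"
print(translate(head))  # prints MLKIFNTLTR
```

The ten residues produced from the first thirty bases match the first block of the protein sequence shown above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be compared in the same way.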