Gene EcSMS35_0571 details

Gene Information

Locus tag: EcSMS35_0571
Symbol: cysS
ID: 6143051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 578516
End bp: 579901
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 55%
IMG OID: 641615463
Product: cysteinyl-tRNA synthetase
Protein accession: YP_001742670
Protein GI: 170683319
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID: [TIGR00435] cysteinyl-tRNA synthetase
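
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 578516
END_BP = 579901
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```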


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000388833
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC 
GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC
GGGCGTACCT TTGTTGCCTT TGACGTGGTC GCGCGCTACC TGCGTTTCCT CGGCTATAAG
CTGAAGTATG TGCGCAACAT TACTGATATC GACGACAAAA TCATCAAACG CGCCAATGAA
AATGGCGAAA GCTTTGTGGC GCTGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT
GATGCTTTGA ACATTTTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA
ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC GGACAACGGC
GACGTGATGT TCGACGTCCC GACCGATCCA ACTTATGGCG TGCTGTCGCG TCAGGATCTC
GACCAGCTGC AGGCAGGCGC GCGCGTTGAC GTGGTCGACG ACAAACGCAA CCCGATGGAC
TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG
GGCCGTCCTG GCTGGCACAT TGAGTGTTCG GCAATGAACT GCAAGCAGCT GGGTAACCAC
TTTGATATCC ACGGCGGCGG TTCTGACCTG ATGTTCCCGC ACCACGAGAA CGAAATTGCC
CAGTCCACCT GTGCGCACGA TGGCCAGTAT GTGAATTACT GGATGCATTC CGGTATGGTG
ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTCACCGT GCGTGACGTG
CTGAAATACT ACGATGCGGA AACTGTGCGT TACTTCCTGA TGTCGGGTCA CTATCGCAGT
CAGTTGAACT ACAGTGAAGA GAACCTGAAA CAGGCGCGTG CGGCGCTGGA ACGTCTCTAC
ACTGCGCTGC GCGGCACAGA CAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG
CGCTTTATCG AGGCGATGAA CGACGATTTC AACACCCCGG AAGCCTATTC CGTACTGTTT
GATATGGCGC GTGAAGTAAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG
GCGTCTCACC TGCGTAAACT TTCTGCCGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG
TTCCTGCAAA GCGGTGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT
CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCAGATGC GGCGCGTGAC
CGTCTTAACG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT
AAGTAA
 
Protein sequence
MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK 
LKYVRNITDI DDKIIKRANE NGESFVALVD RMIAEMHKDF DALNILRPDM EPRATHHIAE
IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD
FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA
QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS
QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMNDDF NTPEAYSVLF
DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI
QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K
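
The relationship between the gene sequence, the translation table, and the protein sequence can be illustrated with a short sketch. It is not the method the database used to produce this record. It assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCTAAAAA TCTTCAATAC TCTGACACGC"))  # MLKIFNTLTR
```

Pasting the full gene sequence into `translate` and comparing the result with the protein sequence (after passing it through `clean`) gives a quick consistency check, and `gc_percent` can be compared against the reported GC content.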