Gene EcolC_3096 details


Gene Information

Locus tag: EcolC_3096
Symbol: cysS
ID: 6066247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3390404
End bp: 3391789
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 55%
IMG OID: 641602513
Product: cysteinyl-tRNA synthetase
Protein accession: YP_001726047
Protein GI: 170021093
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID: [TIGR00435] cysteinyl-tRNA synthetase
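
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and chosen only for this illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3390404
END_BP = 3391789


def gene_length(start_bp, end_bp):
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1386 461
```

Under those assumptions the result matches the listed Gene Length (1386 bp) and Protein Length (461 aa).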


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0232678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC 
GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC
GGGCGTACCT TTGTTGCTTT TGACGTGGTT GCGCGCTATC TGCGTTTCCT CGGCTATAAG
CTGAAGTATG TGCGCAACAT TACCGATATC GACGACAAAA TCATCAAACG CGCCAATGAA
AATGGCGAAA GCTTTGTGGC GCTGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT
GATGCGTTGA ACATTCTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA
ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC TGATAACGGC
GACGTGATGT TCGACGTCCC GACCGATCCG ACTTATGGTG TGCTGTCGCG TCAGGATCTC
GACCAGTTGC AGGCAGGCGC GCGCGTTGAC GTGGTCGATG ACAAACGCAA CCCGATGGAC
TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG
GGCCGTCCAG GCTGGCACAT TGAATGTTCG GCAATGAACT GCAAGCAGCT GGGTAACCAC
TTTGATATCC ACGGCGGCGG TTCTGACCTG ATGTTCCCGC ACCACGAAAA CGAAATCGCG
CAGTCCACCT GTGCCCATGA TGGTCAGTAT GTGAACTACT GGATGCACTC GGGGATGGTG
ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTTACCGT GCGCGATGTG
CTGAAATACT ACGACGCGGA AACCGTGCGT TACTTTCTGA TGTCGGGCCA CTATCGCAGC
CAGCTGAACT ACAGCGAAGA GAACCTGAAG CAGGCGCGTG CGGCGCTGGA GCGTCTCTAC
ACTGCGCTGC GCGGCACAGA TAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG
CGCTTTATTG AAGCGATGGA CGACGATTTC AACACCCCGG AAGCCTATTC CGTGCTGTTT
GATATGGCGC GTGAAGTGAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG
GCATCTCACC TGCGTAAACT TTCCGCTGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG
TTCCTGCAAA GCGGCGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT
CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCGGATGC GGCGCGTGAT
CGTCTTAACG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT
AAGTAA
 
Protein sequence
MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK 
LKYVRNITDI DDKIIKRANE NGESFVALVD RMIAEMHKDF DALNILRPDM EPRATHHIAE
IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD
FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA
QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS
QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMDDDF NTPEAYSVLF
DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI
QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K
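
The relationship between the gene sequence and the protein sequence above can be reproduced by translating codon by codon. The sketch below is illustrative only: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. It also assumes the spaces and line breaks in the listing are purely cosmetic.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as grouped in the listing.
print(translate("ATGCTAAAAA TCTTCAATAC TCTGACACGC"))  # MLKIFNTLTR
```

Applying `translate` to the full gene sequence should give the protein sequence listed above, and `gc_content` can be compared with the GC content reported in the Gene Information section.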