Gene EcHS_A0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0601 
SymbolcysS 
ID5594790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp614132 
End bp615517 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content54% 
IMG OID640919785 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001457368 
Protein GI157160050 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000694006 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAA TCTTCAATAC TCTGACACGC CAAAAAGAGG AATTTAAGCC TATTCACGCC 
GGGGAAGTCG GCATGTACGT GTGTGGAATC ACCGTTTACG ATCTCTGTCA TATCGGTCAC
GGGCGTACCT TTGTTGCTTT TGACGTGGTT GCGCGCTATC TGCGTTTCCT CGGCTATAAA
CTGAAGTATG TGCGCAACAT TACCGATATC GACGACAAAA TCATCAAACG CGCCAATGAA
AATGGCGAAA GCTTTGTGGC GCTGGTGGAT CGCATGATCG CCGAAATGCA CAAAGATTTT
GATGCTTTGA ACATTCTGCG CCCGGATATG GAGCCGCGCG CGACGCACCA TATCGCAGAA
ATTATTGAAC TCACTGAACA ACTGATCGCC AAAGGTCACG CTTATGTGGC GGACAACGGC
GACGTGATGT TTGACGTCCC GACCGATCCA ACTTATGGCG TGCTGTCGCG TCAGGATCTC
GACCAGCTGC AGGCTGGCGC GCGCGTTGAC GTGGTTGATG ACAAACGCAA CCCGATGGAC
TTCGTTCTGT GGAAGATGTC GAAAGAGGGC GAACCGAGCT GGCCGTCTCC GTGGGGCGCG
GGTCGTCCTG GCTGGCACAT TGAATGTTCG GCCATGAACT GCAAGCAGCT GGGTAACCAC
TTTGATATCC ACGGCGGCGG TTCAGACCTG ATGTTCCCGC ACCACGAGAA CGAAATCGCG
CAGTCCACCT GTGCCCATGA TGGTCAGTAT GTGAACTACT GGATGCACTC CGGAATGGTG
ATGGTTGACC GCGAGAAGAT GTCCAAATCG CTGGGTAACT TCTTTACCGT GCGCGATGTG
CTGAAATACT ACGACGCGGA AACCGTGCGT TACTTCCTGA TGTCGGGCCA CTATCGCAGC
CAGTTGAACT ACAGCGAAGA GAACCTGAAG CAGGCGCGTG CGGCGCTGGA GCGTCTCTAC
ACTGCGCTGC GCGGCACAGA TAAAACCGTT GCGCCTGCCG GTGGCGAAGC GTTTGAAGCG
CGCTTTATTG AAGCGATGGA CGACGATTTC AACACCCCGG AAGCCTATTC CGTACTGTTT
GATATGGCGC GTGAAGTAAA CCGTCTGAAA GCAGAAGATA TGGCAGCGGC GAATGCAATG
GCATCTCACC TGCGTAAACT TTCCGCTGTA TTGGGCCTGC TGGAGCAAGA ACCGGAAGCG
TTCCTGCAAA GCGGCGCGCA GGCAGACGAC AGCGAAGTGG CTGAGATTGA AGCGTTAATT
CAACAGCGTC TGGATGCCCG TAAAGCGAAA GACTGGGCGG CGGCGGATGC GGCGCGTGAT
CGTCTTAACG AGATGGGGAT CGTGCTGGAA GATGGCCCGC AAGGGACCAC CTGGCGTCGT
AAGTAA
 
Protein sequence
MLKIFNTLTR QKEEFKPIHA GEVGMYVCGI TVYDLCHIGH GRTFVAFDVV ARYLRFLGYK 
LKYVRNITDI DDKIIKRANE NGESFVALVD RMIAEMHKDF DALNILRPDM EPRATHHIAE
IIELTEQLIA KGHAYVADNG DVMFDVPTDP TYGVLSRQDL DQLQAGARVD VVDDKRNPMD
FVLWKMSKEG EPSWPSPWGA GRPGWHIECS AMNCKQLGNH FDIHGGGSDL MFPHHENEIA
QSTCAHDGQY VNYWMHSGMV MVDREKMSKS LGNFFTVRDV LKYYDAETVR YFLMSGHYRS
QLNYSEENLK QARAALERLY TALRGTDKTV APAGGEAFEA RFIEAMDDDF NTPEAYSVLF
DMAREVNRLK AEDMAAANAM ASHLRKLSAV LGLLEQEPEA FLQSGAQADD SEVAEIEALI
QQRLDARKAK DWAAADAARD RLNEMGIVLE DGPQGTTWRR K