Gene EcE24377A_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1008 
SymbolrpsA 
ID5589647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1027909 
End bp1029582 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content51% 
IMG OID640924715 
Product30S ribosomal protein S1 
Protein accessionYP_001462129 
Protein GI157156817 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.83759e-11 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAT CTTTTGCTCA ACTCTTTGAA GAGTCCTTAA AAGAAATCGA AACCCGCCCG 
GGTTCTATCG TTCGTGGCGT TGTTGTTGCT ATCGACAAAG ACGTAGTACT GGTTGACGCT
GGTCTGAAAT CTGAGTCCGC CATCCCGGCT GAGCAGTTCA AAAACGCCCA GGGCGAGCTG
GAAATCCAGG TAGGTGACGA AGTTGACGTT GCTCTGGACG CAGTAGAAGA CGGCTTCGGT
GAAACTCTGC TGTCCCGTGA GAAAGCTAAA CGTCACGAAG CCTGGATCAC GCTGGAAAAA
GCTTACGAAG ATGCTGAAAC TGTTACCGGT GTTATCAACG GCAAAGTTAA GGGCGGCTTC
ACTGTTGAGC TGAACGGTAT TCGTGCGTTC CTGCCAGGTT CTCTGGTAGA CGTTCGTCCG
GTGCGTGACA CTCTGCACCT GGAAGGCAAA GAGCTTGAAT TTAAAGTAAT CAAGCTGGAT
CAGAAGCGCA ACAACGTTGT TGTTTCTCGT CGTGCCGTTA TCGAATCCGA AAACAGCGCA
GAGCGCGATC AGCTGCTGGA AAACCTGCAG GAAGGCATGG AAGTTAAAGG TATCGTTAAG
AACCTCACTG ACTACGGTGC ATTCGTTGAT CTGGGCGGCG TTGACGGCCT GCTGCACATC
ACTGACATGG CCTGGAAACG CGTTAAGCAT CCGAGCGAAA TCGTCAACGT GGGCGACGAA
ATCACTGTTA AAGTGCTGAA GTTCGACCGC GAACGTACCC GTGTATCCCT GGGCCTGAAA
CAGTTGGGCG AAGATCCGTG GGTAGCTATC GCTAAACGTT ATCCGGAAGG TACCAAACTG
ACTGGTCGCG TGACCAACCT GACCGACTAC GGCTGCTTCG TTGAAATCGA AGAAGGCGTT
GAAGGCCTGG TACACGTTTC CGAAATGGAT TGGACCAACA AAAACATCCA CCCGTCCAAA
GTTGTTAACG TTGGCGATGT AGTGGAAGTT ATGGTTCTGG ATATCGACGA GGAACGTCGT
CGTATCTCCC TGGGTCTGAA ACAGTGCAAA GCTAACCCGT GGCAGCAGTT CGCGGAAACC
CACAACAAGG GCGACCGTGT TGAAGGTAAA ATCAAGTCTA TCACTGACTT CGGTATCTTC
ATCGGCCTGG ACGGCGGCAT CGACGGCCTG GTTCACCTGT CTGACATCTC CTGGAACGTT
GCAGGCGAAG AAGCAGTTCG TGAATACAAA AAAGGCGACG AAATCGCTGC AGTTGTTCTG
CAGGTTGACG CAGAACGTGA ACGTATCTCC CTGGGCGTTA AACAGCTCGC AGAAGATCCG
TTCAACAACT GGGTTGCTCT GAACAAGAAA GGCGCTATCG TAACCGGTAA AGTAACTGCA
GTTGACGCTA AAGGCGCAAC CGTAGAACTG GCTGACGGCG TTGAAGGTTA CCTGCGTGCT
TCTGAAGCAT CCCGTGACCG CGTTGAAGAC GCTACCCTGG TTCTGAGCGT TGGCGACGAA
GTTGAAGCTA AATTCACCGG CGTTGATCGT AAAAACCGCG CAATCAGCCT GTCTGTTCGT
GCGAAAGACG AAGCTGACGA GAAAGATGCA ATCGCAACTG TTAACAAACA GGAAGATGCA
AACTTCTCCA ACAACGCAAT GGCTGAAGCT TTCAAAGCAG CTAAAGGCGA GTAA
 
Protein sequence
MTESFAQLFE ESLKEIETRP GSIVRGVVVA IDKDVVLVDA GLKSESAIPA EQFKNAQGEL 
EIQVGDEVDV ALDAVEDGFG ETLLSREKAK RHEAWITLEK AYEDAETVTG VINGKVKGGF
TVELNGIRAF LPGSLVDVRP VRDTLHLEGK ELEFKVIKLD QKRNNVVVSR RAVIESENSA
ERDQLLENLQ EGMEVKGIVK NLTDYGAFVD LGGVDGLLHI TDMAWKRVKH PSEIVNVGDE
ITVKVLKFDR ERTRVSLGLK QLGEDPWVAI AKRYPEGTKL TGRVTNLTDY GCFVEIEEGV
EGLVHVSEMD WTNKNIHPSK VVNVGDVVEV MVLDIDEERR RISLGLKQCK ANPWQQFAET
HNKGDRVEGK IKSITDFGIF IGLDGGIDGL VHLSDISWNV AGEEAVREYK KGDEIAAVVL
QVDAERERIS LGVKQLAEDP FNNWVALNKK GAIVTGKVTA VDAKGATVEL ADGVEGYLRA
SEASRDRVED ATLVLSVGDE VEAKFTGVDR KNRAISLSVR AKDEADEKDA IATVNKQEDA
NFSNNAMAEA FKAAKGE