Gene ECH_0402 details

Gene Information

Locus tag: ECH_0402
Symbol: rpsA
ID: 3927480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 393235
End bp: 394938
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 32%
IMG OID: 637901526
Product: 30S ribosomal protein S1
Protein accession: YP_507222
Protein GI: 88657606
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
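
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon, neither of which the record states. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 393235
end_bp = 394938
gene_length_bp = 1704
protein_length_aa = 567


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, ASSUMING the length includes
    exactly one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under those assumptions the listed values agree: 394938 - 393235 + 1 = 1704 bp, and 1704 / 3 - 1 = 567 aa.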


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGACC TTCAAACAAA AACAAATTTA GGATCAGAAA ATTATTTACA TAATATAAAG 
AAAATAAAAT CCAACAAGTT TATTAGCCAT AATCTTGAAG TAGAAGATTC CGATAATGAT
AGTAGCTCAG AGTTTCAGCA TGCGTTAAAA GAATTTATTG ATGACAGTGT CAAAGAAGGA
CAAATAATAG AGGGCACTAT TATATGCATA GATAAGGGGT ATGTAACAAT AGACTCAGGA
TTAAAATCCG AAAGTATTGT TTCACTTAAA GAATTTGAGC TTGGTGATGA TTATCAGAAT
ATTAGTATAG GATCAAAAGT AAAGCTATAC TTAGAGAAAA TCGAAGGTCG TAATGGTAGT
GTAGTATTAA GTAGAGAAAA AGCTATCAGA GATGAACTAT GGCAGAAACT TGAAGAAGCT
GCAGAAAAAA AAGAAGATGT TGAAGGAGTA ATATTTAGCT CAATTAAATG TGGGTACACT
GTAGACATAA AAGGTGTTGT GGCATTTTTA CCAGCAAGCC ATGTAGCTCT AAGACAAGTT
AAAGACATCA CTCCCCTATT AGGTAAAAAG CAAAAGTTCC GTATATTAAA GATGGATAAA
AAGCAAGGTA ATATTGTAGT CTCAAGAAAA GCTATATTAG AAGAATCACT TGCAGATGCT
AAAAGTTTAT ATCTCAGCCA ATTAAATGAA GGAGATATTA TTGAAGGTAA AGTTAAAAGC
ATAACTAAAT ATGGTGTATT CATAGAAATA CATGAATCTC CTTCAGTAGG TGTAGTTGAC
GGGCTATTGC ATATAACTGA TATATCATGG AGCCGTGTAA GTCATCCTTC AGCTGTTTTC
TCATGTGGCC AAACTGTCAA AACAAAAATT ATAAAAATTG ACCGTGAAAA TAAGAAAATT
TCTCTTGGTG TAAAGCAATT AGAGGAAAGT CCATGGTCAA ATATTGAGAA AAAATATCCT
GTTGATAGTA TCCATAAAGG TATTGTCACA AGCATTGAGG AATATGGGGT CTTTGTTGAA
TTAGAAACAG GGATAGAGGG GTTAGTACAC GTATCCGAAG TAAGCTGGAC AAAAAATTCT
CTTCCTATAA ACCAGTTATT TATGAGAGGT GAAGAAATTC AAGTTAAGGT ATTAAGCATA
GACACAGCTA AAAGTCGCAT GAGTCTTAGT ATGAAAAGGT GTCAAGATAA CCCATGGCAG
GCGTTTACAC TAAAGTATCC TTTAGGTTCT ATTGTATCAA CAAAGATAAA AAATATTACA
AATCTTGGTA TATTTGTTTC ATTTCAGGAT AACACACTCA ATGATGGCAT TGAAGGGTTA
ATTCGTACTA CAGAGTTAAG TTGGTCCTTA TCACCAGAAG CAGCAATAAA AAAATATAAC
GTTGGAGATT CTGTAGAAGC AAAAATACTC ATGATAAATC CAAATAAAAG CAAGATAGAT
CTTGGAGTAA AACAACTAGA GTATGACCCT TTTCTTGATT TATTAAAGAA AATTAACTTA
GGAGATAAAA TACCAGTTAC AGTTACCAAA GTTGTTGAAG ATACTGGTAT ATTAGTTGAT
GTATTCAATG GATCAAATAG TTTCAACTTA CTAATTGAAC AAGAATACCT ACCTGATAAT
AAAAAATTCT TCCCAGGAGA TAGGTTAGAA GCTGAAGTAT TGCTTATTGA AACTTATAAT
ATGATACTAT CATTAAAGCA CTGA
 
Protein sequence
MEDLQTKTNL GSENYLHNIK KIKSNKFISH NLEVEDSDND SSSEFQHALK EFIDDSVKEG 
QIIEGTIICI DKGYVTIDSG LKSESIVSLK EFELGDDYQN ISIGSKVKLY LEKIEGRNGS
VVLSREKAIR DELWQKLEEA AEKKEDVEGV IFSSIKCGYT VDIKGVVAFL PASHVALRQV
KDITPLLGKK QKFRILKMDK KQGNIVVSRK AILEESLADA KSLYLSQLNE GDIIEGKVKS
ITKYGVFIEI HESPSVGVVD GLLHITDISW SRVSHPSAVF SCGQTVKTKI IKIDRENKKI
SLGVKQLEES PWSNIEKKYP VDSIHKGIVT SIEEYGVFVE LETGIEGLVH VSEVSWTKNS
LPINQLFMRG EEIQVKVLSI DTAKSRMSLS MKRCQDNPWQ AFTLKYPLGS IVSTKIKNIT
NLGIFVSFQD NTLNDGIEGL IRTTELSWSL SPEAAIKKYN VGDSVEAKIL MINPNKSKID
LGVKQLEYDP FLDLLKKINL GDKIPVTVTK VVEDTGILVD VFNGSNSFNL LIEQEYLPDN
KKFFPGDRLE AEVLLIETYN MILSLKH
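
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, not part of the record itself; the helper names `clean_blocks`, `gc_fraction`, and `has_start_and_stop` are hypothetical, and the start-codon test is simplified to ATG only, although translation table 11 also permits alternative start codons.

```python
def clean_blocks(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_start_and_stop(seq, stops=("TAA", "TAG", "TGA")):
    """Simplified CDS check: whole codons, ATG start, standard stop codon.
    ASSUMPTION: only ATG is accepted as a start codon here."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGAAGACC TTCAAACAAA AACAAATTTA GGATCAGAAA ATTATTTACA TAATATAAAG"
last_line = "ATGATACTAT CATTAAAGCA CTGA"

print(len(clean_blocks(first_line)))           # bases on a full line
print(clean_blocks(last_line)[-3:])            # final codon of the gene
print(has_start_and_stop(clean_blocks(first_line + last_line)))
```

Pasting the full gene sequence into `clean_blocks` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field listed in the record.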