Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0418 |
Symbol | rpsA |
ID | 4570972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 461536 |
End bp | 463311 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639765017 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_910900 |
Protein GI | 119356256 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000167097 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAA CACTAACACT GGAGAAAAAA GTCCAGGAAA GACTCGCGAG AAAAAAAGTT AAATACTTTG CTCATTATGA GCCTGCGGAA CTGGAACAGA TGGAGATGCT TTATACAAGC ACTCTGTGTG AGATCACTGA GGATGAAATT GTAAAGGGAC GTATTGTCTC GATTTCCAAT AAGGATGTCA CTATCGATGT AGGCTTCAAG TCTGAAGGTA TTGTCTCACT GCTTGAATTT CGTGATGATG ATGATGTCGC TGTTGGAGAT GAAGTTGAAG TCTATCTTGA GAATATTGAA GACAAGATGG GCCAGCTTAT TCTTTCGAAA AAGAAAGCTG ATGTGTTGAG GATTTGGGAC AAGATCTATG ATTCAATAGA AAACGACACG ATCATCAATG GCAAAATCAT CAATCGTGTC AAAGGTGGTA TGACGGTTTC CCTGTCCGGC GTCGAGGCAT TTCTTCCCGG CTCACAGATT GATGTCAAGC CGGTACGGGA CTTTGATGCG CTTGTCGGCC AGACCATGGA CTTCAGGGTT GTCAAAATCA ATCCTGTCAC CCAGAATATC GTTGTCAGTC ATAAAGTTAT TCTTGAAGAG GCATATGCTG CAAGGCGCGA AGAGATGCTT GCAAATATCA AGGTTGGTAT GGTGCTGGAA GGTACGGTCA AAAATATTAC CGATTTCGGT ATATTTGTAG ACCTTGGCGG TCTTGACGGT CTTGTTCATA TCACAGACAT TACCTGGGGC AGAATCAACC ATCCTTCAGA AGTTGTTGAT CTTGATCAGC CGATCAAGGT TGTTGTTGTC GGCTTTGACG AGAATACCAA GAGAGTCTCT CTCGGTATGA AGCAGCTTGA GGCTCACCCA TGGGAAAATA TCGAAATTAA ATATCCTGTC GGATCGAAAG CCAGAGGTCG CATTGTATCA ATTACCGATT ACGGCGCGTT TGTCGAAATC GAGAAAGGTA TCGAAGGCCT GGTTCATATT TCCGAAATGA GCTGGACTCA GCATATCAAG CATCCGAGCC AGTTTGTCAG TCTCAACCAG GAGGTAGAGT GCGTGATTCT CAATATCGAC AAGGATCATA CCAAGCTCTC TCTTTCCATG AAGCGTGTGA ATGAGGATCC ATGGATTGCC CTTTCAGAAA AATACATTGA GGCATCGCTG CACAAAGGTA CGGTCAGCAA TATTACCGAT TTCGGTGTTT TTGTTGAGCT TGAACCGGGT GTTGACGGTC TTGTGCATAT TTCAGACCTC TCGTGGACAA AGAAAATTCG CCATCCAAGC GAACTGGTGA AAAAGAATCA GGAACTTGAT GTCAAGGTGC TCAAGTTTGA TGTGAATGCA AGGCGTATCG CTCTTGGTCA CAAACAGATC AATCCTGATC CATGGGATGA TTTTGAACAG AAATATGCGG TTGGCGCTGA AACCCCTGCC CGTATTTCCC AGATCATCGA AAAAGGCGTT ATTGTGATTC TTCCTGGCGA TGTAGACGGA TTTGTACCGG TATCGCATTT GCTTCAGGGC GGAGTAAAGG ATATCCACTC CTCGTTTGCT CTGGGCGATG AACTGCCGCT TCGCGTGATC GAATTTGACA AGGAGAACAA GCGAATCATT CTCTCTGCGC TCGAATATTT CAAGGACAAG AGCAAAGAGG AGATTGAGGC CTATCTTCAG GCTCATCCAA ATGAGAAAAA AGAGATCGAA GACGCTACTG CCGAACTGGA ACCACAGGTT AAATCCGCCG AGAAAAAAAG CGGTGAGGCA AAATAG
|
Protein sequence | MAETLTLEKK VQERLARKKV KYFAHYEPAE LEQMEMLYTS TLCEITEDEI VKGRIVSISN KDVTIDVGFK SEGIVSLLEF RDDDDVAVGD EVEVYLENIE DKMGQLILSK KKADVLRIWD KIYDSIENDT IINGKIINRV KGGMTVSLSG VEAFLPGSQI DVKPVRDFDA LVGQTMDFRV VKINPVTQNI VVSHKVILEE AYAARREEML ANIKVGMVLE GTVKNITDFG IFVDLGGLDG LVHITDITWG RINHPSEVVD LDQPIKVVVV GFDENTKRVS LGMKQLEAHP WENIEIKYPV GSKARGRIVS ITDYGAFVEI EKGIEGLVHI SEMSWTQHIK HPSQFVSLNQ EVECVILNID KDHTKLSLSM KRVNEDPWIA LSEKYIEASL HKGTVSNITD FGVFVELEPG VDGLVHISDL SWTKKIRHPS ELVKKNQELD VKVLKFDVNA RRIALGHKQI NPDPWDDFEQ KYAVGAETPA RISQIIEKGV IVILPGDVDG FVPVSHLLQG GVKDIHSSFA LGDELPLRVI EFDKENKRII LSALEYFKDK SKEEIEAYLQ AHPNEKKEIE DATAELEPQV KSAEKKSGEA K
|
| |