Gene Cphamn1_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2000 
SymbolrpsA 
ID6375693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2151684 
End bp2153459 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content46% 
IMG OID642684492 
Product30S ribosomal protein S1 
Protein accessionYP_001960392 
Protein GI189500922 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000749746 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.10797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAA CGCAAACAAT CGAACAAAAG AAGGTAGTTG AGAAAGGGCC TAAAAACCAT 
CATGTGAAGT TTTTCGCAAA CTACGACTCT TCGGAACTTG ACCAGATGGA GCTGCTCTAT
TCGAGCACGC TTAACGAGAT TACCGAAGAG GAAATCGTTA AAGGTACTAT TGTCGCCATT
TCGAACAAGG ATGTTACCAT TGATGTCGGA TTTAAATCCG AGGGTATCGT TTCGAAGCTT
GAGTTCAAGG ACGAAGAAGA GCTGCAAGTC GGTGACGAGG TAGAAGTATA CCTCGAAAAC
ATCGAAGACA AAATGGGACA GCTTATTCTC TCGAAGAGAA AAGCGGACGT TCTGAGGATC
TGGGACAAAA TCTATGATTC CATCGAGAAC GACACCATTA TCAACGGAAA GATAATTAAC
CGTGTCAAGG GCGGTATGAC GGTTTCGCTT TCCGGAGTCG AGGCATTTCT TCCGGGTTCG
CAGATCGACG TCAAACCTGT GCGCGACTTC GATGCGCTCG TAGGACAGAC AATGGACTTC
AGAGTGGTAA AAATCAATCC TGTGACACAG AATATCGTTG TCAGTCACAA GGTTATTCTC
GAAGAAGAGT ACGCCGCGAA GCGCGAAGAG ATGCTTGCCA ATATCAAGGT GGGTATGATT
CTCGAGGGTT CAGTAAAGAA TATCACCGAT TTCGGTATTT TTGTCGACCT TGGCGGTCTC
GACGGGCTTG TTCATATTAC CGATATCACC TGGGGCAGGA TCAATCATCC TTCAGAAGTT
GTCGACCTTG ATCAGCCGAT CAAAGTTGTT GTTGTTGCTT TTGACGAAAA TACCAAGAGG
GTCTCTCTCG GGATGAAGCA GCTTGAATCT CATCCTTGGG AAAATATCGA GATCAAGTAC
CCTGTAGGCA TCAAAACGAA CGGTCGAGTT GTTTCCATTA CTGATTACGG CGCATTTGTC
GAGATAGAAA AAGGTATTGA AGGCCTTGTT CACATCTCTG AAATGAGCTG GACTCAGCAC
ATCAAGCATC CAAGCCAGTT TGTTACTCTC GGTCAGGAAG TTGAATGTGT TATCCTCAAT
GTCGATAAAG AGCACACCAA GCTATCGCTT TCCATGAAGC GGGTGAACGA AGATCCATGG
ATCGCTCTTT CAGAAAAATA TATCGAGAAT TCCCTGCACA AAGGTACAGT CAGCAACATT
ACCGATTTCG GTGTTTTTGT CGAGCTTGAG CCCGGAGTCG ATGGCCTTGT GCACATTTCA
GATCTCTCCT GGACGAAGAA GATTCGCCAC CCGAGTGAAC TGGTCAAAAA GAATCAGGAC
CTTGAGGTAA AAGTGTTGAA ATTTGATGTC AACGCCCGCC GAATCGCGCT TGGTCACAAG
CAGATCAACC AGGATCCGTG GGATGAATTC GAACAGAAAT ATGCGGTCGG GGCGGAGTGT
GCCGGAAAAA TATCGCAGAT CATAGAAAAA GGCGTTATCG TTATCCTTCC TGGTGACGTT
GACGGTTTTG TTCCGGTATC GCATCTTCTT CAGGGTGGCG TTAAGGACAT TAACGCATCC
TTCAAGGTTG AAGATGAACT GCCGCTTCGT GTTATCGAGT TCGATAAGGA AAACAAACGG
ATTATTCTCT CGGCGCTCGA ATATTTCAAA GATAAGAGCA AAGAGGAGAT TGAAGCCTAC
CTTCAGGCTC ATCCGAATGA AAAGAAAGAG ATTGAGGATG CCACTGCCGA GCTGGATTCA
CAATCAAATA CCGATGACGC TAAAGACGGC GAGTAA
 
Protein sequence
MSETQTIEQK KVVEKGPKNH HVKFFANYDS SELDQMELLY SSTLNEITEE EIVKGTIVAI 
SNKDVTIDVG FKSEGIVSKL EFKDEEELQV GDEVEVYLEN IEDKMGQLIL SKRKADVLRI
WDKIYDSIEN DTIINGKIIN RVKGGMTVSL SGVEAFLPGS QIDVKPVRDF DALVGQTMDF
RVVKINPVTQ NIVVSHKVIL EEEYAAKREE MLANIKVGMI LEGSVKNITD FGIFVDLGGL
DGLVHITDIT WGRINHPSEV VDLDQPIKVV VVAFDENTKR VSLGMKQLES HPWENIEIKY
PVGIKTNGRV VSITDYGAFV EIEKGIEGLV HISEMSWTQH IKHPSQFVTL GQEVECVILN
VDKEHTKLSL SMKRVNEDPW IALSEKYIEN SLHKGTVSNI TDFGVFVELE PGVDGLVHIS
DLSWTKKIRH PSELVKKNQD LEVKVLKFDV NARRIALGHK QINQDPWDEF EQKYAVGAEC
AGKISQIIEK GVIVILPGDV DGFVPVSHLL QGGVKDINAS FKVEDELPLR VIEFDKENKR
IILSALEYFK DKSKEEIEAY LQAHPNEKKE IEDATAELDS QSNTDDAKDG E