Gene Acel_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1089 
SymbolrpsA 
ID4484995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1206182 
End bp1207729 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content63% 
IMG OID639729864 
Product30S ribosomal protein S1 
Protein accessionYP_872847 
Protein GI117928296 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0119116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CTACTGATGT ACGCCCCGAA ACCCCCGAGC CGCCCGCGTC CTCACGCATC 
GCCGTCAACG ATGTCGGCTC GACCGACGAT TTCATCGCTG CGATCGACAA GACGATCAAA
TACTTCAACG ACGGGGACAT CGTGGAGGGG ACCGTCGTCA AGGTTGATCG CGACGAGGTC
CTCCTCGACA TCGGCTACAA GACCGAGGGC GTGATCCCGT CGCGCGAGCT CTCGATCAAA
CATGACGTCG ATCCGCACGA GGTAGTCAAG GTCGGGGACC GGATCGAGGC CCTTGTCCTG
CAGAAGGAGG ACAAGGACGG CCGTTTGATC CTCTCGAAGA AGCGGGCGCA GTACGAGCGG
GCCTGGGGGA CCATCGAACA GATCAAGGAA GCGGACGGCG TCGTCACCGG CACGGTTATC
GAGGTCGTCA AGGGTGGGCT CATCCTCGAC ATCGGGCTGC GCGGGTTCCT GCCGGCGTCA
CTCGTCGAGA TGCGGCGAGT TCGCGATCTC GCTCCGTACG TGGGTCGGAA GCTAGAAGCG
AAGATCATCG AGCTGGACAA GAACCGAAAC AACGTCGTCC TGTCTCGGCG GGCTTACCTC
GAGCAGACCC AGTCGGAGGT ACGCCAGACC TTCCTCACCA CGCTGAAGAA GGGGCAGATC
CGCAACGGCG TGGTCAGTTC GATCGTGAAT TTCGGTGCGT TCGTCGACTT GGGCGGGGTG
GACGGGCTGG TCCACGTCTC CGAGCTCTCC TGGAAGCACA TCGACCACCC CAGTGAGGTC
GTCGAGGTCG GAATGCCGGT CACCGTTGAG GTCTTGGACG TCGACCTCGA CCGGGAACGG
GTTTCGCTCT CCCTCAAGTC GACGCAGGAA GATCCGTGGC AGCTGTTCGC CCGCACCCAC
AGTCTCGGCC AGGTGGTTCC GGGGAAGGTG ACCAAGCTGG TACCGTTCGG CGCCTTTGTT
CGGGTGGAAG ACGGCATCGA GGGATTGGTG CACATTTCCG AACTCTCCGA CCGGCATGTC
GAAATTCCCG AGCAGGTCGT CCAGGTCGGT GACGAGATCT TCGTAAAGAT CATCGATATT
GACCTTGAGC GCCGCCGGAT CAGTCTGTCA CTGAAGCAGG CGCTGGAAGG CGTGGAATCG
ACCAGCCCCG AGCAATTCGA CCCCACCGTT TACGGCATGC CAGCCAGTTA CGACGAGCAC
GGCAATTACA TCTATCCAGA GGGCTTTGAC CCGGAGACCA ACCAATGGCT TCCCGGCCAC
GAGGCGCAGC AGGCCGAGTG GGAACGCCAG TACGCCGAGG CGCGCGCCCG GTTCGAGGCG
CACCGCGCCC AGGTTGCCAA GATGCGCCAG GCGCAGGCAC AGGCGGAGGC GGAGCAGGCG
ACGACATCGT CTGACGGAGG TGCGGCCGAG ACGCCGGCTG AGCCGAGCGG CAGTCTCGCC
GGTGACGAGC AGCTCGCCGA GCTGCGACGC AAGCTTGCCG AGGATGCGGT CGAGCAGCCC
GCCGCCGCGG ATGCGAACCC GGACGAAGAG TCGAAGGATC CCGCCTGA
 
Protein sequence
MTTATDVRPE TPEPPASSRI AVNDVGSTDD FIAAIDKTIK YFNDGDIVEG TVVKVDRDEV 
LLDIGYKTEG VIPSRELSIK HDVDPHEVVK VGDRIEALVL QKEDKDGRLI LSKKRAQYER
AWGTIEQIKE ADGVVTGTVI EVVKGGLILD IGLRGFLPAS LVEMRRVRDL APYVGRKLEA
KIIELDKNRN NVVLSRRAYL EQTQSEVRQT FLTTLKKGQI RNGVVSSIVN FGAFVDLGGV
DGLVHVSELS WKHIDHPSEV VEVGMPVTVE VLDVDLDRER VSLSLKSTQE DPWQLFARTH
SLGQVVPGKV TKLVPFGAFV RVEDGIEGLV HISELSDRHV EIPEQVVQVG DEIFVKIIDI
DLERRRISLS LKQALEGVES TSPEQFDPTV YGMPASYDEH GNYIYPEGFD PETNQWLPGH
EAQQAEWERQ YAEARARFEA HRAQVAKMRQ AQAQAEAEQA TTSSDGGAAE TPAEPSGSLA
GDEQLAELRR KLAEDAVEQP AAADANPDEE SKDPA