Gene Anae109_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2301 
SymbolrpsA 
ID5376103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2606604 
End bp2608340 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content66% 
IMG OID640843820 
Product30S ribosomal protein S1 
Protein accessionYP_001379487 
Protein GI153005162 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.75877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCGA CAAAGGTTGG TCTTTACCGA ATGGAGAACC AGACTCCCCG TACGCCCGAA 
ATCGAGACCG AGGACTTCGC GACGCTCTTC GCGCAGAGCG AGGCGCAGGC CGGTCCCATC
GAGGAGGGCA AGGTCGTCCC TGGCACCGTC ATCGCGCTCT CCAAGGACTA CGCCGTCATC
GACATCGGCT ACAAGTCCGA GGGCCAGGTG CCGATCTCCG AGTTCATCTC GGCGCCCGGC
GGCGAGCCCG CCGTCAAGGT CGGCGACAAG GTCGAGGTGC TCGTCGAGTC CCGGGAGAAC
GACACCGGGA TGGTCGTCCT CTCCAAGGAG AAGGCCGACA AGATGCGCAT CTGGGACGAG
ATCTCCGCCG CGTGCGAGCG CGACGAGCTC GTCGAGGGCG TGATCGTCGG CCGCGTGAAG
GGCGGCCTCT CCGTCGACAT CGGCGTGAAG GCGTTCCTCC CCGGCTCGCA GGTCGACATC
CGGCCGGTGC GGAACCTCGA CAAGCTGATC GGCGAGAAGT TCAAGTTCAA GGTCATCAAG
TTCAACAAGA AGCGCGGCAA CATCGTCCTG TCGCGCCGGG TCCTCCTCGA GAAGGAGCGC
GAGGAGCTCA AGAAGGAGAC CCTGAAGAAC TTGAAGGAGG GCGCGATCCT CAAGGGCCAG
GTGAAGAACC TGACCGACTA CGGCGCCTTC ATCGACCTCG GCGGCATCGA CGGCCTGCTC
CACGTCACCG ACATGAGCTG GGGCCGCATC GGCCACCCGT CGGAGATGTT CGAGGTCGGC
CAGGAGGTCC GCGTCGTCGT CCTCAAGTTC GACCCGGCCT CCGAGCGCGT CAGCCTCGGC
CTCAAGCAGA TCCAGGAGGA CCCGTGGCAC CGCGCCGACG AGAAGTATCC GGTCGGCACG
CGCGTCAGGG GCAAGGTCGT CTCGCTCACC GACTACGGCG CGTTCATCGA GCTCGAGCAG
GGCGTCGAGG GCCTCGTCCA CGTCAGCGAG ATGAGCTGGA CGAAGCGGGT GAAGCACCCG
TCGAAGCTCG TCAACCAGGG CGATCAGGTC GAGGCGGTCG TGCTCGACAT CGATCCGAAG
GCGAAGCGGA TCAGCCTCGG CATGAAGCAG ATCGAGGCGA ACCCGTGGAC GCTGCTCGAG
GACAAGTATC CCATCGGCAC GACCATCCGC GGCGAGGTCC GGAACGTCAC CGACTTCGGC
GTGTTCGTCG GCGTCGAGGA GGGCATCGAC GGCCTCGTGC ACGTGTCCGA CATCTCCTGG
ACCGAGCGCA TCAAGCACCC GGGCGACAAG TTCAAGAAGG GTGACGTGGT CGAGGCGGTG
GTGCTCAACA TCGACGTCGA GAACGAGCGC TTCAGCCTCG GCATCAAGCA GGCCCACGTC
GATCCCTGGA CGACGCTCTC CGAGCGCCAC CCGGTGGGCG CGCGCGTGAA GGGCAAGGTG
ACGAAGGTCA CCGACTTCGG CGCGTTCGTC GAGATCGAGC CGGGCATCGA GGGCCTCGTC
CACGTCTCCG AGATGAAGGA CGAGCGCGTC GAGAACCCCC GCGACGTCGT GACCGAGGGC
CAGGAGGTCG AGGTCAAGGT CATCGACATG GACCTCCACG AGCGCAAGAT CGCGCTGTCG
ATCAAGCAGC TCAACCGCGA CGGCGGCGAG GAGGACTACC GCGAGTACCT GCGCCGTCAG
GGCGACGGCC GCGCGCGGCT GGGGGACCTC ATGGAGAAGT TCAACCGCAG GAAGTAG
 
Protein sequence
MIPTKVGLYR MENQTPRTPE IETEDFATLF AQSEAQAGPI EEGKVVPGTV IALSKDYAVI 
DIGYKSEGQV PISEFISAPG GEPAVKVGDK VEVLVESREN DTGMVVLSKE KADKMRIWDE
ISAACERDEL VEGVIVGRVK GGLSVDIGVK AFLPGSQVDI RPVRNLDKLI GEKFKFKVIK
FNKKRGNIVL SRRVLLEKER EELKKETLKN LKEGAILKGQ VKNLTDYGAF IDLGGIDGLL
HVTDMSWGRI GHPSEMFEVG QEVRVVVLKF DPASERVSLG LKQIQEDPWH RADEKYPVGT
RVRGKVVSLT DYGAFIELEQ GVEGLVHVSE MSWTKRVKHP SKLVNQGDQV EAVVLDIDPK
AKRISLGMKQ IEANPWTLLE DKYPIGTTIR GEVRNVTDFG VFVGVEEGID GLVHVSDISW
TERIKHPGDK FKKGDVVEAV VLNIDVENER FSLGIKQAHV DPWTTLSERH PVGARVKGKV
TKVTDFGAFV EIEPGIEGLV HVSEMKDERV ENPRDVVTEG QEVEVKVIDM DLHERKIALS
IKQLNRDGGE EDYREYLRRQ GDGRARLGDL MEKFNRRK