Gene Nmul_A2069 details


Gene Information

Locus tag: Nmul_A2069
Symbol: rpsA
ID: 3784387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2359041
End bp: 2360756
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 54%
IMG OID: 637812158
Product: 30S ribosomal protein S1
Protein accession: YP_412755
Protein GI: 82703189
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
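
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2359041
end_bp = 2360756

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1716 571
```

Under these assumptions the computed values agree with the listed Gene Length (1716 bp) and Protein Length (571 aa).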


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000607618
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACTG TTTCCCCCGC TGTCGACTCT TCTGAAAGTT TCGCAGCGTT ATTTGAAGAA 
AGCCTTTCCC GTCAGGAAAT GCGTATCGGT GAAGTCATCA CCGCGCAGGT CGTCCGGGTC
GATTACAACA TCGTAGTCGT AAACGCCGGG CTCAAATCAG AGAGTTTCAT CCCCGTCGAC
GAGTTCAAGA ACGACAAGGG CGAAGTTGAA GCCAAGCCCG GCGATTTTGT CAGCGTCGCC
ATCGAGGCGC TGGAAGATGG CTATGGCGAA ACCCGCCTGT CACGGGACAA AGCAAAGCGC
CTTACCGCCT GGCATGACCT GGAGGCTGCA ATGGAAAGCG GCGCCATCGT ATCGGGTGTC
GTGAGCGGCA AGGTCAAAGG TGGATTGACT GCCATGATCA ATGGTATTCG CGCCTTTCTG
CCCGGCTCGC TGGTGGATAT CAGGCCGGTC AAGGATACGA CTCCTTACGA AAACAAGGAG
ATGGAATTCA AGGTTATCAA ACTTGACCGG AAGCGAAACA ACGTGGTGGT ATCTCGCCGT
GCAGTGCTGG AAGAAACCCA GGGGGCTGAC CGCCAGACGT TGCTCGCCAA TCTGACCGAA
GGCGCGATCG TCAAGGGTAT TGTCAAGAAT ATTACCGATT ACGGCGCATT CGTGGATCTG
GGGGGCATAG ACGGCTTGCT GCACATTACC GATCTTGCGT GGCGGCGGGT AAAGCACCCC
TCCGAGGTCA TCAGTGTCGG TGATGAAGTA ACCGCGAAAG TCCTCAAATT CGATCAGGAA
AAAAACCGCG TTTCACTGGG TATGAAGCAA TTGACGGAAG ATCCATGGGT AGGATTGTCG
CGGCGGTATC CGCCCCATAC CCGCTTGTTC GGCAAGGTCA GCAACCTTAC CGATTACGGC
GCGTTTGTTG AAATCGAGCA AGGCATTGAA GGCCTCGTGC ATGTCTCCGA AATGGATTGG
ACCAACAAGA ACGTGTACCC GTCCAAAGTT GTGCAATTGG GCGATGAAGT AGAAGTGATG
ATTCTTGAGA TCGATGAAGA GCGGCGTCGC ATTTCGCTCG GCATGAAGCA GTGCAAAGTG
AATCCCTGGG AAGATTTCGC CATGAATCAT CAAAAAGGCG ACAAGGTGCG AGGTCAGATC
AAATCCATTA CGGATTTCGG CGTCTTTATC GGGCTGCAAG GCGGGATAGA CGGACTGGTG
CATCTTTCCG ATCTTTCCTG GAATCAGCCG GGGGAAGAAG CCGTGCGCAA TTACAAGAAG
GGTGACGAGG TCGAGGCGGT CGTGCTGTCC ATCGATGTGG AGCGCGAGCG CATCTCGCTT
GGCATCAAGC AATTGGAAGG CGATCCCTTC AACAGCTTTG TCTCGGTGCA TGACAAGAAC
AGTATTGTCA AAGGGACGGT AAAGTCGATT GATGCGAAGG GCGCCGTGAT TTCACTCGAG
AATGATGTCG AAGGCTACCT GCGCGCGTCA GAAGTGTCGC GCGACCGGGT TGAGGACATT
CGTTCCCACT TGAAGGAAGG TGATGTGGTT GAGGCGATGA TCATTAACGT CGATCGCAAA
AATCGCGGCA TTAACCTTTC GATCAAGGCG AAGGATATGG CGGAGGAATC CGACGCGATG
CAGAAGGTGG CAGGCGACGC ATCCGCCAGC GCGGGAACCA CCAGTCTGGG TGCTTTGCTC
AAGGCCAAGA TGGATGTTAA GAATACGGAA CAATAA
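
The gene sequence is printed in space-separated blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The following is a minimal sketch of one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here as a small sample.
sample = "ATGGCTACTG TTTCCCCCGC TGTCGACTCT TCTGAAAGTT TCGCAGCGTT ATTTGAAGAA"
seq = clean_sequence(sample)
print(len(seq))         # number of bases in the sample line
print(gc_percent(seq))  # GC percentage of the sample line only
```

Pasting the full gene sequence into `sample` would give values to compare with the Gene Length and GC content fields of the record.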
 
Protein sequence
MATVSPAVDS SESFAALFEE SLSRQEMRIG EVITAQVVRV DYNIVVVNAG LKSESFIPVD 
EFKNDKGEVE AKPGDFVSVA IEALEDGYGE TRLSRDKAKR LTAWHDLEAA MESGAIVSGV
VSGKVKGGLT AMINGIRAFL PGSLVDIRPV KDTTPYENKE MEFKVIKLDR KRNNVVVSRR
AVLEETQGAD RQTLLANLTE GAIVKGIVKN ITDYGAFVDL GGIDGLLHIT DLAWRRVKHP
SEVISVGDEV TAKVLKFDQE KNRVSLGMKQ LTEDPWVGLS RRYPPHTRLF GKVSNLTDYG
AFVEIEQGIE GLVHVSEMDW TNKNVYPSKV VQLGDEVEVM ILEIDEERRR ISLGMKQCKV
NPWEDFAMNH QKGDKVRGQI KSITDFGVFI GLQGGIDGLV HLSDLSWNQP GEEAVRNYKK
GDEVEAVVLS IDVERERISL GIKQLEGDPF NSFVSVHDKN SIVKGTVKSI DAKGAVISLE
NDVEGYLRAS EVSRDRVEDI RSHLKEGDVV EAMIINVDRK NRGINLSIKA KDMAEESDAM
QKVAGDASAS AGTTSLGALL KAKMDVKNTE Q
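
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below illustrates this on the first 30 bases. It relies on the fact that NCBI table 11 assigns amino acids to codons in the same way as the standard code and differs only in the set of permitted start codons; since this gene starts with ATG, no special start-codon handling is included. The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCTACTG TTTCCCCCGC TGTCGACTCT"))  # prints: MATVSPAVDS
```

The output matches the first ten residues of the protein sequence. A sequence with an alternative start codon such as GTG or TTG would need its first residue set to M explicitly, which this sketch does not do.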