Gene PMN2A_1689 details


Gene Information

Locus tag: PMN2A_1689
Symbol:
ID: 3607091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 356372
End bp: 357481
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 38%
IMG OID: 637688572
Product: 30S ribosomal protein S1
Protein accession: YP_292880
Protein GI: 72383525
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
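
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the listed numbers but is not stated in the record; the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 356372
end_bp = 357481

# Assumption: both endpoints are part of the gene (inclusive coordinates).
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and encodes no residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp")     # 1110 bp
print(protein_length, "aa")  # 369 aa
```

A remainder of zero confirms that the gene length is a whole number of codons.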


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.781588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAA ATCCAGCAAG CAAAATTGAA GAAAAAAATC CTGAAAAAGA GACATCTATA 
CCTGAAGAAA CTGTTTCAAA TGCCACAATT GCAGAGTTTG AAGAAAATTC AATTACTGAA
TTAAAAGAAG ACGATATTCC AAAAAACATT CCTGCTGCTG ATGACTCTTC AAGCAGAATT
AATAAGAGTG ATCTTGAAAC TGCAGGTTTC ACACTTGATG AATTTGCATC TTTACTAAGT
AAATACGACT ACAATTTTAA ACCTGGTGAC ATCGTCAATG GAACAGTTTT TGCTCTTGAA
TCGAAAGGAG CAATGATTGA CATCGGAGCG AAAACAGCTG CTTTTATGCC TATGCAAGAA
GTCTCAATAA ATAGAGTCGA GGGTCTGAGT GATGTTTTAC AGCCCTCAGA AATTAGAGAA
TTTTTTATAA TGACTGAGGA AAATGAGGAT GGCCAATTAT CCTTATCTAT CAGGAGAATT
GAATATCAAC GAGCTTGGGA AAGAGTTAGA CAATTACAAA AAGAAGATGC AACAATTTAT
TCCGAGGTTT TTGCTACAAA TAGAGGCGGT GCACTTGTTC GAGTTGAAGG GCTCAGAGGA
TTTATTCCTG GATCACACAT AAGCACTAGA AAGGCGAAAG AAGAACTAGT TGCTGATTTC
TTGCCATTGA AATTCTTAGA AGTTGATGAA GAAAGGAATA GGCTTGTTTT AAGTCATCGC
AGGGCTTTAG TCGAGAGAAA AATGAATCGC CTTGAAGTTG GAGAAGTTGT TGTAGGAGCA
GTCAGAGGAA TTAAACCTTA TGGTGCATTT ATAGACATTG GTGGCGTAAG TGGACTTCTT
CACATCTCTG AAATAAGCCA TGAGCATATT GAAACTCCTC ACTCCGTATT AAATGTCAAT
GATCAAATGA AGGTCATGAT TATTGATCTA GACGCTGAAA GAGGAAGAAT TTCTCTATCG
ACGAAGGCGC TTGAACCAGA ACCTGGAGAC ATGCTGACTG ACCCTCAAAA AGTTTTTGAT
AAGGCTGAAG AGATGGCAGC GAGATACAAA CAAATGCTTC TTGAGCAAGC AGAAGAAGGA
GAAGATCCTA TTGCAGTAAT GACTATTTGA
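
A GC content figure such as the one in the Gene Information section is typically the fraction of G and C bases in the gene sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks like the one above. The function name `gc_content` is hypothetical, and how the source database rounds the value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    in the 10-base blocks used in this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "ATGTCTGAAA"
print(f"{gc_content(first_block):.0%}")
```

To check the whole gene, pass the full gene sequence text to `gc_content` in place of `first_block`.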
 
Protein sequence
MSENPASKIE EKNPEKETSI PEETVSNATI AEFEENSITE LKEDDIPKNI PAADDSSSRI 
NKSDLETAGF TLDEFASLLS KYDYNFKPGD IVNGTVFALE SKGAMIDIGA KTAAFMPMQE
VSINRVEGLS DVLQPSEIRE FFIMTEENED GQLSLSIRRI EYQRAWERVR QLQKEDATIY
SEVFATNRGG ALVRVEGLRG FIPGSHISTR KAKEELVADF LPLKFLEVDE ERNRLVLSHR
RALVERKMNR LEVGEVVVGA VRGIKPYGAF IDIGGVSGLL HISEISHEHI ETPHSVLNVN
DQMKVMIIDL DAERGRISLS TKALEPEPGD MLTDPQKVFD KAEEMAARYK QMLLEQAEEG
EDPIAVMTI
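
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 30 bases of the gene. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons); the function name `translate` and the table construction are illustrative, not the source database's method.

```python
# Standard codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGTCTGAAA ATCCAGCAAG CAAAATTGAA"))  # MSENPASKIE
print(translate("GAAGATCCTA TTGCAGTAAT GACTATTTGA"))  # EDPIAVMTI
```

The last codon of the gene, TGA, is a stop codon, which is why the protein ends at the residue before it.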