Gene NATL1_04021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04021 
Symbolrps1a 
ID4781090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp371332 
End bp372441 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content38% 
IMG OID640083671 
Product30S ribosomal protein S1 
Protein accessionYP_001014231 
Protein GI124025115 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.337887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA ATCCAGCAAG CAAAATTGAA GAAAAAAATC CTGAAAAAGA GACATCTATA 
CCTGAAGAAA CTGTTTCAAA TTCCACAAGT GCAGAGTTTG AAGAAAATTC AATTAGTGAA
TTAAAAGAAG ACGATATTCC CAAAAACATT CCTGCTGCTG ATGACTCTTC AAGCAGAATT
AATAAGAGTG ATCTTGAAAG TGCAGGTTTC ACACTTGATG AATTCGCATC TTTACTAAGT
AAATACGACT ACAATTTTAA ACCTGGTGAC ATAGTCAATG GAACAGTTTT TGCTCTTGAA
TCGAAAGGAG CAATGATTGA CATTGGAGCG AAAACAGCTG CTTTTATGCC TATGCAAGAA
GTCTCAATAA ATAGGGTCGA GGGTCTGAGT GATGTTTTAC AGCCCTCAGA AATTAGAGAA
TTTTTTATAA TGACTGAGGA AAATGAGGAT GGTCAATTAT CCTTATCTAT CAGGAGAATT
GAGTATCAAC GAGCTTGGGA AAGAGTTAGA CAATTACAAA AAGAAGATGC AACAATTTAT
TCCGAGGTTT TTGCTACAAA TAGAGGCGGT GCACTTGTTC GAGTTGAAGG GCTCAGAGGC
TTTATTCCTG GATCACATAT AAGCACTAGA AAGGCGAAAG AAGAACTTGT TGCTGATTTC
TTGCCATTGA AATTCTTAGA AGTTGATGAA GAAAGGAATA GGCTTGTTTT AAGTCATCGC
AGGGCTTTAG TCGAAAGAAA AATGAATCGC CTTGAAGTTG GAGAAGTTGT TGTAGGAGCA
GTCAGAGGAA TTAAACCTTA TGGAGCATTT ATAGATATTG GTGGCGTAAG TGGACTTCTT
CACATCTCTG AAATAAGCCA TGAGCATATT GAAACTCCTC ACTCCGTATT AAATGTCAAT
GATCAAATGA AGGTCATGAT TATTGATCTA GACGCTGAAA GAGGAAGAAT TTCTCTATCG
ACGAAAGCGC TTGAACCAGA ACCTGGAGAC ATGCTGACTG ATCCTCAAAA AGTTTTTGAC
AAGGCTGAAG AGATGGCAGC AAGATACAAA CAAATGCTTC TTGAGCAAGC AGAAGAAGGA
GAAGATCCTA TTGCAGTAAT GACTATTTGA
 
Protein sequence
MSENPASKIE EKNPEKETSI PEETVSNSTS AEFEENSISE LKEDDIPKNI PAADDSSSRI 
NKSDLESAGF TLDEFASLLS KYDYNFKPGD IVNGTVFALE SKGAMIDIGA KTAAFMPMQE
VSINRVEGLS DVLQPSEIRE FFIMTEENED GQLSLSIRRI EYQRAWERVR QLQKEDATIY
SEVFATNRGG ALVRVEGLRG FIPGSHISTR KAKEELVADF LPLKFLEVDE ERNRLVLSHR
RALVERKMNR LEVGEVVVGA VRGIKPYGAF IDIGGVSGLL HISEISHEHI ETPHSVLNVN
DQMKVMIIDL DAERGRISLS TKALEPEPGD MLTDPQKVFD KAEEMAARYK QMLLEQAEEG
EDPIAVMTI