Gene P9515_05301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_05301 
Symbol 
ID4720043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp473274 
End bp474974 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content29% 
IMG OID640080205 
Productsecreted protein MPB70 precursor 
Protein accessionYP_001010846 
Protein GI123965765 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTA CATCTATAAA ATCTGTGTTG CATGAATTAT CAGAAGAAAT TCTACCCTCA 
AGGTTTGAAA CCGCTCAACA ACCCGAACCT AATATAATTC AATTCTGTTT AAGAGGAATA
AATAGCCAAA CTTGGATAGA AGTATCTTGG AGCAGTGATT CTGCACGAAT ACTTAAAATA
AATAGGCCTG AAAAATTAGG AGCAGAAAGT ACACTCTCAA AACAATTGAG ATTTGGCTTG
AAATATATGG CTTTAGTAAC GATAAAGCAA GATAAATTTG AGAGGGTTAT TAAATTTGGA
TTTGCGAAAA AGCCTGGAGA TGAGATAAAC AAGTACTTAA TTTTTGAACT AATGGGTAAA
CATAGTAATA TTTTTTACTT AGATAATAAT CAAAAAATAA TTGCAGTTGG CAAACAAATA
AACTCTAATC AATCTAGCTT TCGTACCGTT TCTACAGGCT CTATCTATTC AGAGCCTCCC
AAAAACATGA AAAAGGAGCC CAGTCCAGAC GAATCTTATG AAAATTGGAA AGAGTCTATA
TCTTCAGTTC CAGAAACTCT TAAATACTGC TTAATAAATA CATATCAAGG TGTAAGTCCA
ATATTAACGA AGCAACTTGA GGCTTTTAGT AATCTAGGTA GTTCAGAAAT AATGAATAAA
AATATTGATT TCATAAGTGA AGTCAACTTA AAAAAGATAT ACCAAAGTTG GAAAATATGG
ATTGATAGGT TTAATAATAA CAATTTTAAT TTTTCAATTT TTGACAGTTT TTTTTATTCA
GTTTGGTTTT TAAAGAGTGA AATAAATAAC ATCAATAATA TTGATGAAAT TAATGGATTA
GAAAACTATT ATGATTTTTA TCTAAAACAA AGAAAAATTG ACGCCTTAAT AAAAAAAATT
GACGGAATAA TTTTTAAACA AACAAATCTT GAGAAAAAGA ATTTTAATTT GCAATCTGAT
CTTTTGATTA ACTCCGAAAA TTATCAAATA TATAAAGAAA AAGCTGACAA AATATTTATG
ACTCATGAGA TACAAAAAAA AGATATTATC AAAGGCCAAA AGCTTTATAA AAAGTCAAAA
AAGCTAAAAA GAGCACAAAA TTTAATCAAA GAGAGGATTG ATATTTATAA AAACAAGCTT
GATAGATTAG AGGAATTTAG TGCTCTAGTG GATAACTTAA ATTCTTTAAA AAATGAAAAT
CTGACAATTA GAATCAATTT ACTGGAAGAA ATTAAAGATG AAATTTGTAG AGAGTTTAAT
GTGAGAACCA GAAATACAAG ATCGCAAAAA AAAGACTTGT CTAGATTAGA GTCAGCCCCA
ATAGAAATAA ATACTCCTAA AGGCTTAAAA ATACAAATAG GAAGAAATAT GAGGCAAAAT
GATCTAATAA GTTTTAAGTT TTCAAAAAAA GGGGATCTTT GGTTCCATGC ACAAGAATCA
CCTGGAAGTC ATGTTGTTTT AAAGTCCTCA TCTCAAAAAC CTTCAGATGA AGATATTCAA
ATATCCGCAG ATTTAGCAGC TTTATTTAGT AAAGCAAAAA TGAATATTAA AGTCCCTATA
AGTCTTGTAA ATATAAAAGA CCTCCAAAAA ATAACTAAAG GCGGTCCTGG CTGTGTTTCG
TTCCACAATG TAGAAATTAT TTGGGGTAAT CCTACAAGAG GAAAAGATTA CATTAAAAAA
AATCTTAAAA GACTAATTTA G
 
Protein sequence
MDITSIKSVL HELSEEILPS RFETAQQPEP NIIQFCLRGI NSQTWIEVSW SSDSARILKI 
NRPEKLGAES TLSKQLRFGL KYMALVTIKQ DKFERVIKFG FAKKPGDEIN KYLIFELMGK
HSNIFYLDNN QKIIAVGKQI NSNQSSFRTV STGSIYSEPP KNMKKEPSPD ESYENWKESI
SSVPETLKYC LINTYQGVSP ILTKQLEAFS NLGSSEIMNK NIDFISEVNL KKIYQSWKIW
IDRFNNNNFN FSIFDSFFYS VWFLKSEINN INNIDEINGL ENYYDFYLKQ RKIDALIKKI
DGIIFKQTNL EKKNFNLQSD LLINSENYQI YKEKADKIFM THEIQKKDII KGQKLYKKSK
KLKRAQNLIK ERIDIYKNKL DRLEEFSALV DNLNSLKNEN LTIRINLLEE IKDEICREFN
VRTRNTRSQK KDLSRLESAP IEINTPKGLK IQIGRNMRQN DLISFKFSKK GDLWFHAQES
PGSHVVLKSS SQKPSDEDIQ ISADLAALFS KAKMNIKVPI SLVNIKDLQK ITKGGPGCVS
FHNVEIIWGN PTRGKDYIKK NLKRLI