Gene P9301_04911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_04911 
Symbol 
ID4912498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp428471 
End bp430171 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content29% 
IMG OID640160069 
Productsecreted protein MPB70 precursor 
Protein accessionYP_001090715 
Protein GI126695829 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.306879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTA CATCTATTAG ATCTGTCTTG CATTATTTGA CAAAGAACAT CTTACCTACA 
AAGTTTGAGA CTGCCCAACA ACCAGAGCCT AATACAATTC AATTATGTTT TAGAGGAGTT
GATTCTCAAA CATGGTTAGA AGTTTCATGG AATGGAGACT CCCCAAGAAT ACTAAAGATA
AATAAGCCAG AAAAGATTGG GAGAGAAAGC ACACTTTCTA AACAAATAAG ATACGGATTA
AAGTATATGG CTTTAATTTC GATTGATCAA GATGATTTCG AGAGAGTTAT TAAATTTAGT
TTTGCGAAAA AACCTGGAGA TGAAATTAAT AAGTATTTAA TTTTTGAATT AATGGGAAAA
CATAGTAATA TTTTTTATCT GGATAATAAA CATAAAATAA TTGCCGTTGG TAAACAAATT
AAATCAAGTC AATCTAGTTT TAGAACAATT TCAACAGGAT CAATTTATTC TGGCCCTCCA
GTCAATCTCA AAAAACAACC TAGAGAAGAT GAGTCTTTTC AATCATGGAA AGACTCAATT
TCAATAGTAC CTGAGTCTTT GAAATACTGT TTAATAAATA CCTATCAAGG AGTAAGCCCT
ATCCTCACAA AACAATTAGA GTTTGTTAGC GCAACTGTTA ATTCAGGAAT AATGGGAAAA
AATATTGATT TCATTAGCAA CTCAGACTTA AAGGAGATAT TTAAAAATTG GAAGATTTGG
ATAAACAGGT TTAAAAACAA TAACTTTAAT TTTTCTATAT TCAACAAAGA TTTTTATTGC
GTTTGGTTTT TTGATAAGGA AATTAATTTC GAAAATAAAA AAGATTTATG CACAAGCTTA
GAAAATTATT ATGATTATCA TCTGAAACAA AAAAAACTTG AATTATTGGA AAAGAAAATT
GAAGGGATAA TTTTTAAACA GTCCAATACT GAGAAAAAGA ATTTAAATAT TCAATCTGAT
CTTCTGACAA AATCAGAAAA CTACGAGAAA TATAAAGAAA AAGCTGATAA TATATTTGCC
TCACATGAAA TTAAAAAACA AGACATTATA AAGGGACAAA AACTATATAA AAAATCAAAA
AAACTAAAGA GATCTAGAGA ATTAATAAAA GAAAGATTAA GTATTTACAA AACAAATATA
GAGAGATTAG ACGAATTCAC AACGCTTCTA GAAAATTTAA ATTCTTTAAA TCATGAAAAA
CTTTCTATGA GAATCAAACT TCTAGAAGAA ATTATGGAAG AAATTTGTAA CGAGTTTAAT
ATCAATATCA AGAAGCAAAG AGAAGATCAG AAAAGTACAT ATGAGATAGA GTCTTCACCA
ATTCAAGTTG ACACTCCCAC AGGATTAAAG CTTCAGGTAG GGCGAAATAT GAGGCAAAAT
GATTTAATAA GCTTTAAATT CTCAAAAAAA GGCGATTTAT GGTTTCATGC ACAGGAATCA
CCAGGCAGTC ATGTAGTTTT GAAGTCTTCA TCTCAAGTAG CATCTGAACA AGATCTTCAA
ATAGCTGCAG ATTTAGCTGC TTTATTTAGT AAGGCAAAAA GAAACATTAA AGTTCCAATT
AATTTAGTAA GGATTAAAGA TTTACAAAAA ATCAAAAACG GAGGACCAGG TTGCGTTTCC
TTTAAAAATG GAGAAATTAT TTGGGGAAAT CCTACAAGAG GAGAAGATTA CATTAAAAAA
AATCTTAAAA CAGTAATTTA G
 
Protein sequence
MDITSIRSVL HYLTKNILPT KFETAQQPEP NTIQLCFRGV DSQTWLEVSW NGDSPRILKI 
NKPEKIGRES TLSKQIRYGL KYMALISIDQ DDFERVIKFS FAKKPGDEIN KYLIFELMGK
HSNIFYLDNK HKIIAVGKQI KSSQSSFRTI STGSIYSGPP VNLKKQPRED ESFQSWKDSI
SIVPESLKYC LINTYQGVSP ILTKQLEFVS ATVNSGIMGK NIDFISNSDL KEIFKNWKIW
INRFKNNNFN FSIFNKDFYC VWFFDKEINF ENKKDLCTSL ENYYDYHLKQ KKLELLEKKI
EGIIFKQSNT EKKNLNIQSD LLTKSENYEK YKEKADNIFA SHEIKKQDII KGQKLYKKSK
KLKRSRELIK ERLSIYKTNI ERLDEFTTLL ENLNSLNHEK LSMRIKLLEE IMEEICNEFN
INIKKQREDQ KSTYEIESSP IQVDTPTGLK LQVGRNMRQN DLISFKFSKK GDLWFHAQES
PGSHVVLKSS SQVASEQDLQ IAADLAALFS KAKRNIKVPI NLVRIKDLQK IKNGGPGCVS
FKNGEIIWGN PTRGEDYIKK NLKTVI