Gene A9601_05221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05221 
Symbol 
ID4717220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp454004 
End bp455704 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content29% 
IMG OID640078234 
Productsecreted protein MPB70 precursor 
Protein accessionYP_001008917 
Protein GI123968059 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTA CATCTATTAG ATCTGTCTTG CATTATTTGA CACAGAACAT CTTACCTACA 
AAGTTTGAGA CTGCCCAACA ACCAGAGCAT AGTACAATTC AATTATGTTT CAGAGGAGTT
GATTCTCAAA CATGGTTAGA AGTTTCATGG AATGGGGACT CTCCAAGAAT ACTAAAAATA
AATAAGCCAG AAAAGATTGG GAGAGAAAGC ACACTTTCTA AACAAATAAG ATACGGATTA
AAGTATATGG CTTTAGTTTC GATTGATCAA GATGATTTCG AGAGAGTTAT AAAATTTAGT
TTTGCGAAAA AACCTGGAGA TGAAATTAAT AAGTATTTAA TTTTTGAATT AATGGGAAAA
CATAGTAATA TTTTTTATTT AGATAATAAA CATAAAATAA TTGCCGTTGG TAAACAAATC
AAATCAAGTC AATCTAGTTT TAGAACAATT TCAACAGGAT CAATTTATTC TGGCCCTCCA
GTCAATCTTA AAAAACAACC TAGAGAAGAT GAGTCTTTTC AATCATGGAA AGACTCAATT
TCAATAGTAC CTGAGTCTTT GAAATACTGT TTAATAAATA CATATCAAGG AGTAAGCCCT
ATCCTCACAA AACAATTAGA GGTTATTAGC GCAACTGTTA ATTCAGAAAT AATGGGAAAA
AATATTGATT TCATTAGCAA CTCAGACTTA ATGGAGATAT TTAAAAATTG GAAGATTTGG
ATAAACAGGT TTAAAAACAA TAACTTTAAT TTTTCTACAT TCAACAAAGA TTTTTATTGC
GTTTGGTTTT TTAATAAGGA AATTAATTCC GAAAATAAAA TAGATTTATG CACAAGCTTA
GAGAATTATT ATGATTTTCA TCTGAAACAA AAAAAACTTG AATTATTGGA AAAGAAAATT
GAAGGGATAA TTTTTAAACA GACCATTACT GAGAAAAAGA ATTTAAATAT TCAATATGAT
CTTCTGTCAA AATCAGAAAA CTACGAGACA TATAAAGAAA AAGCTGATAA TATATTTGCT
TCACATGAAA TTAAAAAACA AGACATTATA AAGGGGCAAA AACTATATAA AAAATCAAAA
AAACTTAAGA GATCTAGAGA ATTAGTAAAA GAAAGATTAA ATATTTACAA AACAAACATA
GAGAGATTAG ACGAATTCAC TACGCTTCTA GAAAATTTAA ATTCTTTAAA TCATGAAAAA
CTTTCTATGA GAATAAAACT ATTAGAAGAA ATTATGGAAG AAATTTGTAA CGAGTTTAAT
ATCAATATTA AGAAGCAAAG AGAAGATCAG AAAAGGACAT ATGAGATAGA GTCTTCACCA
ATTCAAATTA ACACTCCCAC AGGATTAAAG CTTCAGGTAG GGCGAAATAT GAGGCAAAAT
GATTTAATAA GCTTTAAATT CTCAAAAAAA GGCGATTTAT GGTTTCATGC ACAGGAATCA
CCAGGCAGTC ATGTAGTTTT GAAGTCTTCA TCTCAAGTAG CATCTGAACA AGATCTTCAA
ATAGCTGCAG ATTTAGCTGC TCTATTTAGT AAGGCAAAAA GAAACATTAA AGTTCCAATT
AATTTAGTAA AGATTAAAGA TTTACAAAAA ATCAAAAACG GAGGACCTGG ATGCGTTTCC
TTTAAAAATG GAGAAATTAT TTGGGGAAAT CCTACAAGAG GAGAAGATTA CATTAAAAAA
AATCTTAAAA CAGTAATTTA G
 
Protein sequence
MDITSIRSVL HYLTQNILPT KFETAQQPEH STIQLCFRGV DSQTWLEVSW NGDSPRILKI 
NKPEKIGRES TLSKQIRYGL KYMALVSIDQ DDFERVIKFS FAKKPGDEIN KYLIFELMGK
HSNIFYLDNK HKIIAVGKQI KSSQSSFRTI STGSIYSGPP VNLKKQPRED ESFQSWKDSI
SIVPESLKYC LINTYQGVSP ILTKQLEVIS ATVNSEIMGK NIDFISNSDL MEIFKNWKIW
INRFKNNNFN FSTFNKDFYC VWFFNKEINS ENKIDLCTSL ENYYDFHLKQ KKLELLEKKI
EGIIFKQTIT EKKNLNIQYD LLSKSENYET YKEKADNIFA SHEIKKQDII KGQKLYKKSK
KLKRSRELVK ERLNIYKTNI ERLDEFTTLL ENLNSLNHEK LSMRIKLLEE IMEEICNEFN
INIKKQREDQ KRTYEIESSP IQINTPTGLK LQVGRNMRQN DLISFKFSKK GDLWFHAQES
PGSHVVLKSS SQVASEQDLQ IAADLAALFS KAKRNIKVPI NLVKIKDLQK IKNGGPGCVS
FKNGEIIWGN PTRGEDYIKK NLKTVI