Gene Information |
Locus tag | A9601_05221 |
Symbol | |
ID | 4717220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 454004 |
End bp | 455704 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640078234 |
Product | secreted protein MPB70 precursor |
Protein accession | YP_001008917 |
Protein GI | 123968059 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
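The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(454004, 455704)
print(length, protein_length_aa(length))  # prints "1701 566"
```

Both results match the Gene Length (1701 bp) and Protein Length (566 aa) rows.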
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTA CATCTATTAG ATCTGTCTTG CATTATTTGA CACAGAACAT CTTACCTACA AAGTTTGAGA CTGCCCAACA ACCAGAGCAT AGTACAATTC AATTATGTTT CAGAGGAGTT GATTCTCAAA CATGGTTAGA AGTTTCATGG AATGGGGACT CTCCAAGAAT ACTAAAAATA AATAAGCCAG AAAAGATTGG GAGAGAAAGC ACACTTTCTA AACAAATAAG ATACGGATTA AAGTATATGG CTTTAGTTTC GATTGATCAA GATGATTTCG AGAGAGTTAT AAAATTTAGT TTTGCGAAAA AACCTGGAGA TGAAATTAAT AAGTATTTAA TTTTTGAATT AATGGGAAAA CATAGTAATA TTTTTTATTT AGATAATAAA CATAAAATAA TTGCCGTTGG TAAACAAATC AAATCAAGTC AATCTAGTTT TAGAACAATT TCAACAGGAT CAATTTATTC TGGCCCTCCA GTCAATCTTA AAAAACAACC TAGAGAAGAT GAGTCTTTTC AATCATGGAA AGACTCAATT TCAATAGTAC CTGAGTCTTT GAAATACTGT TTAATAAATA CATATCAAGG AGTAAGCCCT ATCCTCACAA AACAATTAGA GGTTATTAGC GCAACTGTTA ATTCAGAAAT AATGGGAAAA AATATTGATT TCATTAGCAA CTCAGACTTA ATGGAGATAT TTAAAAATTG GAAGATTTGG ATAAACAGGT TTAAAAACAA TAACTTTAAT TTTTCTACAT TCAACAAAGA TTTTTATTGC GTTTGGTTTT TTAATAAGGA AATTAATTCC GAAAATAAAA TAGATTTATG CACAAGCTTA GAGAATTATT ATGATTTTCA TCTGAAACAA AAAAAACTTG AATTATTGGA AAAGAAAATT GAAGGGATAA TTTTTAAACA GACCATTACT GAGAAAAAGA ATTTAAATAT TCAATATGAT CTTCTGTCAA AATCAGAAAA CTACGAGACA TATAAAGAAA AAGCTGATAA TATATTTGCT TCACATGAAA TTAAAAAACA AGACATTATA AAGGGGCAAA AACTATATAA AAAATCAAAA AAACTTAAGA GATCTAGAGA ATTAGTAAAA GAAAGATTAA ATATTTACAA AACAAACATA GAGAGATTAG ACGAATTCAC TACGCTTCTA GAAAATTTAA ATTCTTTAAA TCATGAAAAA CTTTCTATGA GAATAAAACT ATTAGAAGAA ATTATGGAAG AAATTTGTAA CGAGTTTAAT ATCAATATTA AGAAGCAAAG AGAAGATCAG AAAAGGACAT ATGAGATAGA GTCTTCACCA ATTCAAATTA ACACTCCCAC AGGATTAAAG CTTCAGGTAG GGCGAAATAT GAGGCAAAAT GATTTAATAA GCTTTAAATT CTCAAAAAAA GGCGATTTAT GGTTTCATGC ACAGGAATCA CCAGGCAGTC ATGTAGTTTT GAAGTCTTCA TCTCAAGTAG CATCTGAACA AGATCTTCAA ATAGCTGCAG ATTTAGCTGC TCTATTTAGT AAGGCAAAAA GAAACATTAA AGTTCCAATT AATTTAGTAA AGATTAAAGA TTTACAAAAA ATCAAAAACG GAGGACCTGG ATGCGTTTCC TTTAAAAATG GAGAAATTAT TTGGGGAAAT CCTACAAGAG GAGAAGATTA CATTAAAAAA AATCTTAAAA CAGTAATTTA G
|
Protein sequence | MDITSIRSVL HYLTQNILPT KFETAQQPEH STIQLCFRGV DSQTWLEVSW NGDSPRILKI NKPEKIGRES TLSKQIRYGL KYMALVSIDQ DDFERVIKFS FAKKPGDEIN KYLIFELMGK HSNIFYLDNK HKIIAVGKQI KSSQSSFRTI STGSIYSGPP VNLKKQPRED ESFQSWKDSI SIVPESLKYC LINTYQGVSP ILTKQLEVIS ATVNSEIMGK NIDFISNSDL MEIFKNWKIW INRFKNNNFN FSTFNKDFYC VWFFNKEINS ENKIDLCTSL ENYYDFHLKQ KKLELLEKKI EGIIFKQTIT EKKNLNIQYD LLSKSENYET YKEKADNIFA SHEIKKQDII KGQKLYKKSK KLKRSRELVK ERLNIYKTNI ERLDEFTTLL ENLNSLNHEK LSMRIKLLEE IMEEICNEFN INIKKQREDQ KRTYEIESSP IQINTPTGLK LQVGRNMRQN DLISFKFSKK GDLWFHAQES PGSHVVLKSS SQVASEQDLQ IAADLAALFS KAKRNIKVPI NLVKIKDLQK IKNGGPGCVS FKNGEIIWGN PTRGEDYIKK NLKTVI
|
| |
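The sequences above are printed in space-separated blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown above.
sample = "ATGGATATTA CATCTATTAG"
seq = clean_sequence(sample)
print(len(seq), gc_percent(seq))  # prints "20 25.0"
```

Running the same two functions on the complete Gene sequence row gives a value that can be compared with the GC content field of the record.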