Gene Information |
Locus tag | P9303_06671 |
Symbol | |
ID | 4776525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 627414 |
End bp | 629201 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640086174 |
Product | secreted protein MPB70 precursor |
Protein accession | YP_001016683 |
Protein GI | 124022376 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
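
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a single stop codon not counted in the protein length; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 627414
end_bp = 629201
reported_gene_length = 1788    # bp
reported_protein_length = 595  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not a residue.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions, both computed values agree with the reported Gene Length and Protein Length.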
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCATTGCA TTGAATCGGA TCGCAGAGTA GGAAGGCTGC GTGCAATCCA GGTCCCGACT GAATCGATGG CCCCTGCTGC CCTCCAGGTC ATGGACCTCT CAAGCTTGCG CGCTGTACTT AGCGAGCTCC GCAAGGAGGT GCTGCCAAGC CGCTTCGAAA AGGTCCAACA ACCAGAACCT CATACCCTCC AGCTCGGGCT GCGCACACTC AAAGGTCTTG TGTGGCTGGA ACTCAGCTGG CGTGCTGATT GTCCCCGATT GGTCAAGATC ACCCCTCCCC CCCGCCTAGG CAGCGGCAGC ACTTTGGCCC AGCAGATTCA ACATGGCCTT CGACAGATGG CGCTCATTGA ACTGAAACAG AAAGGATTTG ATCGCGTGGT GGAGTGCGGC CTCGCCCACC GCCCTGGCGA ACCTATCGAG CGCACTCTGG TGCTGGAACT GATGGGACGC CACAGCAACC TGCTTCTACT TGATCGTCAA CGCCAGGTGA TCACCCTTGG ACGCCAAGTA CGCAACCATC AATCAAGAGT GAGACCAATC GGAACTGCAG ACATCTATGT CGCTCCACCG CCGATGCAAG GCAGAGAACC AAGCTCAAAG GAATCCCTAA AACGCTGGAA AGACATTCTC TGCCTTGTGC CGACCAAGCT GCGTAGCGCT CTGCAGCAGT GTTACCAAGG CATCAGTCCT GCACTCGCTT TGCAGCTGGT TGACGACGAT GCAAAAAAAG CACATGCCCT GCTGGAGCTT TCAGTACTTG AGATCACCGA CGAGCAATGG CAACACCTTT ACCACCGCTG GAGTCGATGG CTGGACTGCC TTGAGAAAGA GCGCTTCACG ATGAGGTTTG ACGGTCCAAG CAGCTACCGC GTCTGGGACT GTGACAACTC AACCTCCTCC TCCTCATTCA CCGATGGGCT CAGCCTCACT CTCGGGAGTT ATTACCGAAG GCATCTTGAA ACCCAGTCAC TCAATCAACT AGCCGAAGAC CTCCAGAAAA GACTTTGTCA GTGGCGACAA CGAGAGGAGC AGGCCCTTGG CGAACAACAG GGTCGACTCA ACGAAACAAG TCAAAGCAAT TCTCTTCAGC AACAAGCCGA CGCCATGCTT TGCCTGCCTT CGCCGAGCAA GGATCTGATC AACCAAACCC AAAAGCTCTA CCGCAAAGCT CAAAAATTCC GCCGCTCAGT GCCCGTGCTG AAAACACGGA TTGAACATCA CCAGCAAAGG CTGCAGCTGA TCCAAGGCAG TGAAATGTTT CTGGAAGATC TGCTTGGAAC CAGCTGGGAA GGACGGCGAG AACGGTCGAT AAGGCTGCAG GAGTTGCGGC AGGAGCTGGA CGAGTTGCTG ATCTCTCAAT CCCGCAATCG CCAGAAGCGG GGCCGTCGCA ATCAACAACC CCCAAGCCCC CTCGAACTGA CCACTCCTGG TGGACTCGTC GTTCAGATTG GTCGCAACCA CCGCCAAAAC GATTGGATCA GCCTTCGCCA AGCCCGCCCT GGTGATCTTT GGTTTCATGC CCAGGAATGT CCAGGCAGCC ATGTCGTTCT CAAGGCCTCC AATGGCCATG CTGAGGAGGC CGACCTGCAA CTAGCTGCAG ACCTGGCAGC TCACTTCAGT CGTGCTAGAG GCAATCAACG CGTACCCGTG GTGATGGTGC CCACAAGCAA CCTACAGCGA ATCCCAGGAG CAGGCCCAGG GACCGTGCGC TACCGGGACG GAAACCTTTG CTGGGCTGAA CCAGATCGAG GGCTTCAACA CCTCTCTGCC TCAGAACTCT TAGTCTGA
|
Protein sequence | MHCIESDRRV GRLRAIQVPT ESMAPAALQV MDLSSLRAVL SELRKEVLPS RFEKVQQPEP HTLQLGLRTL KGLVWLELSW RADCPRLVKI TPPPRLGSGS TLAQQIQHGL RQMALIELKQ KGFDRVVECG LAHRPGEPIE RTLVLELMGR HSNLLLLDRQ RQVITLGRQV RNHQSRVRPI GTADIYVAPP PMQGREPSSK ESLKRWKDIL CLVPTKLRSA LQQCYQGISP ALALQLVDDD AKKAHALLEL SVLEITDEQW QHLYHRWSRW LDCLEKERFT MRFDGPSSYR VWDCDNSTSS SSFTDGLSLT LGSYYRRHLE TQSLNQLAED LQKRLCQWRQ REEQALGEQQ GRLNETSQSN SLQQQADAML CLPSPSKDLI NQTQKLYRKA QKFRRSVPVL KTRIEHHQQR LQLIQGSEMF LEDLLGTSWE GRRERSIRLQ ELRQELDELL ISQSRNRQKR GRRNQQPPSP LELTTPGGLV VQIGRNHRQN DWISLRQARP GDLWFHAQEC PGSHVVLKAS NGHAEEADLQ LAADLAAHFS RARGNQRVPV VMVPTSNLQR IPGAGPGTVR YRDGNLCWAE PDRGLQHLSA SELLV
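
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not part of the original record, of how the gene sequence could be joined and checked against the table (GC content, length divisible by 3, start and stop codons). The function names are hypothetical, and the start codon set is a simplification that lists only ATG, GTG and TTG. Note that the gene sequence here begins with TTG while the protein begins with M, which is consistent with an alternative initiation codon being read as methionine under translation table 11.

```python
START_CODONS = {"ATG", "GTG", "TTG"}   # simplification: common bacterial starts only
STOP_CODONS = {"TAA", "TAG", "TGA"}    # standard stop codons


def clean_sequence(grouped: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check length, first codon and last codon; internal stops are not checked."""
    return (
        len(dna) % 3 == 0
        and dna[:3] in START_CODONS
        and dna[-3:] in STOP_CODONS
    )


# Toy input in the same grouped layout as the record (not the real gene).
example = clean_sequence("TTGCATTGCA TTTGA")
print(len(example), round(gc_percent(example), 1), looks_like_complete_cds(example))
```

To apply this to the record, pass the full text of the Gene sequence field to `clean_sequence` and compare the results with the Gene Length and GC content rows.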