Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04661 |
Symbol | |
ID | 5731342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 438125 |
End bp | 439864 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641284823 |
Product | secreted protein MPB70 precursor |
Protein accession | YP_001550351 |
Protein GI | 159903007 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.252581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAATT CACTGCTACA AATAATGGAC CTGACCAGTC TCAAGGCTGT AATGGTTGAT ATGCGTACAA AAATAATTCC AAGTCGTTTC GAGAAAGCTC AACAAGTAGA AGCAAATACA TTGCAATTAT CTTTCCGCAC AGTAACAAGC GTTGTCTGGG TTGAACTCAG CTGGGATGCA TCCTCGGCAA GGTTAGTAGA AATAAACTCT CCTCCCAGAT CCTTTGGAGA AAGCACACTA GCAAAGCAAA TTAATCATGG ACTTCGACAA ATGGCATTGA TCGAACTAAA GCAAGAAGGG TTTGAACGAA TAATAGAATT TGGCTTTGCA TTAAGACCAA ATCAAAAATT ACAAAGATTT TTAATCCTCG AATTAATGGG AAGGCACAGT AATTTCTTTA TGCTTAATAA AGAACGCAAG ACAATAACTC TTGGTAGGCA AATCCGTAAT CACCAATCTC GATTAAGACC CATTAGCACG GGTGATCCAT ATATCCAGCC ACCCCAGCCA AGAGGAGTAA AACCAGATAA AAATGAATCA TTTCTCCACT GGAAGGAAAG ACTTTGCCTA GTACCTATCA AACTAAAGGA TGCTTTTTTG CAAAATTTTC AAGGGGTTAG CCCTTCTTTA CTACTTCAAT TGGCTAGTGA AGAAAAAGAA TTAGCCAAAG AGATTATAAA TCTTTCAGTT AAGGAAATTT CCGAGGAAAC ATGGCACCTT CTCTACAAAA GATGGCTGAT TTGGTTAGAT CATATAGAAA AAGAAACTTA CTCTATATTC TTTGAAGGGC CAACGGCTTA TAAAGTATGG AACTACAATA ATGATGCCTC ATCAGAATTA AAAGAAGTCA GTCTTGCTCT TGGCTATTAC TATCGTACGA AATTAGAATT AAAAAGATTT AAGGAGTTGT CCAGTGGGCT TAATAAGAAG CTTCTAAAAA TAAAGGAATC TGAAGAGACT CAGATGAAAA AACAAAAATA TCTACTTGAA AATATTCCTA AAAATGATCT TATAAAAAGA CAAGCAGATA AAATCCTTTG TTCTCAAAAA CCAAGCAAAG ATCAAGTCAA GGAAGCCCAG GACCTTTATC AGAAGGCGAA GAAAATGAGG CGTTCTGAGT CTGTCCTAAT CGAGAGAATC AATCACCACA CAAAAAAACT TAATCTTATT AATGAAAGCG AACTTTTTCT TAACGAAATC ACTTCAAGCC AATGTGAAAA TAATTCAGAA AAAATAAAAG CAATAAGCGA ATTAAAAGAT GAGCTTGAAA AGCACTTATT CAGATCTCAA ATTAAAGCTT CTTCTACAAA TTCAATAGTT CAACCATCTC TAATACTTGA ATTAATTAGC CCTAGTGGCT TGTCAATACA AATTGGAAGA AATCACCGGC AGAATGAGCT TATTAGCCTC AAAGAATCTC GAAAGGGGGA TATATGGTTT CATGCACAAG AATGTCCAGG CAGCCATGTA GTCATGAAAG CATCAAATGG AATCTATGAA GAAAATGATT TGCAAATGGG TGCTGATCTC GCTGCTTTTT TTAGTCGAGC AAAGCTAAAT AAAAAAGTTC CTATAATTAT GGCTCAAACA AACCAGCTTA AAAAACTAAA AGGCGCAATC CCTGGAACGG TAAAGCATAA AGGCGGAAAA ATTCTTTGGG GGAATCCATT AAACGGCGAA GATCATTTCA AAAGAGCTAC AGCAAACGCC CAAAATGCTC TATCATCAGC ACCCAGTTAG
|
Protein sequence | MDNSLLQIMD LTSLKAVMVD MRTKIIPSRF EKAQQVEANT LQLSFRTVTS VVWVELSWDA SSARLVEINS PPRSFGESTL AKQINHGLRQ MALIELKQEG FERIIEFGFA LRPNQKLQRF LILELMGRHS NFFMLNKERK TITLGRQIRN HQSRLRPIST GDPYIQPPQP RGVKPDKNES FLHWKERLCL VPIKLKDAFL QNFQGVSPSL LLQLASEEKE LAKEIINLSV KEISEETWHL LYKRWLIWLD HIEKETYSIF FEGPTAYKVW NYNNDASSEL KEVSLALGYY YRTKLELKRF KELSSGLNKK LLKIKESEET QMKKQKYLLE NIPKNDLIKR QADKILCSQK PSKDQVKEAQ DLYQKAKKMR RSESVLIERI NHHTKKLNLI NESELFLNEI TSSQCENNSE KIKAISELKD ELEKHLFRSQ IKASSTNSIV QPSLILELIS PSGLSIQIGR NHRQNELISL KESRKGDIWF HAQECPGSHV VMKASNGIYE ENDLQMGADL AAFFSRAKLN KKVPIIMAQT NQLKKLKGAI PGTVKHKGGK ILWGNPLNGE DHFKRATANA QNALSSAPS
|
| |