Gene P9211_04661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04661 
Symbol 
ID5731342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp438125 
End bp439864 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content36% 
IMG OID641284823 
Productsecreted protein MPB70 precursor 
Protein accessionYP_001550351 
Protein GI159903007 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAATT CACTGCTACA AATAATGGAC CTGACCAGTC TCAAGGCTGT AATGGTTGAT 
ATGCGTACAA AAATAATTCC AAGTCGTTTC GAGAAAGCTC AACAAGTAGA AGCAAATACA
TTGCAATTAT CTTTCCGCAC AGTAACAAGC GTTGTCTGGG TTGAACTCAG CTGGGATGCA
TCCTCGGCAA GGTTAGTAGA AATAAACTCT CCTCCCAGAT CCTTTGGAGA AAGCACACTA
GCAAAGCAAA TTAATCATGG ACTTCGACAA ATGGCATTGA TCGAACTAAA GCAAGAAGGG
TTTGAACGAA TAATAGAATT TGGCTTTGCA TTAAGACCAA ATCAAAAATT ACAAAGATTT
TTAATCCTCG AATTAATGGG AAGGCACAGT AATTTCTTTA TGCTTAATAA AGAACGCAAG
ACAATAACTC TTGGTAGGCA AATCCGTAAT CACCAATCTC GATTAAGACC CATTAGCACG
GGTGATCCAT ATATCCAGCC ACCCCAGCCA AGAGGAGTAA AACCAGATAA AAATGAATCA
TTTCTCCACT GGAAGGAAAG ACTTTGCCTA GTACCTATCA AACTAAAGGA TGCTTTTTTG
CAAAATTTTC AAGGGGTTAG CCCTTCTTTA CTACTTCAAT TGGCTAGTGA AGAAAAAGAA
TTAGCCAAAG AGATTATAAA TCTTTCAGTT AAGGAAATTT CCGAGGAAAC ATGGCACCTT
CTCTACAAAA GATGGCTGAT TTGGTTAGAT CATATAGAAA AAGAAACTTA CTCTATATTC
TTTGAAGGGC CAACGGCTTA TAAAGTATGG AACTACAATA ATGATGCCTC ATCAGAATTA
AAAGAAGTCA GTCTTGCTCT TGGCTATTAC TATCGTACGA AATTAGAATT AAAAAGATTT
AAGGAGTTGT CCAGTGGGCT TAATAAGAAG CTTCTAAAAA TAAAGGAATC TGAAGAGACT
CAGATGAAAA AACAAAAATA TCTACTTGAA AATATTCCTA AAAATGATCT TATAAAAAGA
CAAGCAGATA AAATCCTTTG TTCTCAAAAA CCAAGCAAAG ATCAAGTCAA GGAAGCCCAG
GACCTTTATC AGAAGGCGAA GAAAATGAGG CGTTCTGAGT CTGTCCTAAT CGAGAGAATC
AATCACCACA CAAAAAAACT TAATCTTATT AATGAAAGCG AACTTTTTCT TAACGAAATC
ACTTCAAGCC AATGTGAAAA TAATTCAGAA AAAATAAAAG CAATAAGCGA ATTAAAAGAT
GAGCTTGAAA AGCACTTATT CAGATCTCAA ATTAAAGCTT CTTCTACAAA TTCAATAGTT
CAACCATCTC TAATACTTGA ATTAATTAGC CCTAGTGGCT TGTCAATACA AATTGGAAGA
AATCACCGGC AGAATGAGCT TATTAGCCTC AAAGAATCTC GAAAGGGGGA TATATGGTTT
CATGCACAAG AATGTCCAGG CAGCCATGTA GTCATGAAAG CATCAAATGG AATCTATGAA
GAAAATGATT TGCAAATGGG TGCTGATCTC GCTGCTTTTT TTAGTCGAGC AAAGCTAAAT
AAAAAAGTTC CTATAATTAT GGCTCAAACA AACCAGCTTA AAAAACTAAA AGGCGCAATC
CCTGGAACGG TAAAGCATAA AGGCGGAAAA ATTCTTTGGG GGAATCCATT AAACGGCGAA
GATCATTTCA AAAGAGCTAC AGCAAACGCC CAAAATGCTC TATCATCAGC ACCCAGTTAG
 
Protein sequence
MDNSLLQIMD LTSLKAVMVD MRTKIIPSRF EKAQQVEANT LQLSFRTVTS VVWVELSWDA 
SSARLVEINS PPRSFGESTL AKQINHGLRQ MALIELKQEG FERIIEFGFA LRPNQKLQRF
LILELMGRHS NFFMLNKERK TITLGRQIRN HQSRLRPIST GDPYIQPPQP RGVKPDKNES
FLHWKERLCL VPIKLKDAFL QNFQGVSPSL LLQLASEEKE LAKEIINLSV KEISEETWHL
LYKRWLIWLD HIEKETYSIF FEGPTAYKVW NYNNDASSEL KEVSLALGYY YRTKLELKRF
KELSSGLNKK LLKIKESEET QMKKQKYLLE NIPKNDLIKR QADKILCSQK PSKDQVKEAQ
DLYQKAKKMR RSESVLIERI NHHTKKLNLI NESELFLNEI TSSQCENNSE KIKAISELKD
ELEKHLFRSQ IKASSTNSIV QPSLILELIS PSGLSIQIGR NHRQNELISL KESRKGDIWF
HAQECPGSHV VMKASNGIYE ENDLQMGADL AAFFSRAKLN KKVPIIMAQT NQLKKLKGAI
PGTVKHKGGK ILWGNPLNGE DHFKRATANA QNALSSAPS