Gene P9303_06671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_06671 
Symbol 
ID4776525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp627414 
End bp629201 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content56% 
IMG OID640086174 
Productsecreted protein MPB70 precursor 
Protein accessionYP_001016683 
Protein GI124022376 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATTGCA TTGAATCGGA TCGCAGAGTA GGAAGGCTGC GTGCAATCCA GGTCCCGACT 
GAATCGATGG CCCCTGCTGC CCTCCAGGTC ATGGACCTCT CAAGCTTGCG CGCTGTACTT
AGCGAGCTCC GCAAGGAGGT GCTGCCAAGC CGCTTCGAAA AGGTCCAACA ACCAGAACCT
CATACCCTCC AGCTCGGGCT GCGCACACTC AAAGGTCTTG TGTGGCTGGA ACTCAGCTGG
CGTGCTGATT GTCCCCGATT GGTCAAGATC ACCCCTCCCC CCCGCCTAGG CAGCGGCAGC
ACTTTGGCCC AGCAGATTCA ACATGGCCTT CGACAGATGG CGCTCATTGA ACTGAAACAG
AAAGGATTTG ATCGCGTGGT GGAGTGCGGC CTCGCCCACC GCCCTGGCGA ACCTATCGAG
CGCACTCTGG TGCTGGAACT GATGGGACGC CACAGCAACC TGCTTCTACT TGATCGTCAA
CGCCAGGTGA TCACCCTTGG ACGCCAAGTA CGCAACCATC AATCAAGAGT GAGACCAATC
GGAACTGCAG ACATCTATGT CGCTCCACCG CCGATGCAAG GCAGAGAACC AAGCTCAAAG
GAATCCCTAA AACGCTGGAA AGACATTCTC TGCCTTGTGC CGACCAAGCT GCGTAGCGCT
CTGCAGCAGT GTTACCAAGG CATCAGTCCT GCACTCGCTT TGCAGCTGGT TGACGACGAT
GCAAAAAAAG CACATGCCCT GCTGGAGCTT TCAGTACTTG AGATCACCGA CGAGCAATGG
CAACACCTTT ACCACCGCTG GAGTCGATGG CTGGACTGCC TTGAGAAAGA GCGCTTCACG
ATGAGGTTTG ACGGTCCAAG CAGCTACCGC GTCTGGGACT GTGACAACTC AACCTCCTCC
TCCTCATTCA CCGATGGGCT CAGCCTCACT CTCGGGAGTT ATTACCGAAG GCATCTTGAA
ACCCAGTCAC TCAATCAACT AGCCGAAGAC CTCCAGAAAA GACTTTGTCA GTGGCGACAA
CGAGAGGAGC AGGCCCTTGG CGAACAACAG GGTCGACTCA ACGAAACAAG TCAAAGCAAT
TCTCTTCAGC AACAAGCCGA CGCCATGCTT TGCCTGCCTT CGCCGAGCAA GGATCTGATC
AACCAAACCC AAAAGCTCTA CCGCAAAGCT CAAAAATTCC GCCGCTCAGT GCCCGTGCTG
AAAACACGGA TTGAACATCA CCAGCAAAGG CTGCAGCTGA TCCAAGGCAG TGAAATGTTT
CTGGAAGATC TGCTTGGAAC CAGCTGGGAA GGACGGCGAG AACGGTCGAT AAGGCTGCAG
GAGTTGCGGC AGGAGCTGGA CGAGTTGCTG ATCTCTCAAT CCCGCAATCG CCAGAAGCGG
GGCCGTCGCA ATCAACAACC CCCAAGCCCC CTCGAACTGA CCACTCCTGG TGGACTCGTC
GTTCAGATTG GTCGCAACCA CCGCCAAAAC GATTGGATCA GCCTTCGCCA AGCCCGCCCT
GGTGATCTTT GGTTTCATGC CCAGGAATGT CCAGGCAGCC ATGTCGTTCT CAAGGCCTCC
AATGGCCATG CTGAGGAGGC CGACCTGCAA CTAGCTGCAG ACCTGGCAGC TCACTTCAGT
CGTGCTAGAG GCAATCAACG CGTACCCGTG GTGATGGTGC CCACAAGCAA CCTACAGCGA
ATCCCAGGAG CAGGCCCAGG GACCGTGCGC TACCGGGACG GAAACCTTTG CTGGGCTGAA
CCAGATCGAG GGCTTCAACA CCTCTCTGCC TCAGAACTCT TAGTCTGA
 
Protein sequence
MHCIESDRRV GRLRAIQVPT ESMAPAALQV MDLSSLRAVL SELRKEVLPS RFEKVQQPEP 
HTLQLGLRTL KGLVWLELSW RADCPRLVKI TPPPRLGSGS TLAQQIQHGL RQMALIELKQ
KGFDRVVECG LAHRPGEPIE RTLVLELMGR HSNLLLLDRQ RQVITLGRQV RNHQSRVRPI
GTADIYVAPP PMQGREPSSK ESLKRWKDIL CLVPTKLRSA LQQCYQGISP ALALQLVDDD
AKKAHALLEL SVLEITDEQW QHLYHRWSRW LDCLEKERFT MRFDGPSSYR VWDCDNSTSS
SSFTDGLSLT LGSYYRRHLE TQSLNQLAED LQKRLCQWRQ REEQALGEQQ GRLNETSQSN
SLQQQADAML CLPSPSKDLI NQTQKLYRKA QKFRRSVPVL KTRIEHHQQR LQLIQGSEMF
LEDLLGTSWE GRRERSIRLQ ELRQELDELL ISQSRNRQKR GRRNQQPPSP LELTTPGGLV
VQIGRNHRQN DWISLRQARP GDLWFHAQEC PGSHVVLKAS NGHAEEADLQ LAADLAAHFS
RARGNQRVPV VMVPTSNLQR IPGAGPGTVR YRDGNLCWAE PDRGLQHLSA SELLV