Gene P9303_18431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18431 
Symbol 
ID4775998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1601654 
End bp1606090 
Gene Length4437 bp 
Protein Length1478 aa 
Translation table11 
GC content54% 
IMG OID640087352 
Producthypothetical protein 
Protein accessionYP_001017850 
Protein GI124023543 
COG category[S] Function unknown 
COG ID[COG3164] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACAGA AGAGTCAATC TTTGGCTAAA GGGCGATGGA AACGCCCTGC TGTTCTACTG 
CTAACCAGCA GCTTGGTTGT TTGGGTTGGC GCCGATCGAG TTGTTGCTGC CCTTCTTGAA
CGTCTGCGTC CGCAACTGGA GCAACAGCTT TCAAAACCTC TAGGCCATCC TTTAAAAATT
GGTGCCTATC AGGGCCTGCG GCCCTGGGGT ATGGCCATTG GTCCCACTGA AGTGCTGTCT
GGAAGTCATG ACGATTCCAC AGCATCGCTC TCTGGTCTAA CCATCAGCCT TGCCCCAATA
GCCAGCCTGT TGCGTTTGCG ACCCGTTGCC GTACTAACCC TTGAAGGATC ACGGTTAACG
CTGCGTCGTA ATCACAAGGG TTTCTACTGG GTCCCTGGTC CATCCAAAGG CGAGCCTCCC
CCAAAGGTTG ATCTTCAGCT GCGGTTGACT CAGCCCGCCA GGGTCCGCAT CGAACCCGCA
AACTTGGAGT TCACCGCTAC CACACGGGCA GACCTTCAGT TGGCGGAGGG ATGGGCCAAT
GGGTCTGTCC AATTTGTATT GCCAGATCGT GGAAGATTCT TCCTAAAGGG CCGTGGGCGC
TGGGATCGTT TAGAGCTTGA GGCCCATGCT CGTTTGGACC AAATCAGCCT TAAACCATTG
CAGGGGCTTT TGCCGGGAAC GCTGCCCATG CAAATGCAGG GTCAGATTGG CGGTGACCTC
CAAGTGAGTC TTAATCAGGG ACGTATGGGC TGTAAGGGAT CCCTGTCATT GGTTCGATTC
CAATTAGCGG GTGGTCCCCT CAAAAAGTCA CTCTCATCTC GTGAAGCCAA GATCAACTGT
CGTCAAGATC GTCTTCAGTT ACCTCTCAGC CAGTGGCGTT ACGGCCCTTG GACCGCTTCT
CTCAAGGGAG GTGCTCGTCT CAATCACTCC TACAACCTCG ACCTCAAGGT CAACCAGCGG
CAACAAGGGC ATGCCTTCCA AGCCCGGATC AATGGACCTT GGCATGAGCC CAATCTGCAC
GCCAGTGGCC GCTGGATTCT TGCCCCAAAG ATCCCCGTTG ATGGTCCGCT GCAACTGAAC
TTGCAGATGC GCACCAACTG GCGCAATCCC AAGGCCTTCA GGGCCGTTAT TGACACGTTT
GATGTTCGTG GTCCAGGTCT TCAGGTCCGT GCAAGGGGGC CCCTCTATCC CGAGTTAGGG
CTGAGCACCC AGCGCCTGGA GTTTGCAGGC CCTGCATGGC AGCGCATCCC TGTGCTTGCT
GATCTTCTCG GCAGCCAATC CTTGATCAAA GGCAAACTTC AGCTTGAGGG CCCTTCATCG
AGCCCCCAAC TTCAGCTAAG CCTTGCTCAG CAAGGTAATC CTTTGCTGGA AACCTGGTCT
CTGCGAGCGG GGTGGTCAGC AGATTCCGGT TTGCTGCGCT TAAGGCAGTT CAACAGTCCC
CTGCTCAAAG TTGTGGCGGA CTTGCCTCTC TCAGTTGATC AAGGACGCCT ACGCAGTGGA
GAACTGCAGG CCAATCTGAA TTTGAGTCCT TTCCCTTTAG CTCGTATTGG TCCGCTTCTC
GGCACCTCTT TGGCTGGAAC CCTTGCTGCT TCGGGTCAGG TGAGAGGGCC ATTCTCGGCC
CTTCGGCCAA ATCTTTCCCT CCGGGTGGTG AATCCAGAGG CTGGAGGGTT GCGATTGTTG
GAAGATTGGC AAGGGAACTT GGCCGGACTC CCTACTGGGG GTAGCACTCT GCTGATGGAA
TCAGTGGGGG CGGTGATCCC TGGGCAGCTC TCAGCTCGAT TGGGACGCGA CTGGTTGCCA
CAGGAGTTGG CGATTAACCG TGGCGATGGC CGTCTCTCGC TAAACGGCAT CCCAGCTCGC
TATCTCTGGG AGTTGAACAA TTTCAAAGTG GATGGAATTG AGGCGGCCCT GTCTTCAAAG
CAGCGATTTG AAGGCGTTTA TGGTCAACTC AGCGGGTCAG GCAGCCTGGG CCTTCAGCCC
CTGGCCATGG AAGGCCAAGT CACCATCAGC AATCCAGGGT TGATGGGTCT GCAATTTCAA
CAGGCTTTGC TTCAGGGGAG GGTCGCCAAC CAGCGCTACA AGATGACTGG TGAGCTGCTA
CCAGCGGATA CAGGCCAGAT CAATCTTGCA GCAGGAGGTC GCCTTGGTGG CGAACTCTCT
GCTAAGGCAA AGGCACGCGG TCTAAGCGCG CGTTGGCTGA TCTCTAGTGC AGAGCAACTT
TCTAATTTTA ATGATGTTCT TCCAGCTTCG ATTGGGCGAG CCCAAGATCT GGGCACTCTG
TTGATGCAAA CCTTGGGAAG GTCTCTAGAT GATCAGCTCA AAGTCTTGGC AGCGGCGCAG
GCTTCCGTCA ATCGTTTCGA CCAGCAAAAT CGTCGTAGCA AGATCATCCA TCCAGAAGAT
CTACGTGGGC AGATTGATGC GGTGATCGAT CTGAAGGGTC CCGATCTGTC CAAACTCAAT
CTGGGGCTAA AGGCCAGCGG TCATCTCTGG ACAGAAAGTG AGGATCAGGA TCACGCCCTA
CAAGTCAAAC CCATTTTCGC CTCGATTCAA GGACCACTGC ATGGTGGGGA AGGGTCGTTT
TCACTGCTTC ACGTTCCCTT TTCTCTGCTT TCGTTGGTTG CACCTCTACC TCAAGCACTG
CGAGGCGCTC TGGGACTTTC GGGTCGATAC AACCTCAGAC GGGGAACCCA TGAAATCACA
GCTGACCTAG TGATGGAAAG CGCCAGGTTG GCTGAGAGCA AATTGAGCTT GGACAAAGGG
CAGATCCTCT TGAATGATGC CCTCCTGAAC CTCGACTTCG CGCTGCGAAG CTCTTCTTCC
AAAGAGGCTG TAACCATTAC CGGGCAAGTG CCTCTGGATC CCTCCTTGCC GATAGACGTC
AGGGTTGAGA GCCACGGCGA CGGCCTACAT TTTCTGGCTG ATTTTGCTGA AGGTGCGGTC
GCCTGGAAGG GCGGTAACTC CGACCTCAAG TTGTTGTTTA GCGGCAGCCT TAGCGCTCCT
CAGGCGAATG GATTTTTGGT GGTGCAAAAC GGCGAATTCG TCGTCATGGA ACAAGTTGTC
AAGGGGTTGG AGGCTGCCAT GGTTTTTGAC TTCAATCGCC TTGAGGTGCA GCGTCTCAAA
GCCAAGATTG GTTCGAAGGG CATCTTGCAA GGTGCGGGAT CCATTGCCTT GCTTCGCCCT
GCACCTGAAG ACCAGCCCCT GACCATCGAG ATCAGTAAGT CCCGTTTTAA GTTGCCAAAG
GCTGATGTAG GGGTGGCCGC CAAGCTCAAG TTCACGGGTG CCTTACTGAA ACCCTTGATT
GGTGGTGAGC TCACGATCAA AGAAGGAACG ATCTCACCCG CCGGCTCAGG GCTTCTGCGA
CCGATCAACT CTGCCATTCA ATCCACCAAA AGGCCTGGAG CCGGAGAGGC GATAGCGACT
TCCAGCAGCC CCAAAGTTGT TAATGCCAAC ACCCTGCTTG AGGAGCAGTG GGATTTCAAG
AAACCCTTGG TTTTGCTTGG ACCAGATGTA GACGTCAGTC GAAGGAAGAT GTTGAGCTCC
GTGATTCCCA ACATCCCCTC TATCAGCTTC GACAACCTGC GTTTAAAGCT GGGGCCAAAT
TTGCGCATCA CTGCTAATGC GCTGGCCAAT TTCAGTACAG AAGGGTTGCT CAGTTTGAAT
GGCCCCCTAG ATCCAAAGCT TCAGGCTCGT GGTGTGATTC GGCTACTGAA TGGGCGGCTG
AATCTGTTTA CGACGTTCTT CAGTCTTGAT CAGCGAGCTG CGAATGTCGC TGTGTTTACT
CCTTCTCTGG GCCTAATCCC TTACGTTGAT GTCGCCATGA ACAGCCAGGT CTCCGACAGC
ATCAGCATCG GCACCGACAG TAATGCCGCA TCAGCGAATG TATTTGATAC CAATGGCACA
GGTGCTCTTG GGGCCGGAGG ACAGTTCCGC TTGATCAAAG TAATGGTGAA GGCCGAGGGA
CCTGCAAATC GTCTTTTCCA AAACATTGAC TTACGCAGCT CGCCGTCACT GCCTCGTGCT
CAATTGCTGG GGTTAATTGG AGGAAATTCA CTGGCAGGGT TGTCGGGAGA AGGTGGTGGT
GCGGCACTAG CGACTGTGAT CGGTCAATCT CTTCTCACGC CTGTTCTTGG AACAATCTCT
GATGCTTTCA GTCAACGAAT GCAAATTGCC CTTTATCCTG CATACGTCTC GCCGGTTGTG
ACGAGTCAAC AAGAGCGCGT TTCTGGGCAA GTCCCACCCA CTCTCGAGGT AGTCACAGAC
ATTGGTATTG ATATCACTAA GCGACTTAAC GTTTCTATTT TGGCGACACC AGACCGCAAC
GATATTCCTC CGCAAGGCAC CCTTACTTAT CAAATCAGTC CCAGCATGAA TCTCTCAGGT
TCTGTAGACA GCCAGGGAAT TTGGCAGAGC CAATTGCAAT TATTTTTCCG CTTTTGA
 
Protein sequence
MGQKSQSLAK GRWKRPAVLL LTSSLVVWVG ADRVVAALLE RLRPQLEQQL SKPLGHPLKI 
GAYQGLRPWG MAIGPTEVLS GSHDDSTASL SGLTISLAPI ASLLRLRPVA VLTLEGSRLT
LRRNHKGFYW VPGPSKGEPP PKVDLQLRLT QPARVRIEPA NLEFTATTRA DLQLAEGWAN
GSVQFVLPDR GRFFLKGRGR WDRLELEAHA RLDQISLKPL QGLLPGTLPM QMQGQIGGDL
QVSLNQGRMG CKGSLSLVRF QLAGGPLKKS LSSREAKINC RQDRLQLPLS QWRYGPWTAS
LKGGARLNHS YNLDLKVNQR QQGHAFQARI NGPWHEPNLH ASGRWILAPK IPVDGPLQLN
LQMRTNWRNP KAFRAVIDTF DVRGPGLQVR ARGPLYPELG LSTQRLEFAG PAWQRIPVLA
DLLGSQSLIK GKLQLEGPSS SPQLQLSLAQ QGNPLLETWS LRAGWSADSG LLRLRQFNSP
LLKVVADLPL SVDQGRLRSG ELQANLNLSP FPLARIGPLL GTSLAGTLAA SGQVRGPFSA
LRPNLSLRVV NPEAGGLRLL EDWQGNLAGL PTGGSTLLME SVGAVIPGQL SARLGRDWLP
QELAINRGDG RLSLNGIPAR YLWELNNFKV DGIEAALSSK QRFEGVYGQL SGSGSLGLQP
LAMEGQVTIS NPGLMGLQFQ QALLQGRVAN QRYKMTGELL PADTGQINLA AGGRLGGELS
AKAKARGLSA RWLISSAEQL SNFNDVLPAS IGRAQDLGTL LMQTLGRSLD DQLKVLAAAQ
ASVNRFDQQN RRSKIIHPED LRGQIDAVID LKGPDLSKLN LGLKASGHLW TESEDQDHAL
QVKPIFASIQ GPLHGGEGSF SLLHVPFSLL SLVAPLPQAL RGALGLSGRY NLRRGTHEIT
ADLVMESARL AESKLSLDKG QILLNDALLN LDFALRSSSS KEAVTITGQV PLDPSLPIDV
RVESHGDGLH FLADFAEGAV AWKGGNSDLK LLFSGSLSAP QANGFLVVQN GEFVVMEQVV
KGLEAAMVFD FNRLEVQRLK AKIGSKGILQ GAGSIALLRP APEDQPLTIE ISKSRFKLPK
ADVGVAAKLK FTGALLKPLI GGELTIKEGT ISPAGSGLLR PINSAIQSTK RPGAGEAIAT
SSSPKVVNAN TLLEEQWDFK KPLVLLGPDV DVSRRKMLSS VIPNIPSISF DNLRLKLGPN
LRITANALAN FSTEGLLSLN GPLDPKLQAR GVIRLLNGRL NLFTTFFSLD QRAANVAVFT
PSLGLIPYVD VAMNSQVSDS ISIGTDSNAA SANVFDTNGT GALGAGGQFR LIKVMVKAEG
PANRLFQNID LRSSPSLPRA QLLGLIGGNS LAGLSGEGGG AALATVIGQS LLTPVLGTIS
DAFSQRMQIA LYPAYVSPVV TSQQERVSGQ VPPTLEVVTD IGIDITKRLN VSILATPDRN
DIPPQGTLTY QISPSMNLSG SVDSQGIWQS QLQLFFRF