Gene P9303_14611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14611 
Symbol 
ID4776780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1255510 
End bp1258362 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content46% 
IMG OID640086970 
Producthypothetical protein 
Protein accessionYP_001017472 
Protein GI124023165 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.62237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTGG CTGGTTTCGC CCGACCCTTG CAATGGTTCA AGAGCCAGTG GCTTTGGCTC 
TTGCTCTCTA TCGCAGCTTT CTGGTTGTTG ATGCGGGTTC AAGTCGAATG GTTGTGGTTC
GGTCAATTTG ATTGGCAAGG GATGCTTCTC CGCCGTTGGC TCTGGCAACT GGGAGGACTT
CTACTCGCCC TTCTGGTTGT AGCGACTTGT CAGCTTTGGC AGCGCAACTG GATCAAACTC
GAAGGTGCAA GCAACTTGGC AGAGCCAGCG CTTCCCCTGC ATGGATGGCG TTATGGATTG
GGGCTACTTG GCTGCTTCGT GGTGGTGGTC GGTGATCTTG TATTACTCAC TCGATTGGCG
TGGTTAGCTT GTTTTAAGCC ATTTGCCCTT GGGCATTGGT GGAGTGAACC TTTTGAAGAC
ATTTGGGCGT TGGTGATTCC GCTGTCCTGT GTATTCATCT CAATTTGCGT GATGCTTGGC
AATGCCCGAG GGGGAAGGAT TGCTCATTTG ATGGGTTGTT TTTGCTTCAG TATCTCCATC
GCTAGAGGCT GGGGATTATG GTCACTCGCC CTTGCTATCC CTCCTACAGG TATTAAAGAA
CCACTACTGG GCGCTGATGT GAGTTTTGGG CTCGGTCAAT TTCCAGCACT TGCCTTTGCC
TTGGTTGTTC TATTGGCCCA ACTCGTCTTA ACAACGAGTA CGACAATATG GATGAAGTTA
GCCCAGCCTG AATCTCTTTC AGATTGGGTT TTCAAAGGTT TATCGCCTAG ACAGTGTGAT
GTTATGAGGC CATTAATTGG CATCATACTT TTAACACTTT CAGCACTTTT ATGGTTGTCA
CGTCATGAAC TTCTGTGGAC GCAGAACGGT ACGGTAGCCG GAGCTGGTTG GTTGGATGCT
CACCTGATAC TTCCTCTACG AAGTTTGGCG AGCTTGGCCA TTTTGGTTCT CGCATTTCTG
GTGATTCCAT TTCCATGGAT ACAACAAAGG CGTTTATGGA GATTAATCGC TTCAATTATT
GGCGTAGGCG CAATTCTTCT CGAAGTGCTT TTGGCTCCTT TTGTCCAGTG GATGGTCGTA
AAACCTAGAG AACTGAAACT AGAAACCCCT TACATCATTC GAGCAATTAA AGCCACAAGG
AAAGCATTTC AGCTTGACTC GATTACGACT ACACTTATTA ATCCTCAACC TCAGTTAACT
CAACTTGATC TTGAACAAGG TGCAAGCACC CTAAGGAATA TTCGTCTATG GGACAGCCAG
CCTCTCTTGG CAACCAATCG TCAATTACAG CAATTGAGGG TTTACTATCG CTTTTCAAAT
GCGGCGGTAG ATCGTTATCG CTTTGTGCCT GACAAGGCGA ACCGGCAACA GGTGATGATC
ACAGCACGCG AACTTGATCA GGCTGCACTT CCAAAACGTT CACGTACTTG GTTGAATCGT
CATTTTGTTT TCACGCATGG ATATGGCTTT ACTCTCAGTC CAGTCAATAC AAGAGCACCC
GATGGCCTCC CTGACTATTT TATCAGTGAT CTAGGAACCT CAACACGTCT TGAAGGTAGT
TCTGAGCTCG GCATTACTCG TGAAGATGTC AAAGAGGCAG TGCCTATTGG GCGAGCAGCT
CTCTATTTTG GAATGCTTCC CTCTCCTTAT GCTCTTGCTC CGAGCAAACT AAAAGAACTA
GATTATCCAG TAGGCGATAA GAATATCTAC AACCACTATT TAGGATCAGG AGGTGTGCCG
GTGGGTCATC CCTGGCAACA ATTAGCAGCA GCTATGTACC TTTTTGAACC CCGCCTTCTT
AATACAGGGT CGCTAACGAT TAATTCCAAA CTTCTTATTA GGCGAGAAGT GAGACAAAGA
GTGAGTGCAA TAGCGCCATT CCTTGAAGTT ATTGGTGATC CATATTTGGT CTCTACATCT
GTTAATTCTA GAGATCATGA TTATCAAGCT AAGCAAAACC AATATTGGAT TGTTGAAGCT
TATACAAGCT CACGCACATA TCCCTACGCA GCAAACCTGC CAGATGGTCG CCCTGTGCGA
TATTTACGTA ACTCAGTCAA AGCTATTGTT GATGCATACA GTGGTCGTGT TCACTTGTAT
GTGAGCGAAC CGCGGGATCC GATAATATTG GGTTGGCAGC GATTGTTTCC AGATCTTTTC
AAACCTCTTG AGGAGATGCC TTCAAGCTTA CGAGAGCATC TTAAGGTTCC TACAGATTTG
TTTAATGTAC AGGTACAGCA ATTGCTTAGA TACCACGTTA CTGATCCTCG TATATTTTAT
AGCGGTGATG ATGTTTGGCA GGTTCCAAAG GAACTCTATG GTAAGCGGCA GGTTCCTGTT
GATCCCTATC ACATCACTGC ACAATTAGGT AGTCAAGAAA GTTCAGAATT TCTCTTATTA
CAACCACTAA CACCACTAGC TCGCCCCAAT CTTTCTGCGT GGCTCGCTGC TCGAAGTGAT
GGTGATCATT ATGGAAAATT AGTGTTGCTA CGTTTCCCAA GTCAAACGCC AATCTTTGGC
CCTGAACAGA TTCAGGCCCT TATCAATCAG GACCCGCAGA TCAGCCAACA GTTTGGTCTT
TGGGATCGTG CTGGGTCTGA AGTTGTGCAA GGGAACCTTC TTGTTGTACC CCTAGGCAAG
GCCCTTCTTT ATGTAGAACC TGTTTACCTA AGGGCACGTC AAGGTGGTCT ACCAACCCTT
ACCAGGGTAG TTGTTAGTGA TGGCAAAAGG ATTGCCATGG CTGAGGATCT CGGAGAAGGC
CTAAGGGCCT TGGTTGACGG ATCAAGCAAA AAAGCAGTGT ATTTAAATAG AAATGATCTG
CCACCGATTA AGGCAGCAGA TCAATCAAAT TAA
 
Protein sequence
MGLAGFARPL QWFKSQWLWL LLSIAAFWLL MRVQVEWLWF GQFDWQGMLL RRWLWQLGGL 
LLALLVVATC QLWQRNWIKL EGASNLAEPA LPLHGWRYGL GLLGCFVVVV GDLVLLTRLA
WLACFKPFAL GHWWSEPFED IWALVIPLSC VFISICVMLG NARGGRIAHL MGCFCFSISI
ARGWGLWSLA LAIPPTGIKE PLLGADVSFG LGQFPALAFA LVVLLAQLVL TTSTTIWMKL
AQPESLSDWV FKGLSPRQCD VMRPLIGIIL LTLSALLWLS RHELLWTQNG TVAGAGWLDA
HLILPLRSLA SLAILVLAFL VIPFPWIQQR RLWRLIASII GVGAILLEVL LAPFVQWMVV
KPRELKLETP YIIRAIKATR KAFQLDSITT TLINPQPQLT QLDLEQGAST LRNIRLWDSQ
PLLATNRQLQ QLRVYYRFSN AAVDRYRFVP DKANRQQVMI TARELDQAAL PKRSRTWLNR
HFVFTHGYGF TLSPVNTRAP DGLPDYFISD LGTSTRLEGS SELGITREDV KEAVPIGRAA
LYFGMLPSPY ALAPSKLKEL DYPVGDKNIY NHYLGSGGVP VGHPWQQLAA AMYLFEPRLL
NTGSLTINSK LLIRREVRQR VSAIAPFLEV IGDPYLVSTS VNSRDHDYQA KQNQYWIVEA
YTSSRTYPYA ANLPDGRPVR YLRNSVKAIV DAYSGRVHLY VSEPRDPIIL GWQRLFPDLF
KPLEEMPSSL REHLKVPTDL FNVQVQQLLR YHVTDPRIFY SGDDVWQVPK ELYGKRQVPV
DPYHITAQLG SQESSEFLLL QPLTPLARPN LSAWLAARSD GDHYGKLVLL RFPSQTPIFG
PEQIQALINQ DPQISQQFGL WDRAGSEVVQ GNLLVVPLGK ALLYVEPVYL RARQGGLPTL
TRVVVSDGKR IAMAEDLGEG LRALVDGSSK KAVYLNRNDL PPIKAADQSN