Gene P9303_01331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01331 
Symbol 
ID4776391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp144261 
End bp147179 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content42% 
IMG OID640085632 
Producthypothetical protein 
Protein accessionYP_001016153 
Protein GI124021846 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCCAA GCTTATTGCT CCGCAATCAG ATAGCTGAGC ACAAAAGGCC TGCTTATGAC 
TATTTTGAAG ATATAGATAT AGTTGATCTC CAGGATAAAA CCCTTGTTAA ATTAGATCGT
CATTGGAGTA GTTATAAGGC TTCATGGCCT CTTGCCGATA TAAATGAAGG CTTCTCAAGT
AATAGCCATG TTCGTGACTT TGAAGAAAGC TTGGTTTTAT TGTTGGAGGC TCGAATGGCT
TTTTGTCAGT CAGGCGATGA TTCTTCTATG TGGGTTGTTA AGGATCCTCG TACTCCCATA
CTTCTTGCAA GTTGGTTACG CGTCATGGAA TGGCTCCAAA TTGAGCCTGT ATTTATAATT
GTCCATCGTA ACGCTTCTGA CAATATCCAA TCTTTCAGCA AAAAGGGACA GGTCCCACAG
CGTTGGGCAG AGGCTCTTTG GCAGCAAAGT TATGTGCAGA TTGGGAAATC CATCCCTTCT
GCATCTTCTG TGTATTCTCT TGATTTCTCA CAGATATTAG AGAATCCTCT GCATGTTGCT
CAAGAACTTA AAAACTTCCT AGGCATGCAT ACAGATCAAC CATTAGTTCC AACTTTAAAG
AAAGCAGTAG ATCCCTCCCT CCCCTCTAAG CATTCAGAAT ACAAGCTTAG TTCTATTAGT
GAAAAGATCG AGTTGTGCTT AAAACAGTCT CGCTTTTCGG ATTTACCTTC ACCTGATGAT
ATTACGCTCG AGGCTTATAA AATACAAGCT GATTTGACCC CAGTTTCACA ACTAACTCTA
CATAATTTTG GCATAGAACT TCGAAAAAAT AAAGCTATGT CTCACTCCAG TCGTAAGCGA
ATTTGTATTC TCACAGCAGA GCTCCAGGGA TATGGTCCGA GTGGAGGGAT TGGAACGGCA
ATGTTGGAGT TGGCGATAGA GCTTGTCAGT TCTGGACACC TAGTTGAGGT TTGGTTGGTT
GGTAGTAGTA ATGATCCCAT ACCCTCAAGT CGACTTGACT CTATTCATAT AAGGCATTTA
CCTGGAGAAA CTGTAGATCA AGACCCTGCT AACTTTCGGC AACAAATAGC AGAGGCTGTA
TTAACGGAAT CTTTCGATAT TATTCATTGT CATGATTGGT TAGGACTTGG TGCTTGCTTG
CATTTATCAA CCCTTGAAAC CGAATCCCCA ATTGTAATAT GTGGCTTGCA TGGCCCAACT
CAGTGGGTAA GAGAAGGAAG TCCTTCGATT TCCAATTGGA CCAAAAGAGA TTCAACAATC
ATTGAACTGG AATGGCAGGC GATCATCAAT GCAGATGTAC TGTTTAGTCC CTCTGCATAT
ATGAAAAACT GGGTATCAAA ACACCTTGGA AATAGAAAAT ATTATCCAGA TATTCATGTG
CAGTTGAATT GTCCTAGCGT TGCCCCAAAT GCAAAACTAA GTCGTAATAT TGATCTTCCT
CCAGAAAGCA AGAGTAGTTT GATCTTCTTT GGTCGATTAG AGGAACGCAA AGGAATAGTT
TTGTTTTTAG ATGCTCTATC AATTCTTGGC CTGCAGACCC ATCCAATTTA TTTTATAGGT
GCTGACGCAC CTTTAGATGG TTGTTGGGCG AGTCAATTAG TAGAGAGACG ACTAAAAGAA
AGCGGCCAGC TCTATCAGTG GCTGCCTGAT TTGAATCGCG ATGAAGCCCA TGCTGTACTA
CATGCAATAG GCGGAATTGT TGTAATTCCC TCTTTGATTG AAAATAGCCC TTACACCGTT
CAAGAATTGC TAGATACAAC ATTGAGTGTA GTTACAACAA ATGTAGGTGG AACCCCGGAA
CTGGTAGCTA ATGCTCAAGC AACTTTGTCA GAGCCCAACC CTCGCGACTT GGCTAACAAG
ATTCAAGATG CTCTGGCGGA CAACATTGAT TCAAAATCTA TTTTTAAAAT CAAATCAATT
GTTGATAAAT CGAGGATTCG TTTGAGTTGG CAAGAGTTTC ACTCTCGTTT GCCATCTTGT
GAATTTACTG TTCCAAGATG GAACCCTAAA GAAGCCATTG TGCTTATTAC TTTAGATTCA
TGTCGCCTTG ATACTTTTCA GTCTAGCTCT ACCGTTAACA TTAGCAAGAT CGGGCCTTTG
CATAAGGCGA AATCCCCAAG CTATTTTACT TATGCAAGCC ATGCAGCAAT GTTTATGGGT
TTCTTGCCTA GCACCTTAGA TCCAGTTGGC TTCGTTAACT CTAAATTTGC CAAAGTATTT
CGGCTTTCTC ACTCAGGTTT CCAGGCTTCA CGCACAGAAG AAAGCTTTGA ACTTTCTGGT
AATTCGATTA TTACTGGCCT GAGGCGAAAA GGTTATTTCA CTATTGGAAC TGCATCCGTT
AATTGGTTTG ACCCAGCTAC AGAAACTGGA CAACAGCTTG TTAAAGATTT TGATACTTTC
TGGTTTTCTG GCAACACCTG GAGCTTGAAC CGTCAGTTAT TATGGATTGA TAGTCAGTTG
CAACAAGAGC TTGATCGCCC ACCTTTTATT TTTTTGAACG TTGGCGAAAC TCATGTGCCT
TATTGGCATG AAGGTGCTTC GTGGTCTAGG GACGACCACC CATGCATACC ATTTCAGACA
CAGGATCGTC GTAAAGATTG CCAGGAACGT CAGCGTGCTT GCCTCGAGTT CATCGACAAA
CAACTTGGTT CTCTTCTTGA ACGTTTTAGT GAATCAACCG TAATTATTTG TTCGGATCAT
GGTGATTGCT GGGGGGAGGA TGGCCTATGG GAGCATGGTA TTTCCCATGA AAAAACTCTT
TCTGTTCCAT TATTGATGAG AATACGGGGC TGCCCGATCC CACCTCCGAC ACCTCCGCTT
AGTTTTCGAC AACGTATTTC AAATTTTGCC AGGAAACGTA TTTCAAAGCC TGTCAGGTGC
AAGCTAGCCT CTGCTCTTAG AAGAACTAAA GTCCTTTGA
 
Protein sequence
MGPSLLLRNQ IAEHKRPAYD YFEDIDIVDL QDKTLVKLDR HWSSYKASWP LADINEGFSS 
NSHVRDFEES LVLLLEARMA FCQSGDDSSM WVVKDPRTPI LLASWLRVME WLQIEPVFII
VHRNASDNIQ SFSKKGQVPQ RWAEALWQQS YVQIGKSIPS ASSVYSLDFS QILENPLHVA
QELKNFLGMH TDQPLVPTLK KAVDPSLPSK HSEYKLSSIS EKIELCLKQS RFSDLPSPDD
ITLEAYKIQA DLTPVSQLTL HNFGIELRKN KAMSHSSRKR ICILTAELQG YGPSGGIGTA
MLELAIELVS SGHLVEVWLV GSSNDPIPSS RLDSIHIRHL PGETVDQDPA NFRQQIAEAV
LTESFDIIHC HDWLGLGACL HLSTLETESP IVICGLHGPT QWVREGSPSI SNWTKRDSTI
IELEWQAIIN ADVLFSPSAY MKNWVSKHLG NRKYYPDIHV QLNCPSVAPN AKLSRNIDLP
PESKSSLIFF GRLEERKGIV LFLDALSILG LQTHPIYFIG ADAPLDGCWA SQLVERRLKE
SGQLYQWLPD LNRDEAHAVL HAIGGIVVIP SLIENSPYTV QELLDTTLSV VTTNVGGTPE
LVANAQATLS EPNPRDLANK IQDALADNID SKSIFKIKSI VDKSRIRLSW QEFHSRLPSC
EFTVPRWNPK EAIVLITLDS CRLDTFQSSS TVNISKIGPL HKAKSPSYFT YASHAAMFMG
FLPSTLDPVG FVNSKFAKVF RLSHSGFQAS RTEESFELSG NSIITGLRRK GYFTIGTASV
NWFDPATETG QQLVKDFDTF WFSGNTWSLN RQLLWIDSQL QQELDRPPFI FLNVGETHVP
YWHEGASWSR DDHPCIPFQT QDRRKDCQER QRACLEFIDK QLGSLLERFS ESTVIICSDH
GDCWGEDGLW EHGISHEKTL SVPLLMRIRG CPIPPPTPPL SFRQRISNFA RKRISKPVRC
KLASALRRTK VL