Gene P9303_01471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01471 
Symbol 
ID4776603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp161853 
End bp165002 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content42% 
IMG OID640085646 
Producthypothetical protein 
Protein accessionYP_001016167 
Protein GI124021860 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTAT ACCCTCAGCC TTGGCCTCCT TTCCCAGTTA ATGAAATTTA CCCAAAATCA 
GTGAGAGGGC TGATAGACGA TTGGGCTCGT GCACCAGTCA GGTCACGTGA TCAGGTAGTT
GATTGTCAGC TAGCACATTT GGTGTCAATT GTTGTTCCAT GCTTTGACCC TGATCCATCT
CAATTTAGCC AGTTGTTGTT TTCTTTGCAG CAGCAAGGCG ATCAGGAGTT TGATGTAGTA
TTGATTAATG ATGGATCGGA TAATAATTCC TGGAGAAATA TTCAACTTAA ACTTAAGAAC
TTTCCTTGGA TACGGGTAAT TAATCAGCCT GAGAATAAAG GTATATCGGC AGCGTTAAAC
TTGGCTGTGG ATAATATCCA GACTCCCTAT ATCGCAATCG TTGATCAGGA CGACTTGTTG
CATCCGGCTG CGGTTTCGAT TGTAAATGGC TATCTCAGAG AGAATGTTGA TTGTCGTTTG
CTTTACACTG ATCACCTTGC ATTTCAGAAT GATGGAAGCA AGGCAACATA TACCCCTAAG
TTCCCTTGGA ATCCAGATGC TCTGCTTGAA TTTAATTATT TAATTCATCT TAGCGTCATC
AGTGTGGATT CCTATCGTGC TTGTGGCGGG ATGAACTCTT ACTTTGACGG CATTCAAGAT
TGGGAATTTT ATCTCCGCTT GGCAAAAGGA TTGACTTCGC AGACTGTGGC TTATTTGCCT
CTGCCTCTTT ATGCTTGGCG ACTCTCAGAT CAATCTGTGG CATCCTCCGC AACACCGAAG
CAGGAATTGC GCAATAAAGC CTTGGAATTT TTACAGCTTG CTCATCAAGA GCTAGGCGAA
GGTACTCGCG CGATGCAATC TCCAGATGAT TCCAGTCACT ACAAATTTCG TGTGGACCGT
AGTGATATGT CTAAGTTGCC TGTTTCATGC AATGTTTTGT TGCTAGGTGA GCGAGATTCG
GAGAATCCTC TTCATCAAAC CCTTCAATCG CTGCAAAGCT CAGAATTATG TTTTGGGCAA
ATATTTCTCA TTCAGACTCC TAATAGTTCT TTCTCAGCCT CAACTTCTTC AATGGTAGAA
GGTTTGGATG TCAATCGGGT TGAAATCGGT GAACTAGCAT CTTCAATCCC TGATGATCAA
CCTTTATTGG TTTTGCAAGT TGGGGTCTCC TCCAAGGGCA AGATTGATCC AGGTTTGAAT
GCTTGGCTGG AGCGCACATC TCGTTGGCAG GTAATTACAT TCCCCGTTTG TTCTTCGGTA
GATCTTCCGC TATGTGTCTC GGCTGGATAT GCAAGGCTAC CTGCAGTCTC AGATACCTAT
ATTCCTATAG GGCAAGCTTT TACTAAAAAG AACTACAACA ATAACTTCGC TTTTTTCTCT
CATGTTCGCC CTGTAGACCT TCCATCGCCT GCTGTTCAGT TGTTGCGCTC TTCTGTGGTA
CGGCAGACGT TAACAACATT TGCCTCAAGT TGGGATCAGA AGATGGATAT TAGGGCTAAA
TGGTGGAGCT GTTTGATGGA GCTAAGTTGG GATTGTTGCT GTATCTCTGA TGTTTGGTTC
GGATTAGATA CTTTTCTTTC TGATGCTGAG CATCAGAAAC TGGTCCAGAA GAGAGAGCAA
TCGCTTTTTG CCATAGAATC TCAAAATTGG TTGGGAACTA TATCACCTTG GAAGAAAGCC
ATATATTCAT ATTGGCTTAA AGGGTCTTTA CGTGACTTGA CTGCCAGAGC TCATCCATTG
CAGGCTGAAT TCTTTTTTAG CATCCACATA CCATTGGAGA TTCCCGCCGA TAATGTTTTT
GCGGGCGAGC GTAATTCTTC CAATCTTTCC CTATTGCCTC AGCTTTTGCA TAGATCTCTG
GTGATATTGA TTCCAACCGA ACTGAATCCC CGTAGTAATG GTCATGCATG CATTCTTAGT
TTGGCTCTAA AGCTTATAGA TGCAGGACAT TCTGTTTATC TCCTGCCGTT TAAGCCATTT
AAGTTTTTCC GAGAGTTTTA TCCTAATTTG CCTGAGAGAT TTCAAGACTT ACCTTTTATC
TCTGATCCGC AAGGATTTTC ACAAGCAATG CTTTTGGTTC CAGAATCAAC ACCTAGAAGT
TTAGTTAAAA GATTACGTCC ATACTTTAAG CAACTTATTT GGTGGGTTTT GGCTCCTGCA
GGAGTTCTCA CTGAGTTTCG TCCAAATATT CGGATTGGAG ACTTTCTGGT TGCATTCTCT
GAATTTGCTT TGCCTGGGCA GTCTGATTAT CTTTTCATCC ATCCTGATGT TGACCAAGAT
GTCGATCCTC TTTTCCCAAG ATATTTGAAA CAATATCGCC ACCAACCTCC CCATAAAAAG
CGTCTTCTTC TCTATACTGG AAAAGGAAGG CTTAAAGCAC TGCCGCGCAA TTTGCATCGC
AACTTGTTGC ATTATGAAGT GACTTTGGTG ACAAGATCTT TCCCATCCAC TAAGGCTGAG
TTGACAGATT TATTGATCAA TTCGAGTGGC TTGATAAGTT GTGACCCAAT GACAAATTTA
AACTTGGAGG CAGCTATGTT GGGATTACCA GTTTATCTTC TTGCCAACCC TTTCCCCGCA
GAATACTTTC GAAATTTTCC CGTTGATTTA TCTTATTCGA TAACTGATTC TGCAGAAGAC
TTTATAGTTC GCTTAGCAGA TAAAGGACCT TTAAAGCAGT TGCGGACAGT AGGCATGGAA
TTGAAAAGTC GATCTGCTGT TGATATTTTC GATTTACTTT TGTTGAATCC GCCTTTGCAT
AATGATCATG CTCTTGATCA TGCTCTTACT CTTCCTGTTG GAGCATACCG AGTGAACGAG
AGTACTCTCT ACCAGATAGA GCAATATAGA AAGTTTCTTA TGCGAAGCCG TACAATTCAA
GCTTTGAAGG AAGGGCAGTC TTTCTCTTCA GCTTTTTTAG GTGAGTATGT TGACAGTCTT
AAGTACCCTT TCTGGGCGCA TTCATTGATT TGTCAAGGAT TAGCTAGATT GGATGATCTG
GCAGATTTTT TGGCGACTAT CAGAGTTCTC TACCTATGCT TGCTCCTGTG GAAAAAGCTT
GGTTTAGCTA CTCTTTTCAG ATTCTTCCTT AATAGAATAA TCAACTTTAA CCGAATCATT
GCAGTCAGCA TGTTGGCGAA AAAAGCATAA
 
Protein sequence
MSLYPQPWPP FPVNEIYPKS VRGLIDDWAR APVRSRDQVV DCQLAHLVSI VVPCFDPDPS 
QFSQLLFSLQ QQGDQEFDVV LINDGSDNNS WRNIQLKLKN FPWIRVINQP ENKGISAALN
LAVDNIQTPY IAIVDQDDLL HPAAVSIVNG YLRENVDCRL LYTDHLAFQN DGSKATYTPK
FPWNPDALLE FNYLIHLSVI SVDSYRACGG MNSYFDGIQD WEFYLRLAKG LTSQTVAYLP
LPLYAWRLSD QSVASSATPK QELRNKALEF LQLAHQELGE GTRAMQSPDD SSHYKFRVDR
SDMSKLPVSC NVLLLGERDS ENPLHQTLQS LQSSELCFGQ IFLIQTPNSS FSASTSSMVE
GLDVNRVEIG ELASSIPDDQ PLLVLQVGVS SKGKIDPGLN AWLERTSRWQ VITFPVCSSV
DLPLCVSAGY ARLPAVSDTY IPIGQAFTKK NYNNNFAFFS HVRPVDLPSP AVQLLRSSVV
RQTLTTFASS WDQKMDIRAK WWSCLMELSW DCCCISDVWF GLDTFLSDAE HQKLVQKREQ
SLFAIESQNW LGTISPWKKA IYSYWLKGSL RDLTARAHPL QAEFFFSIHI PLEIPADNVF
AGERNSSNLS LLPQLLHRSL VILIPTELNP RSNGHACILS LALKLIDAGH SVYLLPFKPF
KFFREFYPNL PERFQDLPFI SDPQGFSQAM LLVPESTPRS LVKRLRPYFK QLIWWVLAPA
GVLTEFRPNI RIGDFLVAFS EFALPGQSDY LFIHPDVDQD VDPLFPRYLK QYRHQPPHKK
RLLLYTGKGR LKALPRNLHR NLLHYEVTLV TRSFPSTKAE LTDLLINSSG LISCDPMTNL
NLEAAMLGLP VYLLANPFPA EYFRNFPVDL SYSITDSAED FIVRLADKGP LKQLRTVGME
LKSRSAVDIF DLLLLNPPLH NDHALDHALT LPVGAYRVNE STLYQIEQYR KFLMRSRTIQ
ALKEGQSFSS AFLGEYVDSL KYPFWAHSLI CQGLARLDDL ADFLATIRVL YLCLLLWKKL
GLATLFRFFL NRIINFNRII AVSMLAKKA