Gene P9303_03231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03231 
Symbol 
ID4776418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp334005 
End bp335360 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content57% 
IMG OID640085826 
Productlipid A disaccharide synthetase-like protein 
Protein accessionYP_001016341 
Protein GI124022034 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTCGG CCTTGCGAGT TCAAGTCTCG CCATCCGCAT TCTTTCCTTA CAACTTGACC 
TCACTCCCTC CAGCTGTCAT GCCGGACACT TCGCTGGCGG TTGTTCTGGT TTCCAATGGT
CCAGGGGAAC TAGCGACTTG GGTTAGACCC ATTGCTGAGA GCCTGCATCG CAGCATATTG
ATGCGTCCTC GCGCTGCAAG CTCACCTGTT GACCTGAGGC TGGTGTTGGT GCCTTGCCCG
AATGCCACAG GACAGGAGGC TGATGCGGCT AGGCGATGGA TTCAGTTTGA GCAGATCACG
CCAGCAAATC AGTTTTGGAA TTTGTTGTTA TTTCCCCGAC GCTATGGGCC ATGGCCTCGC
CAAGGAGTGG TCGTTTTCCT TGGGGGAGAT CAGTTCTGGA GCGTGCTTCT TTCAGGCCGC
TTGGGCTACC GCCATCTCAC CTACGCCGAG TGGGTGGCCC GCTGGCCTCG TTGGAATGAC
TGCATTGCTG CGATGTCTCC CAAAGTGCGG GATCAGCTGC CACGTCGTTT CCGGGAACGT
TGCACGGTGG TGGGAGACCT GATGGCGGAT TTGTCTTGTC TTGCACGCGC TGAAGCTCCG
CTGCCACAAG GCGACTGGGT GGCTTTATTG CCGGGGTCTA AACGGGCCAA GCTCTGCGTG
GGGGTTCCTT TCCTTTTGGA GGCGGCTGAT CGATTGGCTC GGCTGCGACC TGGATGTCGG
TTTTTGCTTC CTGTTGCACC CACCACAAGC GTTAAGGAGT TGGAAAGCTT TATGAGTTCG
AGCAATCCGA TTGCTGCTGC CTATCGCTCA GCTATTGCCA TGGTCAGGCC AGCTGAGCTT
GATCAGCCTT GGCGAAGATT GATCACCAGG GCCGGCACGG TCATCTATCT CCAAGAGGAC
CATCCTGCGC ATGGTCCCTT GAGCCAATGT GATTTAGCTC TCACCACTGT TGGGGCAAAT
ACGGCCGAAT TGGGGGCGCT CGGTTTGCCA ATGATTGTGA TCGTGCCCAC CCAGCATTTG
GCTGTGATGC AGGCATGGGA TGGCTGGATT GGTTTGCTGG CTCGGCTCCC TGGATTGCGT
TGGTGCATTG GTGTCTTGCT CAGTGCATGG CGGCTGCGAC GCCATGGATT CCTGGCTTGG
CCCAATATTT CTGCTGGACG CATGGTGGTT CCGGAGCGAG TGGGCTCAAT CTCGCCACAG
GACATTGCTA ATGAGGCGTC AGCTTGGCTG GAATCTCCTG AACGGCTCAG GGGGCTGCGC
GAGGATCTGC GCAGTCTGCG TGGGCAACCT GGGGCTGTGT CTGCACTGGT TCAGCAGGTG
CGTCGTTTGC TGCCCAAAGC TCTCGGTGCT TTTTAG
 
Protein sequence
MASALRVQVS PSAFFPYNLT SLPPAVMPDT SLAVVLVSNG PGELATWVRP IAESLHRSIL 
MRPRAASSPV DLRLVLVPCP NATGQEADAA RRWIQFEQIT PANQFWNLLL FPRRYGPWPR
QGVVVFLGGD QFWSVLLSGR LGYRHLTYAE WVARWPRWND CIAAMSPKVR DQLPRRFRER
CTVVGDLMAD LSCLARAEAP LPQGDWVALL PGSKRAKLCV GVPFLLEAAD RLARLRPGCR
FLLPVAPTTS VKELESFMSS SNPIAAAYRS AIAMVRPAEL DQPWRRLITR AGTVIYLQED
HPAHGPLSQC DLALTTVGAN TAELGALGLP MIVIVPTQHL AVMQAWDGWI GLLARLPGLR
WCIGVLLSAW RLRRHGFLAW PNISAGRMVV PERVGSISPQ DIANEASAWL ESPERLRGLR
EDLRSLRGQP GAVSALVQQV RRLLPKALGA F