Gene P9303_30171 details


Gene Information

Locus tag: P9303_30171
Symbol: sps
ID: 4777383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2670161
End bp: 2672284
Gene Length: 2124 bp
Protein Length: 707 aa
Translation table: 11
GC content: 52%
IMG OID: 640088541
Product: sucrose phosphate synthase
Protein accession: YP_001019012
Protein GI: 124024705
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG0438] Glycosyltransferase; [COG0561] Predicted hydrolases of the HAD superfamily
TIGRFAM ID: [TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial; [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain
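
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2670161, 2672284)
print(length, expected_protein_length(length))  # prints "2124 707"
```

Under those assumptions the result agrees with the listed gene length of 2124 bp and protein length of 707 aa.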


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.169301
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTTGA GATTGCTTCA TCTGCATCTT CATGGACTGT TTCGATCTCA CGACTTGGAA 
TTGGGTCGCG ATGCTGATAC AGGAGGCCAG GCTCTTTATG TGCTCGAGCT TGTCAGGGGC
CTGGCTGCGC GTTCGGAAAT TGAGCAGGTG GAGGTGGTTA CCCGCTTAAT TCATGATCGT
CGGGTTTCTA CTGACTATGC CAATCCTATT GAGGATATTG CTCCTGGGGC CAAGATCATC
AGGCTGCCTT TTGGACCAAG ACGATATTTG CGCAAGGAGT TGTTTTGGCC CTATTTGGAT
GATTTAGCGG ATCAGACCGT GAGCCATCTC CAACAGCAAG AACATCTTCC TGACTGGATT
CATGCTCATT ACGCCGATGC AGGCTATGTG GGAGCCCTTG TGAGTCGCAG GCTGGGTGTT
CCCTTGGTGT TTACAGGTCA TTCCTTGGGG AGGGAAAAGT TGCGGCGCCT GTTGGGAGTT
GGTGGAGACC ATGAACAGAT CGAGCAAACA TATGCCATTG GTCAACGCAT TGATGCTGAA
GAGTTGACTC TTGCTCATTG CAGTTTGGTG ATTACGAGTA CTCGCCAGGA AATTGATTAT
CAATATGCTC GCTATGGGCG TTTTGTTCCG GAGCAAGCGG AGGTGGTGCC TCCCGGCGTT
GATTCCATTC GCTTCCATCC ACTCCAATCT TCTAGTGAAA CGGATGTTGT TGATGGATTG
CTCGCGCCTT TTTTGAGGAA ACCTGCTTTA CCTCCTCTTT TGGCCATTTC CAGAGCTGTG
CGACGTAAAA ATATTCCTTT TTTGGTGGAG GCTTACGGCC GCTCGCCTGT GTTGCGTCAA
CGGCATAATC TTGTGCTTGT ACTCGGCTGT CGGGATGATC CTCGTCAGTT GGAGAAGCAA
CAGCGGGAGG TATTCCAGCA GGTTTTTGAT CTTGTGGATC GTTACGACCT CTATGGCCGG
GTGGCTTATC CCAAACAGCA CCGTCGTGAT CAGATCCCAG CGATTTACCG ATGGGCAGCC
TTGCATCGTG GTTTGTTTGT GAATCCAGCG CTTACTGAGC CCTTTGGGCT CACTTTGCTT
GAGGCGGCGG CCTGTGGTTT GCCGATGGTG GCGACTGATG ACGGTGGTCC ACGCGACATC
CTTGCTCGTT GTGAAAACGG CTTGTTGGTT GACGTTACTG ATCTGGAGGC ACTTCAGGAT
GTAATGGAGC AGGCTGGTTC AGACGCAGAT CAGTGGCGTC TTTGGAGTGA TAACGGGATT
GAGGCGGTCA GTCGGCATTT CAGTTGGGAT GCTCATGTTT GCCATTACTT GGCATTGATG
AAGCAACGTC TTGAATTGTC TCAGCCACGA ATCTGGGCCA CAGACAAAGA GTGTCTGGGC
AGTCCACTGG GGGAGAGTTT GTTGTTGCTT GATCTTGATA GTTCTCTCGA AGAACCCGAG
GCAGAGGGTT TGGCTTCGCT GCGGGAGGGA TTGGAATCCA TTGGCTCTGG CGATGCGCAT
GGTTTGGGAG TATTGACGGG TCGTTCTGTG CAGGCTGCAA AGAAGCGTTA TGCAGAGCTG
CATCTGCCTT TACCTCGGGT TTGGATTAGT CGCGCGGGTA CTGAAATCCA TTACGGATTG
GAGGATCAGG CGGATCGCTT TTGGCAGGCC CATATCGACG TTGATTGGCA GCGACAGGCA
GTGGTGTCTG CTTTGGCTGA TCTCAAGGAC CATTTGACCC TTCAGGACGA TCAGGAGCAG
GGTCCTCACA AGGTGAGTTA TCTATTGAAA GAGCATGGCG AGGCGATTTT GCCACTCGTC
CGTCAGCGTT TAAGGCAAAG GGGTCAGGCG GCACGTCCTC ATTTGCGTTG TCATTGGTTT
TTGGATGTGG TGCCTTTGCG GGCCTCCCGT AGCGAAGCCA TTCGCTATCT CGCTCTTCGT
TGGGGGCTGC CTCTTGAGCA GATTTTGGTT GTTGCCAGTC AACAAGGCGA TGCTGAATTG
GTAAGGGGAT TGACTGCTTC AGTAGTCCTT GCAGAGCATG ATCCTTGTCT GGAGGGTTTG
CGACATCAGC AACGAGTGTT TTTTGCTAGC AGCCCGCATT TGTTTGGACT TTTAGATGGC
TTAAATCACT ATCGGTTTTT CTGA
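
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. This is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied with its block spacing.
first_row = "ATGGGTTTGA GATTGCTTCA TCTGCATCTT CATGGACTGT TTCGATCTCA CGACTTGGAA "
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence through the same two functions gives values that can be compared with the 2124 bp and 52% reported in the record.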
 
Protein sequence
MGLRLLHLHL HGLFRSHDLE LGRDADTGGQ ALYVLELVRG LAARSEIEQV EVVTRLIHDR 
RVSTDYANPI EDIAPGAKII RLPFGPRRYL RKELFWPYLD DLADQTVSHL QQQEHLPDWI
HAHYADAGYV GALVSRRLGV PLVFTGHSLG REKLRRLLGV GGDHEQIEQT YAIGQRIDAE
ELTLAHCSLV ITSTRQEIDY QYARYGRFVP EQAEVVPPGV DSIRFHPLQS SSETDVVDGL
LAPFLRKPAL PPLLAISRAV RRKNIPFLVE AYGRSPVLRQ RHNLVLVLGC RDDPRQLEKQ
QREVFQQVFD LVDRYDLYGR VAYPKQHRRD QIPAIYRWAA LHRGLFVNPA LTEPFGLTLL
EAAACGLPMV ATDDGGPRDI LARCENGLLV DVTDLEALQD VMEQAGSDAD QWRLWSDNGI
EAVSRHFSWD AHVCHYLALM KQRLELSQPR IWATDKECLG SPLGESLLLL DLDSSLEEPE
AEGLASLREG LESIGSGDAH GLGVLTGRSV QAAKKRYAEL HLPLPRVWIS RAGTEIHYGL
EDQADRFWQA HIDVDWQRQA VVSALADLKD HLTLQDDQEQ GPHKVSYLLK EHGEAILPLV
RQRLRQRGQA ARPHLRCHWF LDVVPLRASR SEAIRYLALR WGLPLEQILV VASQQGDAEL
VRGLTASVVL AEHDPCLEGL RHQQRVFFAS SPHLFGLLDG LNHYRFF
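
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons. This is a sketch only: it builds the standard codon assignments, which translation table 11 shares for amino acids (the tables differ in which codons may act as start codons), and the name `translate` is hypothetical. It reads the forward strand as given and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate dna codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks (30 bases) of the gene sequence in this record.
print(translate("ATGGGTTTGA GATTGCTTCA TCTGCATCTT"))  # prints "MGLRLLHLHL"
```

The output matches the first block of the protein sequence listed above.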