Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_30171 |
Symbol | sps |
ID | 4777383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2670161 |
End bp | 2672284 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640088541 |
Product | sucrose phosphate synthase |
Protein accession | YP_001019012 |
Protein GI | 124024705 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.169301 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGA GATTGCTTCA TCTGCATCTT CATGGACTGT TTCGATCTCA CGACTTGGAA TTGGGTCGCG ATGCTGATAC AGGAGGCCAG GCTCTTTATG TGCTCGAGCT TGTCAGGGGC CTGGCTGCGC GTTCGGAAAT TGAGCAGGTG GAGGTGGTTA CCCGCTTAAT TCATGATCGT CGGGTTTCTA CTGACTATGC CAATCCTATT GAGGATATTG CTCCTGGGGC CAAGATCATC AGGCTGCCTT TTGGACCAAG ACGATATTTG CGCAAGGAGT TGTTTTGGCC CTATTTGGAT GATTTAGCGG ATCAGACCGT GAGCCATCTC CAACAGCAAG AACATCTTCC TGACTGGATT CATGCTCATT ACGCCGATGC AGGCTATGTG GGAGCCCTTG TGAGTCGCAG GCTGGGTGTT CCCTTGGTGT TTACAGGTCA TTCCTTGGGG AGGGAAAAGT TGCGGCGCCT GTTGGGAGTT GGTGGAGACC ATGAACAGAT CGAGCAAACA TATGCCATTG GTCAACGCAT TGATGCTGAA GAGTTGACTC TTGCTCATTG CAGTTTGGTG ATTACGAGTA CTCGCCAGGA AATTGATTAT CAATATGCTC GCTATGGGCG TTTTGTTCCG GAGCAAGCGG AGGTGGTGCC TCCCGGCGTT GATTCCATTC GCTTCCATCC ACTCCAATCT TCTAGTGAAA CGGATGTTGT TGATGGATTG CTCGCGCCTT TTTTGAGGAA ACCTGCTTTA CCTCCTCTTT TGGCCATTTC CAGAGCTGTG CGACGTAAAA ATATTCCTTT TTTGGTGGAG GCTTACGGCC GCTCGCCTGT GTTGCGTCAA CGGCATAATC TTGTGCTTGT ACTCGGCTGT CGGGATGATC CTCGTCAGTT GGAGAAGCAA CAGCGGGAGG TATTCCAGCA GGTTTTTGAT CTTGTGGATC GTTACGACCT CTATGGCCGG GTGGCTTATC CCAAACAGCA CCGTCGTGAT CAGATCCCAG CGATTTACCG ATGGGCAGCC TTGCATCGTG GTTTGTTTGT GAATCCAGCG CTTACTGAGC CCTTTGGGCT CACTTTGCTT GAGGCGGCGG CCTGTGGTTT GCCGATGGTG GCGACTGATG ACGGTGGTCC ACGCGACATC CTTGCTCGTT GTGAAAACGG CTTGTTGGTT GACGTTACTG ATCTGGAGGC ACTTCAGGAT GTAATGGAGC AGGCTGGTTC AGACGCAGAT CAGTGGCGTC TTTGGAGTGA TAACGGGATT GAGGCGGTCA GTCGGCATTT CAGTTGGGAT GCTCATGTTT GCCATTACTT GGCATTGATG AAGCAACGTC TTGAATTGTC TCAGCCACGA ATCTGGGCCA CAGACAAAGA GTGTCTGGGC AGTCCACTGG GGGAGAGTTT GTTGTTGCTT GATCTTGATA GTTCTCTCGA AGAACCCGAG GCAGAGGGTT TGGCTTCGCT GCGGGAGGGA TTGGAATCCA TTGGCTCTGG CGATGCGCAT GGTTTGGGAG TATTGACGGG TCGTTCTGTG CAGGCTGCAA AGAAGCGTTA TGCAGAGCTG CATCTGCCTT TACCTCGGGT TTGGATTAGT CGCGCGGGTA CTGAAATCCA TTACGGATTG GAGGATCAGG CGGATCGCTT TTGGCAGGCC CATATCGACG TTGATTGGCA GCGACAGGCA GTGGTGTCTG CTTTGGCTGA TCTCAAGGAC CATTTGACCC TTCAGGACGA TCAGGAGCAG GGTCCTCACA AGGTGAGTTA TCTATTGAAA GAGCATGGCG AGGCGATTTT GCCACTCGTC CGTCAGCGTT TAAGGCAAAG GGGTCAGGCG GCACGTCCTC ATTTGCGTTG TCATTGGTTT TTGGATGTGG TGCCTTTGCG GGCCTCCCGT AGCGAAGCCA TTCGCTATCT CGCTCTTCGT TGGGGGCTGC CTCTTGAGCA GATTTTGGTT GTTGCCAGTC AACAAGGCGA TGCTGAATTG GTAAGGGGAT TGACTGCTTC AGTAGTCCTT GCAGAGCATG ATCCTTGTCT GGAGGGTTTG CGACATCAGC AACGAGTGTT TTTTGCTAGC AGCCCGCATT TGTTTGGACT TTTAGATGGC TTAAATCACT ATCGGTTTTT CTGA
|
Protein sequence | MGLRLLHLHL HGLFRSHDLE LGRDADTGGQ ALYVLELVRG LAARSEIEQV EVVTRLIHDR RVSTDYANPI EDIAPGAKII RLPFGPRRYL RKELFWPYLD DLADQTVSHL QQQEHLPDWI HAHYADAGYV GALVSRRLGV PLVFTGHSLG REKLRRLLGV GGDHEQIEQT YAIGQRIDAE ELTLAHCSLV ITSTRQEIDY QYARYGRFVP EQAEVVPPGV DSIRFHPLQS SSETDVVDGL LAPFLRKPAL PPLLAISRAV RRKNIPFLVE AYGRSPVLRQ RHNLVLVLGC RDDPRQLEKQ QREVFQQVFD LVDRYDLYGR VAYPKQHRRD QIPAIYRWAA LHRGLFVNPA LTEPFGLTLL EAAACGLPMV ATDDGGPRDI LARCENGLLV DVTDLEALQD VMEQAGSDAD QWRLWSDNGI EAVSRHFSWD AHVCHYLALM KQRLELSQPR IWATDKECLG SPLGESLLLL DLDSSLEEPE AEGLASLREG LESIGSGDAH GLGVLTGRSV QAAKKRYAEL HLPLPRVWIS RAGTEIHYGL EDQADRFWQA HIDVDWQRQA VVSALADLKD HLTLQDDQEQ GPHKVSYLLK EHGEAILPLV RQRLRQRGQA ARPHLRCHWF LDVVPLRASR SEAIRYLALR WGLPLEQILV VASQQGDAEL VRGLTASVVL AEHDPCLEGL RHQQRVFFAS SPHLFGLLDG LNHYRFF
|
| |