Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21951 |
Symbol | sps |
ID | 4779273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1853939 |
End bp | 1856047 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085493 |
Product | sucrose phosphate synthase |
Protein accession | YP_001016015 |
Protein GI | 124026900 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.269726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGATTAA GACTTTTACA TTTAAATTTA CATGGTTTAA TCCGTTCACA TGATCTTGAG TTGGGTAGAG ACTCAGATAC TGGAGGGCAA ACTTTATATG TTTTAGAATT AGTAAAAGGA CTTGCAGCAA GACCAGAAGT TGAAAAAGTT GAACTAATTA CAAGATTAAT TAATGATAGG AGAGTATCTT CTGACTACTC AAAGCCTGTT GAAAAGATAT CAAGTTGTGC GGAAATTATT CGATTGCCTT TTGGTCCTAA GCGTTATATG AGAAAGGAGT TGTTATGGCC TTACTTAGAT GATTTAGCTG ATCGCATAGT TCAAAGATTG CAGCAAGAAA ATAAATTCCC TGATTGGATT CATGCTCATT ATGCCGATGC TGGTTATGTA GGAGCATTGG TTAGTCGAAG ATTAGGATTA CCATTAGTTT TTACTGGCCA TTCATTGGGC CGTGAAAAAC TGAGAAGATT ATTGGCTGCT GGAATTGATC ATGATCAGAT AGAGCAAACT TATTCAATTA GCAAAAGAAT TGATGCAGAG GAATTAGCTT TAGCACATTC AAATTTACTT GTCACAAGCA CTAAGCAGGA ATCTCAGGAG CAGTACGCTC GTTATGGACG ATTTAGCTCA AAAAACATAG AAATAATTCC ACCTGGTGTT GATTTAAATC GCTTTTACTC AGCAGAGCTT AATTTAAAGG ATGAGGAAAA AGAATTAAAT AAACTTTTTA ATCCTTTTTT GAGAGATTTA AGTCTTCCAC CACTCTTAGC TATATCAAGA GCAGTAAGGA GAAAAAATAT TCCAGCGTTA ATTGAGATTT ATGGACGTTC GTCAATTTTG CAGCAAAGAC ATAATCTTAT TTTAATTTTA GGTTGTCGCC AAGATTCTCG TCAGCTTGAG AAACAGCAAA GAGAAGTATT TCAGCAAGTT TTTGAATTGG TTGATAAATA TAATTTATAT GGAAAAGTTG CTTTTCCAAA ACAACATAAA AGAGAGCAAA TCCCATCAAT TTATCGTTGG GCTGCAAATA GAAGCGGACT ATTTGTTAAT CCAGCTTTAA CTGAGCCATT TGGATTGACA TTATTGGAAG CCGCTGCGTG TGGTTTGCCT ACGGTAACTA CTGATGATGG CGGCCCAAGG GATATTCTTT CTCGTTGTGA AAATGGTCTA CTTGTAGATG TAACTGATTT AGAAGCATTT AGAGATGGTT TAGAGACAGC TGGTTCAAAC CTTTCTTTAT GGAAAACCTG GAGCAACAAT GGAGTTGAGG GAGTAAGCAG GCATTTTAGT TGGGATGCTC ATGTATGTAA TTACATTGCA TTGATGCAAA AGCGTCTTAA ATTTCTTGCA CCTCGACACT GGACACTTGG AAATATTAAA GAAACCTCTC CTATCGGTCA AAAAATAATA TTTCTTGATT TAGATAATTA CCTTGAGCAA TCAAAATCCT TATCAAAATT GAGAAACAAG TTAGAGAATA ATTCATTGAA TACTGATATC CAATTGGGAA TTTTAACTGG TAGATCTATT AAAGCTGCTC GTTATAGATA TGCAGAAACC CGACTACCCA AACCTGCGGT ATGGGTATGT CAAGCTGGGA CTGAAATTTA TTATTCTGAT GAAAATAAAT CAGATATCTT TTGGCAAGAT TCAATAACTG TTGATTGGAA TCGTAAAGAT GTAGAAAAGG TTTTGTTTGA TTTAAAAGAT TACTTAGAGC TTCAATCATC TGAACACCAA GCTCCTTATA AGGTTAGTTA TTTATTAAAA GAAACTAGTG ATGCAATTTT ACCTTTGGTA CGAAAAAGAC TTAGGCAGTC AGGACTTGCT GCTTCTCCTC ATCTAAAATG TCATTGGTAT TTAGATGTTG TTCCTTTGCG AGCATCACGA GCGGAAGCTA TTAGATACTT GACGTTACGT TGGGGTTTAT CTTTAGAAAA AGTTTTCGTA GTAGCAAGTC AGCAAGGCGA TGCTGAGTTG ATAAGGGGAT TAACCACAAG TATTATTCCT TTTGATCATG ATTCGTCATT AGATGGATTT AGATCCCAAA AAAGAGTTTT CTTTTCTGAG GCTCGAGATG GTTTTTTAGA TGGCTTAAAG GATTTTTGA
|
Protein sequence | MGLRLLHLNL HGLIRSHDLE LGRDSDTGGQ TLYVLELVKG LAARPEVEKV ELITRLINDR RVSSDYSKPV EKISSCAEII RLPFGPKRYM RKELLWPYLD DLADRIVQRL QQENKFPDWI HAHYADAGYV GALVSRRLGL PLVFTGHSLG REKLRRLLAA GIDHDQIEQT YSISKRIDAE ELALAHSNLL VTSTKQESQE QYARYGRFSS KNIEIIPPGV DLNRFYSAEL NLKDEEKELN KLFNPFLRDL SLPPLLAISR AVRRKNIPAL IEIYGRSSIL QQRHNLILIL GCRQDSRQLE KQQREVFQQV FELVDKYNLY GKVAFPKQHK REQIPSIYRW AANRSGLFVN PALTEPFGLT LLEAAACGLP TVTTDDGGPR DILSRCENGL LVDVTDLEAF RDGLETAGSN LSLWKTWSNN GVEGVSRHFS WDAHVCNYIA LMQKRLKFLA PRHWTLGNIK ETSPIGQKII FLDLDNYLEQ SKSLSKLRNK LENNSLNTDI QLGILTGRSI KAARYRYAET RLPKPAVWVC QAGTEIYYSD ENKSDIFWQD SITVDWNRKD VEKVLFDLKD YLELQSSEHQ APYKVSYLLK ETSDAILPLV RKRLRQSGLA ASPHLKCHWY LDVVPLRASR AEAIRYLTLR WGLSLEKVFV VASQQGDAEL IRGLTTSIIP FDHDSSLDGF RSQKRVFFSE ARDGFLDGLK DF
|
| |