Gene NATL1_21951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21951 
Symbolsps 
ID4779273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1853939 
End bp1856047 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content36% 
IMG OID640085493 
Productsucrose phosphate synthase 
Protein accessionYP_001016015 
Protein GI124026900 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial
[TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGATTAA GACTTTTACA TTTAAATTTA CATGGTTTAA TCCGTTCACA TGATCTTGAG 
TTGGGTAGAG ACTCAGATAC TGGAGGGCAA ACTTTATATG TTTTAGAATT AGTAAAAGGA
CTTGCAGCAA GACCAGAAGT TGAAAAAGTT GAACTAATTA CAAGATTAAT TAATGATAGG
AGAGTATCTT CTGACTACTC AAAGCCTGTT GAAAAGATAT CAAGTTGTGC GGAAATTATT
CGATTGCCTT TTGGTCCTAA GCGTTATATG AGAAAGGAGT TGTTATGGCC TTACTTAGAT
GATTTAGCTG ATCGCATAGT TCAAAGATTG CAGCAAGAAA ATAAATTCCC TGATTGGATT
CATGCTCATT ATGCCGATGC TGGTTATGTA GGAGCATTGG TTAGTCGAAG ATTAGGATTA
CCATTAGTTT TTACTGGCCA TTCATTGGGC CGTGAAAAAC TGAGAAGATT ATTGGCTGCT
GGAATTGATC ATGATCAGAT AGAGCAAACT TATTCAATTA GCAAAAGAAT TGATGCAGAG
GAATTAGCTT TAGCACATTC AAATTTACTT GTCACAAGCA CTAAGCAGGA ATCTCAGGAG
CAGTACGCTC GTTATGGACG ATTTAGCTCA AAAAACATAG AAATAATTCC ACCTGGTGTT
GATTTAAATC GCTTTTACTC AGCAGAGCTT AATTTAAAGG ATGAGGAAAA AGAATTAAAT
AAACTTTTTA ATCCTTTTTT GAGAGATTTA AGTCTTCCAC CACTCTTAGC TATATCAAGA
GCAGTAAGGA GAAAAAATAT TCCAGCGTTA ATTGAGATTT ATGGACGTTC GTCAATTTTG
CAGCAAAGAC ATAATCTTAT TTTAATTTTA GGTTGTCGCC AAGATTCTCG TCAGCTTGAG
AAACAGCAAA GAGAAGTATT TCAGCAAGTT TTTGAATTGG TTGATAAATA TAATTTATAT
GGAAAAGTTG CTTTTCCAAA ACAACATAAA AGAGAGCAAA TCCCATCAAT TTATCGTTGG
GCTGCAAATA GAAGCGGACT ATTTGTTAAT CCAGCTTTAA CTGAGCCATT TGGATTGACA
TTATTGGAAG CCGCTGCGTG TGGTTTGCCT ACGGTAACTA CTGATGATGG CGGCCCAAGG
GATATTCTTT CTCGTTGTGA AAATGGTCTA CTTGTAGATG TAACTGATTT AGAAGCATTT
AGAGATGGTT TAGAGACAGC TGGTTCAAAC CTTTCTTTAT GGAAAACCTG GAGCAACAAT
GGAGTTGAGG GAGTAAGCAG GCATTTTAGT TGGGATGCTC ATGTATGTAA TTACATTGCA
TTGATGCAAA AGCGTCTTAA ATTTCTTGCA CCTCGACACT GGACACTTGG AAATATTAAA
GAAACCTCTC CTATCGGTCA AAAAATAATA TTTCTTGATT TAGATAATTA CCTTGAGCAA
TCAAAATCCT TATCAAAATT GAGAAACAAG TTAGAGAATA ATTCATTGAA TACTGATATC
CAATTGGGAA TTTTAACTGG TAGATCTATT AAAGCTGCTC GTTATAGATA TGCAGAAACC
CGACTACCCA AACCTGCGGT ATGGGTATGT CAAGCTGGGA CTGAAATTTA TTATTCTGAT
GAAAATAAAT CAGATATCTT TTGGCAAGAT TCAATAACTG TTGATTGGAA TCGTAAAGAT
GTAGAAAAGG TTTTGTTTGA TTTAAAAGAT TACTTAGAGC TTCAATCATC TGAACACCAA
GCTCCTTATA AGGTTAGTTA TTTATTAAAA GAAACTAGTG ATGCAATTTT ACCTTTGGTA
CGAAAAAGAC TTAGGCAGTC AGGACTTGCT GCTTCTCCTC ATCTAAAATG TCATTGGTAT
TTAGATGTTG TTCCTTTGCG AGCATCACGA GCGGAAGCTA TTAGATACTT GACGTTACGT
TGGGGTTTAT CTTTAGAAAA AGTTTTCGTA GTAGCAAGTC AGCAAGGCGA TGCTGAGTTG
ATAAGGGGAT TAACCACAAG TATTATTCCT TTTGATCATG ATTCGTCATT AGATGGATTT
AGATCCCAAA AAAGAGTTTT CTTTTCTGAG GCTCGAGATG GTTTTTTAGA TGGCTTAAAG
GATTTTTGA
 
Protein sequence
MGLRLLHLNL HGLIRSHDLE LGRDSDTGGQ TLYVLELVKG LAARPEVEKV ELITRLINDR 
RVSSDYSKPV EKISSCAEII RLPFGPKRYM RKELLWPYLD DLADRIVQRL QQENKFPDWI
HAHYADAGYV GALVSRRLGL PLVFTGHSLG REKLRRLLAA GIDHDQIEQT YSISKRIDAE
ELALAHSNLL VTSTKQESQE QYARYGRFSS KNIEIIPPGV DLNRFYSAEL NLKDEEKELN
KLFNPFLRDL SLPPLLAISR AVRRKNIPAL IEIYGRSSIL QQRHNLILIL GCRQDSRQLE
KQQREVFQQV FELVDKYNLY GKVAFPKQHK REQIPSIYRW AANRSGLFVN PALTEPFGLT
LLEAAACGLP TVTTDDGGPR DILSRCENGL LVDVTDLEAF RDGLETAGSN LSLWKTWSNN
GVEGVSRHFS WDAHVCNYIA LMQKRLKFLA PRHWTLGNIK ETSPIGQKII FLDLDNYLEQ
SKSLSKLRNK LENNSLNTDI QLGILTGRSI KAARYRYAET RLPKPAVWVC QAGTEIYYSD
ENKSDIFWQD SITVDWNRKD VEKVLFDLKD YLELQSSEHQ APYKVSYLLK ETSDAILPLV
RKRLRQSGLA ASPHLKCHWY LDVVPLRASR AEAIRYLTLR WGLSLEKVFV VASQQGDAEL
IRGLTTSIIP FDHDSSLDGF RSQKRVFFSE ARDGFLDGLK DF