Gene A9601_19201 details


Gene Information

Locus tag: A9601_19201
Symbol:
ID: 4718660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1659862
End bp: 1661271
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 31%
IMG OID: 640079655
Product: sucrose phosphate synthase
Protein accession: YP_001010309
Protein GI: 123969452
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain
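
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers fit) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1659862
end_bp = 1661271

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, leftover_bases, protein_length)
```

Under these assumptions the sketch gives 1410 bp with no leftover bases and 469 residues, matching the Gene Length and Protein Length fields.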


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.8791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGTTGA AATTTTTACA TTTACATTTA CATGGTCTTA TACGTTCTAA AAATCTTGAA 
TTAGGCAGGG ATGCAGATAC AGGAGGGCAA ACAAAATACG TTTTAGAGTT AATTAAAAGC
TTGGCTAATA CTTCAGAAGT GGATCAAGTG GATTTAGTTA CTCGTTTAAT AAAAGACCCT
AAAGTCGATG ATGAATATTC TCAAGAAGAA GAATTTGTAG AACCTGGAGT TAGAATTTTA
AGATTCAAAT TTGGACCCAA TAAATATTTA AGAAAGGAAT TGCTTTGGCC TTATTTAGAT
CATTTAACTG AAACCCTAAT TTCTTACTAT AAAAAAAGCA AAAAGCCTAA TTTCATCCAT
GCTCATTATG CAGATGCTGG ATATGTAGGA GTTAAACTAA GTAAATCTTT AAACGTTCCT
CTTATTTTTA CAGGTCATTC TTTAGGAAGA GAAAAAAAAA GGAAATTGCT TGATACTGGT
TTAAAAACTA ATCAAATAGA AAAACTTTAT TTTATTAGCA AAAGAATTGA GGCAGAAGAA
AAAGCATTGA AGTCTGCAGA TATTGTTGTT ACAAGCACTA AACAAGAGTC AGTGTATCAA
TATTCCCAAT ATTCTTCTTT TTCACCTCAC AAAGCTAAAG TTATTCCTCC TGGTGTTGAC
CATAAAAAAT TTCATCATAT TCACTCCACA AGCGAGACAG TCGAAATTGA TAATATGATG
AAACCTTTTC TAAAGGATTC TACTAAACCT CCATTTTTGA CTATTTCTAG AGCTGTACGA
AGAAAAAATA TCCCATCTTT GATTGAGGCA TATGGAAGAT CTGAAAAATT AAAAAGAAAA
ACTAATTTAA TTCTGATTTT GGGTTGTAGA GATAGTCCTT CAAAACTTGA TCCTCAACAA
AAAGATGTTT TCAATAATAT TTTTGAAATA ATTGATAAAT ATAATTTGTA TGGAAAGGTA
GCTTATCCAA AAAAACATCT TCCAAGTCAG ATTCCTGCTT TATATAGGTG GGCTGCTAGC
AGAGGGGGTG TATTTGTAAA TCCAGCTTTA ACAGAGCCTT TTGGTTTAAC TCTTCTTGAA
GCTTCTTCCT GTGGATTACC AATAATATCA ACAAATGATG GAGGGCCAAA AGAAATTCGT
TCAAAATGTG AAAATGGACT TCTAGTAGAT GTTACTGATA TTAATGAGTT AAAAGTTATT
CTTGAAAAAG GAATTTCAAA TAATAATCGG TGGAAATTAT GGAGCAGAAA CGGAATTGAG
GGTGTTAGCA GGCACTTTAG TTGGAACACT CATGTACGCA ATTATTTATC AGTACTAACT
GAAGAATTTT TAAGTTCAAA TAGTTATTCT TCATCTGACA TTAAACAAAG TTGTTTAAAA
GGAACTTCCT CACTTATAAA ACCCCATTGA
 
Protein sequence
MRLKFLHLHL HGLIRSKNLE LGRDADTGGQ TKYVLELIKS LANTSEVDQV DLVTRLIKDP 
KVDDEYSQEE EFVEPGVRIL RFKFGPNKYL RKELLWPYLD HLTETLISYY KKSKKPNFIH
AHYADAGYVG VKLSKSLNVP LIFTGHSLGR EKKRKLLDTG LKTNQIEKLY FISKRIEAEE
KALKSADIVV TSTKQESVYQ YSQYSSFSPH KAKVIPPGVD HKKFHHIHST SETVEIDNMM
KPFLKDSTKP PFLTISRAVR RKNIPSLIEA YGRSEKLKRK TNLILILGCR DSPSKLDPQQ
KDVFNNIFEI IDKYNLYGKV AYPKKHLPSQ IPALYRWAAS RGGVFVNPAL TEPFGLTLLE
ASSCGLPIIS TNDGGPKEIR SKCENGLLVD VTDINELKVI LEKGISNNNR WKLWSRNGIE
GVSRHFSWNT HVRNYLSVLT EEFLSSNSYS SSDIKQSCLK GTSSLIKPH
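
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino acid assignments). The helper name `translate` and the choice to show only the first and last sequence lines are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Spaces and line breaks are ignored so the grouped-by-ten layout
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = seq[i], seq[i + 1], seq[i + 2]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGGTTGA AATTTTTACA TTTACATTTA CATGGTCTTA TACGTTCTAA AAATCTTGAA"
last_line = "GGAACTTCCT CACTTATAAA ACCCCATTGA"

print(translate(first_line))  # MRLKFLHLHLHGLIRSKNLE
print(translate(last_line))   # GTSSLIKPH*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final nine residues followed by the TGA stop codon.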