Gene OSTLU_49086 details

Gene Information

Locus tag: OSTLU_49086
Symbol:
ID: 5000958
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 174897
End bp: 176462
Gene Length: 1566 bp
Protein Length: 506 aa
Translation table:
GC content: 61%
IMG OID: 640416379
Product: MFS family transporter: sugar (sialic acid)
Protein accession: XP_001416575
Protein GI: 145344098
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
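
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the result matches the listed gene length), and that the protein is followed by a single stop codon. The function names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 174897
END_BP = 176462
PROTEIN_LENGTH_AA = 506


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of the gene span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


span = gene_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)

# Prints: 1566 1521 45
print(span, coding, span - coding)
```

The 45 bp difference between the gene span and the coding length is consistent with the record marking the gene as spliced, although the record itself does not list intron coordinates.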


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0133076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGCG CGAGAGGGGG ACGCGGGGAG GAGGCGCGAG AACGCGGCGA CGTCGCGGCG 
CGCGCGAGAG CCGAACGACG CGATCGACGG GACGCGGTCG CGGCGAGCGG GAGCTCGAAG
ACGACGCGCG AGGATCGGGA CGAGGGCGAG GAGGATCCGG ACGAGGCGTG GGCGTTTGAG
TTTCCGAAGT CGGCGAGCGA CGTCACGACG CTCTGGCGAG GCGCGCCGAG TCGGTATCGC
GTGCTCTTCG TCACGGTGTT CGCGTTCATC GTGTGTAACA TGGATAAGGT GAATATTTCG
GTGGCGATCA TTCCCATGGC GCGGGAGTTC GGGTGGACGA GCACGCAGGC GGGGTTCGTG
CAGAGCGCGT TTTTTTACGG TTTCGCGGCG TCGCAGTTAC CGGGCGGGTA CTTGTCGACG
AAATTCGGTG GTGCCAAGGT GTTGCCGATC GGGATGTTGA TTTTGTCGTT GGCGACGATC
GCGATTCCGA TCGTCGGCGT GAACGAGCAG AGCATTTTCC TGTCGCGCGT GCTCGTGGGT
TTGGGCGAAG GCGTGGCGCC GAGCGCGGCG ACGGATATCA TCGCGAGAAG CGTCAGCGTG
GGCGAGCGTT CGCGCGCCGT CGGGTTCGTG TTCAGTGGGT TCAACATAGG TTCGGTGCTT
GGTTTGGGGG TGGCGCCACT ATTGATAGAG GCGACGAATT GGAGGACGGT GTTCGCATTC
TTCGGCTCGT GCGGTTTAGT TTGGAGTTTT TGGGCTTGGA AGCTGTACGG CGACGGCGGG
ATGGTTGACG AAAGTTACAA GGACGACGGC GTCACGGGTT TGACGGGTAA GCGCATATTC
ACCGTCGACG CAAAGGCGAT AGCGAGCGGG AAGAGCCCGG CGGAAGACCC TCCGGTGCCG
TGGGGGGAGT TTATATCGAA TCCGTCGGTG CGCGCGCTCA TGTACGTGCA CTTTTGCAAC
AACTGGGGCT TCTACGTCCT ACTCGCTTGG CTTCCGACGT ACTTTACCGA CGAGCTCGGG
GTGACACTGA CGAACGCATC GCTGTTGACT CTGCTTCCGC CGCTCGCGAA CGTCGCGATG
GCGTCCGTCG CCGGTCCGAC TGCGGACCGC CTCATCGGCA GCGGCATGGA GATCACGAAG
GTGCGTAAAA CGATGCAAGC AGTCGCCTTC ATGGGACCGG CGCTCGCCAT GGGCTCGGCC
GCATTGGTAG ATCAGCCGGT GGCGACCGTG GGTCTGCTCA CGCTCGGCCT TTCGCTAGGC
GCGTTTTCGT ACGCGGGTTT GTACTCAAAC CATCAAGATT TGTCGCCCAA GTACGCGAGT
ATCCTGTTGG GCATGACAAA CACGTGCGGC GCGCTTCCGG GCGTCATCGG CGTTCCGTTG
ACTGGGTACT TGATCAAAGA AACGGAAAAT TGGGAGCTTA GCATGTTCGT TCCGGCGATG
TTCTTCTACT TTACGGGAAC GCTCGTATTC AGCAAGTACG GCAGCGGCGA TCGACAAGCG
TTCACGGGAC AACCTATGCC CGAACCAGGC GAGATTCCGC CATCGTGCGA TGGCGGCGGA
CATTAA
 
Protein sequence
MGRARGGRGE EARERGDVAA RARAERRDRR DAVAASGSSK TTREDRDEGE EDPDEAWAFE 
FPKSASDVTT LWRGAPSRYR VLFVTVFAFI VCNMDKVNIS VAIIPMAREF GWTSTQAGFV
QSAFFYGFAA SQLPGGYLST KFGGAKVLPI GMLILSLATI AIPIVGVNEQ SIFLSRVLVG
LGEGVAPSAA TDIIARSVSV GERSRAVGFV FSGFNIGSVL GLGVAPLLIE ATNWRTVFAF
FGSCGLVWSF WAWKLYGDGG MVDESYKDDG AIASGKSPAE DPPVPWGEFI SNPSVRALMY
VHFCNNWGFY VLLAWLPTYF TDELGVTLTN ASLLTLLPPL ANVAMASVAG PTADRLIGSG
MEITKVRKTM QAVAFMGPAL AMGSAALVDQ PVATVGLLTL GLSLGAFSYA GLYSNHQDLS
PKYASILLGM TNTCGALPGV IGVPLTGYLI KETENWELSM FVPAMFFYFT GTLVFSKYGS
GDRQAFTGQP MPEPGEIPPS CDGGGH
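
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of that cleanup together with a plain GC-fraction calculation. The helper names are hypothetical, and the record does not state how its own GC content figure was computed, so this is one common definition rather than the database's method.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA string)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGGGGCGCG CGAGAGGGGG ACGCGGGGAG GAGGCGCGAG AACGCGGCGA CGTCGCGGCG"
seq = clean_sequence(first_row)

print(len(seq))  # 60 bases per displayed row
print(f"GC fraction of first row: {gc_content(seq):.2f}")
```

To reproduce a whole-gene figure, the same two functions would be applied to all rows of the gene sequence joined together.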