Gene OSTLU_38010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38010 
Symbol 
ID5004080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp255693 
End bp257423 
Gene Length1731 bp 
Protein Length576 aa 
Translation table 
GC content63% 
IMG OID640419501 
ProductMFS family transporter: sugar 
Protein accessionXP_001420126 
Protein GI145351527 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.286894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG CCGCGCCCGA CGTCGACGTC GACGCCGACG TCGACGACGA CGACGGCGGC 
GACATCCCGC GTCCACCGCG CGCGCCCGAC GGCGCGGCGT CCACCGCGCG CGACGACGCG
CGCGATGACG ACGCCGACGA CCAAAGACTC GGCCTGCTCG ACGCCGCGAC GCCGCGCGGC
GAATTCGTCG TCGTCGCCCT CGACGCGCCG AACGACGACG GCGACGGCGA CGGCGACGAT
GCGCGTCGAC GAACGTACAC CGCGAACGAG GCGCTCGATC ACGTGGGATT CGGGAAGTTT
CAAATCCACG CGCTGTGCTT CGTCGGTTTC GCGTGGGCGG CGGACGCGAT GGAGATGATG
CTGCTCAGTT TCATCGGACC CGCGATGCGA TGCGAATTCG GGGTGTCGAG CGACGCGGAA
GGCGCGCTGA CGAGCGTGGT GTTCGTGGGA ATGGCTCTCG GGGCGCCGGC GTGGGGAGTG
GTGAGCGACG CGCGCGGTCG GAAGCCGGCG CTGCTGTTCA GCTCGACGAC GACGCTCGCG
GCGGGGATCG GAAGCGCGCT GGGCGGGAGC TTCGGGAGCG TGTTGTTCTT TAGATGCCTC
GTGGGCGTTG GTTTGGGTGG GGTACCCGTG GCGTACGGGT TGTTCATGGA GTTTTTACCG
AGGGAAAATC GCGGCGCGCG CTTGTCGTAC ATTGAGGCGT TTTGGACGCT GGGATCGATG
TTGGAGTCCG CGCTGGCGTG GATCGTGCTT CCGCGGCATT CGTGGCGAGT TTTACTGTTG
ATTTCAGCGG CGCCGTTGCT TGGATTGATC GCATGCATCT TTATCGTCCC GGAGAGCGTC
TTGTACTCGG TAAACGCCGG TCGAATGGAG GAGGCGAAGG AGACGTTGCG TCGCGTCGCG
GCGACGAATG GTAAATCTCT ACCGCAGGGC GAGCTCGTGG GGCCGAACGA TCGCGCGTCA
TCGAGTGGCG AGTTCGAAGA TCGTACTTCG TATGGCATGG GCGCGTCGGG AGCGTCGTCT
TCTACGATGA TGCAAAGGTT CGTTCCGAGC GGCGTTCGCG CGTTGCTGTC CAAGAAGCAC
GCGAAGACGT CTCTCTTGGT TTGGGTGATT TTCTTCGGTG TGGCGTTTTT GTACTACGGC
ATTGTCCTTC TCACAACGTC ACTCAACGTG CGCGACGACG AGTCCAAGCG TGGGGGAGAG
TTGGCGTGTC TAGCGCACGG TGCGCCACAT TTGAGCGACG GCGAGTACGC CGACATCTTT
CTCAGCTCGT TCGGCGAAAT TCCAGGCTTG ATCGTCGCGA TTATGATCGT CGACAAGATC
GGTCGCAGGC GCTCGATGGC GTTCACCGTG ATTGCCACCG CTGTGTTCTT GCTCCCCGTG
GCTTCATCGA GCATAAGTAA GGCGGTTCGT GACATCATGC TCTTCGGTGG AAGAAGCGCC
GCGTTCGCGG CGTTCACCGT CTTGTACATA TTCGCCGGCG AAGTCTATCC GACGTCGATC
CGTTCGACCG GTGTCGGCCT CGGAAACGGG TTCGCGCGCA TCGGTGGAAT AACATGCCCG
ATATTCGCTG TGACTTTGAT TGAGTCCGGA CATCTGACGC TCTCCGTCGT CGTCTTCATC
GCCGTCGCCG CCGTCGCGTG CGCCGCCGCG CTCTCGCTCG CCGTCGAAAC CGCCGGTCGC
GAGCTCGACG CCGACGACGA GCCGGGCGTC GAGCTCGCCC CAGTGGCCTA A
 
Protein sequence
MSAAAPDVDV DADVDDDDGG DIPRPPRAPD GAASTARDDA RDDDADDQRL GLLDAATPRG 
EFVVVALDAP NDDGDGDGDD ARRRTYTANE ALDHVGFGKF QIHALCFVGF AWAADAMEMM
LLSFIGPAMR CEFGVSSDAE GALTSVVFVG MALGAPAWGV VSDARGRKPA LLFSSTTTLA
AGIGSALGGS FGSVLFFRCL VGVGLGGVPV AYGLFMEFLP RENRGARLSY IEAFWTLGSM
LESALAWIVL PRHSWRVLLL ISAAPLLGLI ACIFIVPESV LYSVNAGRME EAKETLRRVA
ATNGKSLPQG ELVGPNDRAS SSGEFEDRTS YGMGASGASS STMMQRFVPS GVRALLSKKH
AKTSLLVWVI FFGVAFLYYG IVLLTTSLNV RDDESKRGGE LACLAHGAPH LSDGEYADIF
LSSFGEIPGL IVAIMIVDKI GRRRSMAFTV IATAVFLLPV ASSSISKAVR DIMLFGGRSA
AFAAFTVLYI FAGEVYPTSI RSTGVGLGNG FARIGGITCP IFAVTLIESG HLTLSVVVFI
AVAAVACAAA LSLAVETAGR ELDADDEPGV ELAPVA