Gene OSTLU_43281 details

Gene Information

Locus tag: OSTLU_43281
Symbol:
ID: 5005496
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 182592
End bp: 184151
Gene Length: 1560 bp
Protein Length: 488 aa
Translation table:
GC content: 57%
IMG OID: 640420917
Product: MFS family transporter: sugar
Protein accession: XP_001421412
Protein GI: 145354267
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
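
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the coding sequence consists of the protein's codons plus one stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 182592
end_bp = 184151
gene_length_bp = 1560
protein_length_aa = 488

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: coding bases = one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not accounted for by coding sequence.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, non_coding_bp)  # prints: 1560 1467 93
```

Under those assumptions the span matches the listed gene length, and the 93 bp left over is in line with the record marking the gene as spliced.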


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.210537
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0304903
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCCTCTG AGTCCTCCGA CGCGTCCCGA GCGCCTCGCG AAGGCGAAGC TATCCCAGCG 
GCGGGGACAT CATCTTCCGT GAGTTTTCAA CGGCGTAGAC TTGCGCGACG CGAGCGATGG
AAGCGCTACG GCGTCGTCGG GTTGATCACG GCCACTTCTC TGTTCCTGTA CGCCGATCAA
AACCTCATCG GGCCGAACAT GAGCGCCATA GCGGAGGAGT TCGGGATGGA TGAAAAAGAG
AAGGATGTCA AGCTGGGCGG GTTGCTCCAG CTGGCGTTTT TCGTCGTCGG CTCGCCGGCG
TCGCTGATCA TCGGGTACTA CGCCGATCGA GTGCGTCGAG TGCGTTTGTT CTTTTGGACG
ACACTGATCG GTGAAGGGCC GTGCATGGCG ACGTATTGGG TGACGAGTTA TTGGCAGTTG
TTTGCGCTTC GCGCGATGAC GGGCATCGCG GTGGGCGGAT GCTTACCGCT GTTGTTCTCG
TTGTGCGGAG ATTTGTTTGC GAGTCATGAG CGGAGTTACG TGGCGAGTTT TTTGACGATC
GCGACGGGAG CGGGAGTGGC GCTCGGACAA GTGGTGGCGG GTACGGTAGG GCCGGCGTAC
GGGTGGCGAC TACCGTTCAT CCTCACATCC GCGCCCGCGG TCGTGCTCGC AACGGTGATG
GTTCTCGTCG TCGACGAGCC CAAGCGTGGC GCTCAGGAGG AAGAAGTTCA TCGAATGACG
ATGCGAAGGA ATTCAACGAA AAATGTCGCT GAGCGGGATG ATTCCGACGA ACAAACGACG
GCGCAGACGG AGGACGAGCT CGAAAGCGGC GACGAAGCGG GAGATGTTTC GTATAAAGCC
AAGATGAATT GGAGCAAGGT CAAGAAGCAA TTGCTTGTGA AGACCAACAT CCTCGTGCTT
GCGCAAGGCT TGCCGGGCAC GGTGCCGTGG GGGGTGTTCA ATTCGTACTT TGTCGACTTT
TTACATAAGC AAAAGGGTAT GACGGTGCAA AACGCCACGG CGGCCATCAC CGTCTTCGGC
TTGGGCTCCG CCTTCGGCAC CATTGGGGGA GGATTCATCG GTCAGAGGAT GTACAATAAG
AAGAAGAGCG AACTTCCGAT TTTAATGGGA CTGACGACCG CTATTGGAGC ACTTCCAGCG
TATTACTATC TCAACGTCAA CGACTACGGT CCGGGGAGGG TCGGTTTGTA CCTGTCGTGC
CTCGTCGGCG GCGTTTTCTG CTCGGTGACA CCGCCCAATG TGCGCGCAAT TTTGTTGAAT
GTCAACCCGC CGGAAACTCG TGGAAGTATG TTTGCATTTT ACTCACAAAT TGACGACGTC
GGTAAAGGCG GTGGACCCGC GCTGGTCGCC TTGCTCATCG TGAGCATGGG CAGACGAGTG
GCGTTCAACG TGGCGTTTAC CTTTTGGTTT GTGTGTGGCG TTATTTTGGC CTGCATTACA
TTTACCATCG ATCACGACGT AGAGATGGAA CGAAGGGCCG TGCTCGAGGC TTTACAGACG
GACGAAAACG AGGTGGCACC GATAGAGAAC GACGGGGATT TGAACGCGGA CAGAGGGTAG
 
Protein sequence
MPSESSDASR APREGEAIPA AGTSSSVSFQ RRRLARRERW KRYGVVGLIT ATSLFLYADQ 
NLIGPNMSAI AEEFGMDEKE KDVKLGGLLQ LAFFVVGSPA SLIIGYYADR VRRVRLFFWT
TLIGEGPCMA TYWVTSYWQL FALRAMTGIA VGGCLPLLFS LCGDLFASHE RSYVASFLTI
ATGAGVALGQ VVAGTVGPAY GWRLPFILTS APAVVLATVM VLVVDEPKRG AQEEELESGD
EAGDVSYKAK MNWSKVKKQL LVKTNILVLA QGLPGTVPWG VFNSYFVDFL HKQKGMTVQN
ATAAITVFGL GSAFGTIGGG FIGQRMYNKK KSELPILMGL TTAIGALPAY YYLNVNDYGP
GRVGLYLSCL VGGVFCSVTP PNVRAILLNV NPPETRGSMF AFYSQIDDVG KGGGPALVAL
LIVSMGRRVA FNVAFTFWFV CGVILACITF TIDHDVEMER RAVLEALQTD ENEVAPIEND
GDLNADRG
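
Both sequences are printed in blocks of ten characters separated by spaces, so they need cleaning before being passed to other tools. The following is a minimal sketch of that step together with a simple GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database derived its own GC content figure is not stated in the record, so a value computed this way may differ from it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCCTCTG AGTCCTCCGA CGCGTCCCGA GCGCCTCGCG AAGGCGAAGC TATCCCAGCG "

cleaned = clean_sequence(first_line)
print(len(cleaned))           # number of bases after removing whitespace
print(gc_fraction(cleaned))   # GC fraction of this fragment only
```

The same `clean_sequence` call works on the protein sequence, since it only strips whitespace and upper-cases the text.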