Gene OSTLU_43351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43351 
Symbol 
ID5005580 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp364858 
End bp366261 
Gene Length1404 bp 
Protein Length444 aa 
Translation table 
GC content59% 
IMG OID640421001 
ProductMFS family transporter: sugar 
Protein accessionXP_001421273 
Protein GI145353977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGT CGAACAAGCG CGCGTTCGTG TTCACGTGCC TGGGCCTCGG GTTGGTGTTC 
GCCGATCAAA ATTTGTTAGC GCCGAATTTG ACCGCCATCG CGAACGATTT GAATCTGTCG
CCGAATGAGC GGGACTATAA ATTAGGCGGG CAGATCGCGT TCGCGTTCTT CTTGCTCGGC
GCGCCGGCGG CGGTGTTGAT CGGGTCGATG GCGGATTACT ATCCGCGCAC GAAGCTCTTC
GCGTGGACGA TGCTGTTGGG ATCGTGCCCC AACGTCCTGG CGTGGATGCC GGGGGTGACG
ACTTTTGGTC AGTTGTATTG GTTGCGCGCG CTGACGGGAA TCGCGGTAGG CGGCGCGGCG
CCGTTGACGT ACTCCTTAAT GTCAGACTTA TTTCCGCCGA GCGAGCGGAC GAAGATGAGC
GCGGTGACGG GGCTATCGAT GACGCTCGGG ATCGTCTGCG GGCAAGCGAT CGCGGGATTT
TTGGGGGAGT CGTACGGCTG GCGCTTGCCG TTCGCGGTGG TGGCGATTCC GGCGATATGC
GTGGCCATGG TGCTGATGTT CTTTGTCGAG GAGCCCGAGC GCGGGGCGAT GGAGGCGCCG
CCGGACGAGG AGGCAAGTCT CGAGCAAGAG TCGCTCGTCA TCCAATCGTC CTCACGACCC
TCGACACCGC CGATACCACC GCGAGGGCTG CACTTCAGCG CGACCGCGGC GAAGATGTAT
GCGCGAAAGT TGCACGGCAT CGTCTCCGTG CGCACCGTGG CGTTGTTCTT GGCGCAAGGC
GTGTCGGGTT GCGTGCCGTG GAGTATGATC AACACGTTCT TCAACGATTA CTTAGCGCAA
GATAAGGGGT TAGGTGTGAA GCAGAGCACG TCGATGCTGA TTTTATTCGC CGTCGGCGGC
ATGTTAGGGA CGATTTGGGC CGGGTGGTAC GGCCAAATTC TTTTCAACAA CAAACCGGAG
AACGTATCGA TTTTCATGGG ATCGAGCGCA ATCGTGGGGG TGTTCCCGGT GGCTTACTTG
GTGTTGGCGA ATTACGACAA CTCCGCGTCC GACATCGCGA TCAAGTCGAT CTTATCCTTT
ATTTCGGGAG GCATCGCTTC GTGCGTCGGC GTGAACATTC GAGCGCTGTT GCTCAACGTC
CTGCATCCCA TGAATCGCGG CACCGCGTTT TCCCTCTTCA TGCTCACCGA CGATTTAGGC
AAAGGATTCG GACCTCTCGT CGTCGCCGGC TTCGTCGCCG CGTTCGGTCG CGAGACGGCC
TTTTTCATCT CGGTTTTGTT CTGGATTCCT TGCGGGTGTC TTCTCGCCGC CTCGTGCTAC
ACGCTGAAAC GAGATTTACA AACCGCCGCT GCGCGTTACC AAACAGAACG CGCCGAAGAA
AATCGTTCGG TTCGATTGGA TTAG
 
Protein sequence
MSASNKRAFV FTCLGLGLVF ADQNLLAPNL TAIANDLNLS PNERDYKLGG QIAFAFFLLG 
APAAVLIGSM ADYYPRTKLF AWTMLLGSCP NVLAWMPGVT TFGQLYWLRA LTGIAVGGAA
PLTYSLMSDL FPPSERTKMS AVTGLSMTLG IVCGQAIAGF LGESYGWRLP FAVVAIPAIC
VAMVLMFFVE EPERGAMEAP PDEEASLEQD ATAAKMYARK LHGIVSVRTV ALFLAQGVSG
CVPWSMINTF FNDYLAQDKG LGVKQSTSML ILFAVGGMLG TIWAGWYGQI LFNNKPENVS
IFMGSSAIVG VFPVAYLVLA NYDNSASDIA IKSILSFISG GIASCVGVNI RALLLNVLHP
MNRGTAFSLF MLTDDLGKGF GPLVVAGFVA AFGRETAFFI SVLFWIPCGC LLAASCYTLK
RDLQTAAARY QTERAEENRS VRLD