Gene OSTLU_43234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43234 
Symbol 
ID5005505 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp222030 
End bp223433 
Gene Length1404 bp 
Protein Length444 aa 
Translation table 
GC content59% 
IMG OID640420926 
ProductMFS family transporter: metabolite 
Protein accessionXP_001421238 
Protein GI145353903 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGT CGAACAAGCG CGCGTTCGTG TTCACGTGCC TGGGCCTCGG GTTGGTGTTC 
GCCGATCAAA ATTTGTTAGC GCCGAATTTG ACCGCCATCG CGAACGATTT GAATCTGTCG
CCGAATGAGC GGGACTATAA ATTAGGCGGG CAGATCGCGT TCGCGTTCTT CTTGCTCGGC
GCGCCGGCGG CGGTGTTGAT CGGGTCGATG GCGGATTACT ATCCGCGCAC GAAGCTCTTC
GCGTGGACGA TGCTGTTGGG ATCGTGCCCC AACGTCCTGG CGTGGATGCC GGGGGTGACG
ACTTTTGGTC AGTTGTATTG GTTGCGCGCG CTGACGGGAA TCGCGGTAGG CGGCGCGGCG
CCGTTGACGT ACTCCTTAAT GTCAGACTTA TTTCCGCCGA GCGAGCGGAC GAAGATGAGC
GCGGTGACGG GGCTATCGAT GACGCTCGGG ATCGTCTGCG GGCAAGCGAT CGCGGGATTT
TTGGGGGAGT CGTACGGCTG GCGCTTGCCG TTCGCGGTGG TGGCGATTCC GGCGATATGC
GTGGCCATGG TGCTGATGTT CTTTGTCGAG GAGCCCGAGC GCGGGGCGAT GGAGGCGCCG
CCGGACGAGG AGGCAAGTCT CGAGCAAGAG TCGCTCGTCA TCCAATCGTC CTCACGACCC
TCGACACCGC CGATACCACC GCGAGGGCTG CACTTCAGCG CGACCGCGGC GAAGATGTAT
GCGCGAAAGT TGCACGGCAT CGTCTCCGTG CGCACCGTGG CGTTGTTCTT GGCGCAAGGC
GTGTCGGGTT GCGTGCCGTG GAGTATGATC AACACGTTCT TCAACGATTA CTTAGCGCAA
GATAAGGGGT TAGGTGTGAA GCAGAGCACG TCGATGCTGA TTTTATTCGC CGTCGGCGGC
ATGTTAGGGA CGATTTGGGC CGGGTGGTAC GGCCAAATTC TTTTCAACAA CAAACCGGAG
AACGTATCGA TTTTCATGGG ATCGAGCGCA ATCGTGGGGG TGTTCCCGGT GGCTTACTTG
GTGTTGGCGA ATTACGACAA CTCCGCGTCC GACATCGCGA TCAAGTCGAT CTTATCCTTT
ATTTCGGGAG GCATCGCTTC GTGCGTCGGC GTGAACATTC GAGCGCTGTT GCTCAACGTC
CTGCATCCCA TGAATCGCGG CACCGCGTTT TCCCTCTTCA TGCTCACCGA CGATTTAGGC
AAAGGATTCG GACCTCTCGT CGTCGCCGGC TTCGTCGCCG CGTTCGGTCG CGAGACGGCC
TTTTTCATCT CGGTTTTGTT CTGGATTCCT TGCGGGTGTC TTCTCGCCGC CTCGTGCTAC
ACGCTGAAAC GAGATTTACA AACCGCCGCT GCGCGTTACC AAACAGAACG CGCCGAAGAA
AATCGTTCGG TTCGATTGGA TTAG
 
Protein sequence
MSASNKRAFV FTCLGLGLVF ADQNLLAPNL TAIANDLNLS PNERDYKLGG QIAFAFFLLG 
APAAVLIGSM ADYYPRTKLF AWTMLLGSCP NVLAWMPGVT TFGQLYWLRA LTGIAVGGAA
PLTYSLMSDL FPPSERTKMS AVTGLSMTLG IVCGQAIAGF LGESYGWRLP FAVVAIPAIC
VAMVLMFFVE EPERGAMEAP PDEEASLEQD ATAAKMYARK LHGIVSVRTV ALFLAQGVSG
CVPWSMINTF FNDYLAQDKG LGVKQSTSML ILFAVGGMLG TIWAGWYGQI LFNNKPENVS
IFMGSSAIVG VFPVAYLVLA NYDNSASDIA IKSILSFISG GIASCVGVNI RALLLNVLHP
MNRGTAFSLF MLTDDLGKGF GPLVVAGFVA AFGRETAFFI SVLFWIPCGC LLAASCYTLK
RDLQTAAARY QTERAEENRS VRLD