Gene OSTLU_42838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42838 
Symbol 
ID5003337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp102588 
End bp104006 
Gene Length1419 bp 
Protein Length472 aa 
Translation table 
GC content69% 
IMG OID640418758 
ProductMFS family transporter 
Protein accessionXP_001419283 
Protein GI145349734 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.183784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.513972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGC GCGGGGAATC GCCGTCCGAC GACGACGTCG ACGACGCCGC GAGCGCCGAG 
GCGCTGCTGG GACTCTTCAT GCTCGTCAGC GCGCTGGTGT ACGTCGATCG AGGGATCGCG
AGCTCGGCCG CGGTGAGCGG GGCGCCGAGG AGCGCGCGCG AGCCCGCGGG GCGAGGCTTG
CAGGGCGCGC TCGGGTGCTC GTACGCGGCG TACGGAGCGC TGAACGCGGC GTTCATGATC
GGACTGCTCT CGGGCGCGCC CGCGTTCAGC GCGATGGCGA ATAAAGCGTG CGCGTTCAGA
TTGATCGCGA TCGGGCTGGC GATGGCGGCG GTGGGGGAGC TCGGATGCGC GCTGGCGCCG
ACGTGCGGGT GGGCGTTCGC GGCGCGCGCG CTGGTGGGCG CGGGAGAGGC GAGTTTTATC
GCGCTCGCGG CGCCGTTCAT CGATGATAAA GCGCCGAAAG GCGCGAAGAC GATGTGGTTG
GCGATGTTCT ACGCGTGCGT GCCGTTCGGG GTGGCGTTCG GGATCGCGTT CGGGGGGGCG
GTGACGCCGG CGATGGGGTG GCGATGGGCG TTCGGGTTGA ACGCGTGCGC GATGGCGCCC
GCGGCGGCGT ACTGCTTCTG GCGTCCGGCG GTGCGCATGC GAGGCGTCGG AGGCGATGCG
AATGCGCGCG AGGCGGCGGC GACGTCGACG GTGGCGTCGT TGACGCGCGC GTTCGCGCGA
GATTGTAAAG AGTTGTTCGT GCGCGAGACG TACGTCGTCG TCGTGCTCGG GTACGCCGCG
TACACCGCCG TCATCGGCGT GTACGCGGCG TGGGGACCGA AAGCCGGGTT CGCGATATTT
CGAGATGAGT TACACACGTC GACGAACGCG GACATGCTCC TCGGTGCGAT CACCGTCGTG
AGCGGGATCG CGGGCACGCT TCTCGGCGGC GGCGTCGTGG ACAAGTTGGG GAGCTCGACG
GCGACGGCGT TGCGCACGTC CGCCATCGCC GCCGTCGGGG GATTCGTGTG CCTCGAGCTC
GCTTTCAGGT GTCAAACGTT CGCATCGTTC GCGGTGTGCT TGCTCATCGG ACAAATGTTC
GCTTTCGCGT TACAGGCGCC GATCAACGCC GTCGTGCTCT GGAGCGTCCC CGCGCGTCTG
CGCCCGCTCG CGTGTTCGAT GACCACCGTC ACCATTCACC TCTTCGGCGA CGTCCCATCG
CCGCCGCTCT TCGGGCACTT CCTCGAGCGC GATGGCGCCC CCACGCCCGA GCGCTGGCGA
ACCATGTGTT CGACGTTCAC GCTCTTATTC GTCGTCGCCG CGGGCGTCTT CGCGACGGCG
GCGCGGCGAG CCGGCGGCGA CGCGCGCCGA CAACGCGTCT TAGACGACGA CGACGACGAC
GACTCGCGCG ACGTCGACGA CAGGCTCTTA CCGACGTAG
 
Protein sequence
MTPRGESPSD DDVDDAASAE ALLGLFMLVS ALVYVDRGIA SSAAVSGAPR SAREPAGRGL 
QGALGCSYAA YGALNAAFMI GLLSGAPAFS AMANKACAFR LIAIGLAMAA VGELGCALAP
TCGWAFAARA LVGAGEASFI ALAAPFIDDK APKGAKTMWL AMFYACVPFG VAFGIAFGGA
VTPAMGWRWA FGLNACAMAP AAAYCFWRPA VRMRGVGGDA NAREAAATST VASLTRAFAR
DCKELFVRET YVVVVLGYAA YTAVIGVYAA WGPKAGFAIF RDELHTSTNA DMLLGAITVV
SGIAGTLLGG GVVDKLGSST ATALRTSAIA AVGGFVCLEL AFRCQTFASF AVCLLIGQMF
AFALQAPINA VVLWSVPARL RPLACSMTTV TIHLFGDVPS PPLFGHFLER DGAPTPERWR
TMCSTFTLLF VVAAGVFATA ARRAGGDARR QRVLDDDDDD DSRDVDDRLL PT