Gene OSTLU_38823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38823 
Symbol 
ID5001941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp810464 
End bp811753 
Gene Length1290 bp 
Protein Length429 aa 
Translation table 
GC content62% 
IMG OID640417362 
ProductMFS family transporter: phosphate/sugar 
Protein accessionXP_001418115 
Protein GI145347307 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.726317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGG GAGGGCAAAA GAAGCGATGG GGGATGGTGT TCGCGCTGTT CATCGCGTTC 
GTGCTGTGTA ACTTGGACAA GGTGAACATG TCGGTGGCCA TCGTGCCGAT GGCGGAGTCG
TTCGGGTGGA CGGCGACGCA AAAGGGCTTG GTCGCGTCCG CGTTCTTCTG GGGTTATTCG
TTCACGCAGA TCCCGGGTGG GTGGTTGGCG AGTAAGTACG GCGGTAAAGC CGTCTTGTTC
TGGGGCGTCA TGCTCTGGTC GTTCGGTACG CTCATCGCGC CGTGGTGCGC GGCGCTCGGC
ATGCCGGCGC TGCTCGCGTC GAGATTCTTG GTCGGTCTCG GCGAAGGCGT CGCGCCGTCC
GCGGCGACCG GCGTGTTGGC GAAGGGCGTT CCGCCGAGCC AGCGATCGAA GGCCGTGACC
TCCGCCTTCG GCGGTCTCGA CGTCGGCTCG TTGTTGGGTT TGCTCATCGC GCCGCCGATC
ATCTTCCACC TCGGCGGCTG GGCCGCCGTC TTTTACTTGT TCGGCGCCCT CGGCTTCTTC
TGGGGCGCGT GGTGGTTCAT CTCCTACATG CGCGATTCCT CCACGGACAT GAAGGAAGTC
GAAACCACCG GCGCTAAGAA GGGTCTCTCC ATCCCGTGGG CCGCCTTTGT GCGCAACCCG
CAGTTTTGGG CGCTCACCGT CGCGCACTTT ACGTGGAACT ACTTTTCCTA CGGCTTGCTC
GCGTGGTTGC CGTCCTTCTT GGCGAGCGCC ATGGGCGTGA CTTTGTCCAA GTCGTCTTTC
CTCTCCATTC TTCCTTACTT GTCCACCGTC ATCGTGACCG CGCTCATCGC CCCACTCGCC
GGTGAACTCG AGGCGAAGAA GAAGCTCACG CGAACGCAAA TTCGCAAGGG CTCGCAGACG
CTCTGCTTTG GCGTCGGCGC CGTGACGCTC ACGATGATTG GCTTGATCGT GAACGCCACC
CCGGTGGCCG CGGTGACGAA CCAAACCATC GGCATGGTTG TCGGTCTCCT GTCCGTCACC
TTCGGCTTCG CCGCGTTCAT CCGCACTGGT TTGTTCTGCG GTCACCAAGA CCTTTCGCCG
AAGTACGCGT CCATCATGTT GGGCGTCACC AACACGGCGG CGGCCATCGC GTCGACTCTT
TCCACCTTCT TCACCGGTCT TTTCCTTTCC ATGACCGGCG GCAACTGGGC GTACTCCTTG
TTCTTCCCGA TCGCTGCCCT TCAATTGGTT TCCGTGTTCG TCTTCCTCAT CTGGAAGTCC
GACCCGGTTG ACTTCGACGC CGTCGCCTAA
 
Protein sequence
MTEGGQKKRW GMVFALFIAF VLCNLDKVNM SVAIVPMAES FGWTATQKGL VASAFFWGYS 
FTQIPGGWLA SKYGGKAVLF WGVMLWSFGT LIAPWCAALG MPALLASRFL VGLGEGVAPS
AATGVLAKGV PPSQRSKAVT SAFGGLDVGS LLGLLIAPPI IFHLGGWAAV FYLFGALGFF
WGAWWFISYM RDSSTDMKEV ETTGAKKGLS IPWAAFVRNP QFWALTVAHF TWNYFSYGLL
AWLPSFLASA MGVTLSKSSF LSILPYLSTV IVTALIAPLA GELEAKKKLT RTQIRKGSQT
LCFGVGAVTL TMIGLIVNAT PVAAVTNQTI GMVVGLLSVT FGFAAFIRTG LFCGHQDLSP
KYASIMLGVT NTAAAIASTL STFFTGLFLS MTGGNWAYSL FFPIAALQLV SVFVFLIWKS
DPVDFDAVA