Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42838 |
Symbol | |
ID | 5003337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 102588 |
End bp | 104006 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | |
GC content | 69% |
IMG OID | 640418758 |
Product | MFS family transporter |
Protein accession | XP_001419283 |
Protein GI | 145349734 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.183784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.513972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGC GCGGGGAATC GCCGTCCGAC GACGACGTCG ACGACGCCGC GAGCGCCGAG GCGCTGCTGG GACTCTTCAT GCTCGTCAGC GCGCTGGTGT ACGTCGATCG AGGGATCGCG AGCTCGGCCG CGGTGAGCGG GGCGCCGAGG AGCGCGCGCG AGCCCGCGGG GCGAGGCTTG CAGGGCGCGC TCGGGTGCTC GTACGCGGCG TACGGAGCGC TGAACGCGGC GTTCATGATC GGACTGCTCT CGGGCGCGCC CGCGTTCAGC GCGATGGCGA ATAAAGCGTG CGCGTTCAGA TTGATCGCGA TCGGGCTGGC GATGGCGGCG GTGGGGGAGC TCGGATGCGC GCTGGCGCCG ACGTGCGGGT GGGCGTTCGC GGCGCGCGCG CTGGTGGGCG CGGGAGAGGC GAGTTTTATC GCGCTCGCGG CGCCGTTCAT CGATGATAAA GCGCCGAAAG GCGCGAAGAC GATGTGGTTG GCGATGTTCT ACGCGTGCGT GCCGTTCGGG GTGGCGTTCG GGATCGCGTT CGGGGGGGCG GTGACGCCGG CGATGGGGTG GCGATGGGCG TTCGGGTTGA ACGCGTGCGC GATGGCGCCC GCGGCGGCGT ACTGCTTCTG GCGTCCGGCG GTGCGCATGC GAGGCGTCGG AGGCGATGCG AATGCGCGCG AGGCGGCGGC GACGTCGACG GTGGCGTCGT TGACGCGCGC GTTCGCGCGA GATTGTAAAG AGTTGTTCGT GCGCGAGACG TACGTCGTCG TCGTGCTCGG GTACGCCGCG TACACCGCCG TCATCGGCGT GTACGCGGCG TGGGGACCGA AAGCCGGGTT CGCGATATTT CGAGATGAGT TACACACGTC GACGAACGCG GACATGCTCC TCGGTGCGAT CACCGTCGTG AGCGGGATCG CGGGCACGCT TCTCGGCGGC GGCGTCGTGG ACAAGTTGGG GAGCTCGACG GCGACGGCGT TGCGCACGTC CGCCATCGCC GCCGTCGGGG GATTCGTGTG CCTCGAGCTC GCTTTCAGGT GTCAAACGTT CGCATCGTTC GCGGTGTGCT TGCTCATCGG ACAAATGTTC GCTTTCGCGT TACAGGCGCC GATCAACGCC GTCGTGCTCT GGAGCGTCCC CGCGCGTCTG CGCCCGCTCG CGTGTTCGAT GACCACCGTC ACCATTCACC TCTTCGGCGA CGTCCCATCG CCGCCGCTCT TCGGGCACTT CCTCGAGCGC GATGGCGCCC CCACGCCCGA GCGCTGGCGA ACCATGTGTT CGACGTTCAC GCTCTTATTC GTCGTCGCCG CGGGCGTCTT CGCGACGGCG GCGCGGCGAG CCGGCGGCGA CGCGCGCCGA CAACGCGTCT TAGACGACGA CGACGACGAC GACTCGCGCG ACGTCGACGA CAGGCTCTTA CCGACGTAG
|
Protein sequence | MTPRGESPSD DDVDDAASAE ALLGLFMLVS ALVYVDRGIA SSAAVSGAPR SAREPAGRGL QGALGCSYAA YGALNAAFMI GLLSGAPAFS AMANKACAFR LIAIGLAMAA VGELGCALAP TCGWAFAARA LVGAGEASFI ALAAPFIDDK APKGAKTMWL AMFYACVPFG VAFGIAFGGA VTPAMGWRWA FGLNACAMAP AAAYCFWRPA VRMRGVGGDA NAREAAATST VASLTRAFAR DCKELFVRET YVVVVLGYAA YTAVIGVYAA WGPKAGFAIF RDELHTSTNA DMLLGAITVV SGIAGTLLGG GVVDKLGSST ATALRTSAIA AVGGFVCLEL AFRCQTFASF AVCLLIGQMF AFALQAPINA VVLWSVPARL RPLACSMTTV TIHLFGDVPS PPLFGHFLER DGAPTPERWR TMCSTFTLLF VVAAGVFATA ARRAGGDARR QRVLDDDDDD DSRDVDDRLL PT
|
| |