Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43234 |
Symbol | |
ID | 5005505 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 222030 |
End bp | 223433 |
Gene Length | 1404 bp |
Protein Length | 444 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420926 |
Product | MFS family transporter: metabolite |
Protein accession | XP_001421238 |
Protein GI | 145353903 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.104794 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGT CGAACAAGCG CGCGTTCGTG TTCACGTGCC TGGGCCTCGG GTTGGTGTTC GCCGATCAAA ATTTGTTAGC GCCGAATTTG ACCGCCATCG CGAACGATTT GAATCTGTCG CCGAATGAGC GGGACTATAA ATTAGGCGGG CAGATCGCGT TCGCGTTCTT CTTGCTCGGC GCGCCGGCGG CGGTGTTGAT CGGGTCGATG GCGGATTACT ATCCGCGCAC GAAGCTCTTC GCGTGGACGA TGCTGTTGGG ATCGTGCCCC AACGTCCTGG CGTGGATGCC GGGGGTGACG ACTTTTGGTC AGTTGTATTG GTTGCGCGCG CTGACGGGAA TCGCGGTAGG CGGCGCGGCG CCGTTGACGT ACTCCTTAAT GTCAGACTTA TTTCCGCCGA GCGAGCGGAC GAAGATGAGC GCGGTGACGG GGCTATCGAT GACGCTCGGG ATCGTCTGCG GGCAAGCGAT CGCGGGATTT TTGGGGGAGT CGTACGGCTG GCGCTTGCCG TTCGCGGTGG TGGCGATTCC GGCGATATGC GTGGCCATGG TGCTGATGTT CTTTGTCGAG GAGCCCGAGC GCGGGGCGAT GGAGGCGCCG CCGGACGAGG AGGCAAGTCT CGAGCAAGAG TCGCTCGTCA TCCAATCGTC CTCACGACCC TCGACACCGC CGATACCACC GCGAGGGCTG CACTTCAGCG CGACCGCGGC GAAGATGTAT GCGCGAAAGT TGCACGGCAT CGTCTCCGTG CGCACCGTGG CGTTGTTCTT GGCGCAAGGC GTGTCGGGTT GCGTGCCGTG GAGTATGATC AACACGTTCT TCAACGATTA CTTAGCGCAA GATAAGGGGT TAGGTGTGAA GCAGAGCACG TCGATGCTGA TTTTATTCGC CGTCGGCGGC ATGTTAGGGA CGATTTGGGC CGGGTGGTAC GGCCAAATTC TTTTCAACAA CAAACCGGAG AACGTATCGA TTTTCATGGG ATCGAGCGCA ATCGTGGGGG TGTTCCCGGT GGCTTACTTG GTGTTGGCGA ATTACGACAA CTCCGCGTCC GACATCGCGA TCAAGTCGAT CTTATCCTTT ATTTCGGGAG GCATCGCTTC GTGCGTCGGC GTGAACATTC GAGCGCTGTT GCTCAACGTC CTGCATCCCA TGAATCGCGG CACCGCGTTT TCCCTCTTCA TGCTCACCGA CGATTTAGGC AAAGGATTCG GACCTCTCGT CGTCGCCGGC TTCGTCGCCG CGTTCGGTCG CGAGACGGCC TTTTTCATCT CGGTTTTGTT CTGGATTCCT TGCGGGTGTC TTCTCGCCGC CTCGTGCTAC ACGCTGAAAC GAGATTTACA AACCGCCGCT GCGCGTTACC AAACAGAACG CGCCGAAGAA AATCGTTCGG TTCGATTGGA TTAG
|
Protein sequence | MSASNKRAFV FTCLGLGLVF ADQNLLAPNL TAIANDLNLS PNERDYKLGG QIAFAFFLLG APAAVLIGSM ADYYPRTKLF AWTMLLGSCP NVLAWMPGVT TFGQLYWLRA LTGIAVGGAA PLTYSLMSDL FPPSERTKMS AVTGLSMTLG IVCGQAIAGF LGESYGWRLP FAVVAIPAIC VAMVLMFFVE EPERGAMEAP PDEEASLEQD ATAAKMYARK LHGIVSVRTV ALFLAQGVSG CVPWSMINTF FNDYLAQDKG LGVKQSTSML ILFAVGGMLG TIWAGWYGQI LFNNKPENVS IFMGSSAIVG VFPVAYLVLA NYDNSASDIA IKSILSFISG GIASCVGVNI RALLLNVLHP MNRGTAFSLF MLTDDLGKGF GPLVVAGFVA AFGRETAFFI SVLFWIPCGC LLAASCYTLK RDLQTAAARY QTERAEENRS VRLD
|
| |