Gene Information |
Locus tag | OSTLU_38010 |
Symbol | |
ID | 5004080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 255693 |
End bp | 257423 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 63% |
IMG OID | 640419501 |
Product | MFS family transporter: sugar |
Protein accession | XP_001420126 |
Protein GI | 145351527 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
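The coordinates and lengths reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a terminal stop codon that is not counted in the protein length. Both assumptions are consistent with the values in the table. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Length in aa for an unspliced CDS, assuming the final codon is a
    stop codon that is not translated (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(255693, 257423)
print(length)                  # 1731
print(protein_length(length))  # 576
```

Both results match the Gene Length (1731 bp) and Protein Length (576 aa) fields.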
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.286894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG CCGCGCCCGA CGTCGACGTC GACGCCGACG TCGACGACGA CGACGGCGGC GACATCCCGC GTCCACCGCG CGCGCCCGAC GGCGCGGCGT CCACCGCGCG CGACGACGCG CGCGATGACG ACGCCGACGA CCAAAGACTC GGCCTGCTCG ACGCCGCGAC GCCGCGCGGC GAATTCGTCG TCGTCGCCCT CGACGCGCCG AACGACGACG GCGACGGCGA CGGCGACGAT GCGCGTCGAC GAACGTACAC CGCGAACGAG GCGCTCGATC ACGTGGGATT CGGGAAGTTT CAAATCCACG CGCTGTGCTT CGTCGGTTTC GCGTGGGCGG CGGACGCGAT GGAGATGATG CTGCTCAGTT TCATCGGACC CGCGATGCGA TGCGAATTCG GGGTGTCGAG CGACGCGGAA GGCGCGCTGA CGAGCGTGGT GTTCGTGGGA ATGGCTCTCG GGGCGCCGGC GTGGGGAGTG GTGAGCGACG CGCGCGGTCG GAAGCCGGCG CTGCTGTTCA GCTCGACGAC GACGCTCGCG GCGGGGATCG GAAGCGCGCT GGGCGGGAGC TTCGGGAGCG TGTTGTTCTT TAGATGCCTC GTGGGCGTTG GTTTGGGTGG GGTACCCGTG GCGTACGGGT TGTTCATGGA GTTTTTACCG AGGGAAAATC GCGGCGCGCG CTTGTCGTAC ATTGAGGCGT TTTGGACGCT GGGATCGATG TTGGAGTCCG CGCTGGCGTG GATCGTGCTT CCGCGGCATT CGTGGCGAGT TTTACTGTTG ATTTCAGCGG CGCCGTTGCT TGGATTGATC GCATGCATCT TTATCGTCCC GGAGAGCGTC TTGTACTCGG TAAACGCCGG TCGAATGGAG GAGGCGAAGG AGACGTTGCG TCGCGTCGCG GCGACGAATG GTAAATCTCT ACCGCAGGGC GAGCTCGTGG GGCCGAACGA TCGCGCGTCA TCGAGTGGCG AGTTCGAAGA TCGTACTTCG TATGGCATGG GCGCGTCGGG AGCGTCGTCT TCTACGATGA TGCAAAGGTT CGTTCCGAGC GGCGTTCGCG CGTTGCTGTC CAAGAAGCAC GCGAAGACGT CTCTCTTGGT TTGGGTGATT TTCTTCGGTG TGGCGTTTTT GTACTACGGC ATTGTCCTTC TCACAACGTC ACTCAACGTG CGCGACGACG AGTCCAAGCG TGGGGGAGAG TTGGCGTGTC TAGCGCACGG TGCGCCACAT TTGAGCGACG GCGAGTACGC CGACATCTTT CTCAGCTCGT TCGGCGAAAT TCCAGGCTTG ATCGTCGCGA TTATGATCGT CGACAAGATC GGTCGCAGGC GCTCGATGGC GTTCACCGTG ATTGCCACCG CTGTGTTCTT GCTCCCCGTG GCTTCATCGA GCATAAGTAA GGCGGTTCGT GACATCATGC TCTTCGGTGG AAGAAGCGCC GCGTTCGCGG CGTTCACCGT CTTGTACATA TTCGCCGGCG AAGTCTATCC GACGTCGATC CGTTCGACCG GTGTCGGCCT CGGAAACGGG TTCGCGCGCA TCGGTGGAAT AACATGCCCG ATATTCGCTG TGACTTTGAT TGAGTCCGGA CATCTGACGC TCTCCGTCGT CGTCTTCATC GCCGTCGCCG CCGTCGCGTG CGCCGCCGCG CTCTCGCTCG CCGTCGAAAC CGCCGGTCGC GAGCTCGACG CCGACGACGA GCCGGGCGTC GAGCTCGCCC CAGTGGCCTA A
|
Protein sequence | MSAAAPDVDV DADVDDDDGG DIPRPPRAPD GAASTARDDA RDDDADDQRL GLLDAATPRG EFVVVALDAP NDDGDGDGDD ARRRTYTANE ALDHVGFGKF QIHALCFVGF AWAADAMEMM LLSFIGPAMR CEFGVSSDAE GALTSVVFVG MALGAPAWGV VSDARGRKPA LLFSSTTTLA AGIGSALGGS FGSVLFFRCL VGVGLGGVPV AYGLFMEFLP RENRGARLSY IEAFWTLGSM LESALAWIVL PRHSWRVLLL ISAAPLLGLI ACIFIVPESV LYSVNAGRME EAKETLRRVA ATNGKSLPQG ELVGPNDRAS SSGEFEDRTS YGMGASGASS STMMQRFVPS GVRALLSKKH AKTSLLVWVI FFGVAFLYYG IVLLTTSLNV RDDESKRGGE LACLAHGAPH LSDGEYADIF LSSFGEIPGL IVAIMIVDKI GRRRSMAFTV IATAVFLLPV ASSSISKAVR DIMLFGGRSA AFAAFTVLYI FAGEVYPTSI RSTGVGLGNG FARIGGITCP IFAVTLIESG HLTLSVVVFI AVAAVACAAA LSLAVETAGR ELDADDEPGV ELAPVA
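The sequences above are displayed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. It is illustrative only: the helper names are hypothetical, and the stop-codon set assumes the standard genetic code, since the Translation table field is empty in this record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-residue display blocks."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Assumption: standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Check for an ATG start, a terminal stop codon, and a length
    that is a multiple of 3."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First three display blocks of the gene sequence, as a short demo input.
fragment = join_blocks("ATGTCCGCCG CCGCGCCCGA CGTCGACGTC")
print(len(fragment))  # 30
print(round(gc_content(fragment), 2))
```

Applied to the full Gene sequence field, `looks_like_cds` should hold, since the sequence begins with ATG and ends with TAA. That also indicates the sequence is listed in coding orientation even though the gene lies on the minus strand.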