Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_52000 |
Symbol | |
ID | 5006762 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 287215 |
End bp | 288972 |
Gene Length | 1758 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422183 |
Product | MFS family transporter: hexose |
Protein accession | XP_001422705 |
Protein GI | 145356989 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.128216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0380146 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG AATCGCGGCG AGGCGCGGAC GTCGGGCCGG GTCTCGCGCC GTCGGCGGTC GCGGCGAGTC TGGGGGCGTT TCTGTTTGGG TATCACACGG CGGCGTGCAA CGCGCCGCTG AGCGCGCTGG CGCGAGATTT AGGATTCGCG GACGATGACT ACGTGAAAGG CGCGGTGGTG TCGGCGTTGG TGATCGGAGG CGCGATCGGA GGGCTCACCG TGGGTGGGCT GAGCGATAAA TACGGGCGGA AGTGGGCGTT GACGGCGACG AGCGCGCCGC TCGCGCTGGG GACGATGCTG AGCGGGATGG CGCCGAACGC GGTGACGATG ATCGCGGGGA GGTTCATATG CGGACTAGGC GTGGGAGCGA GCTCGCAAAT TGTGCCGCTG TATCTCAGTG AAATCGCGCC GCCGGCGTTG CGGGGGACGC TGAACGGATT TCGAAGGTTG GCGTACGTCT TCGGTTGTTT GGCGGCGTTT CAACTCGCGG CGCCGCTCAA GGAGACTGGA GGGGAGGGTT GGTGGCGACC GATTTTCTAC GATGCGGCGA TACCGGCGCT CATGCTAGCC GTGGGCGCGG CGTTCGTGGC GCAAGAGACT CCGGTTTGGC TGCTGACGCA GAGTGATGAA AAGGCGGCGG AGAAATCTCG ACGTTCGCTA GCGATTTTGC AGAACATTCG CGGCCGCGCC GCGGTGACTT GGCAAAATTC TTTAAGAAGC GCGTTTAGCA GCTCGCGTTC GTTAGACGAC GACGATAGTG CGTCGACGTC AATCGATGCG GAAGTTAGCG CGTCCTCAGC GGCGGAGAAA CCTGCGAAAT CAAAGTTGTC TCGCTCGAGA CGCAAGGAAC AAAAATTATC GACGTGGTCG GAGCTGATTT CAGACGACAA AAATCGTTTA CCACTTTCGC TTGGCCTATC GCTGTGCGCG CTGGCTGCAT TCTCGGGATC AAACACAGTC ATATTCTACG CATCCACTGT GTTCACGAGC GTGGGAATAA ATAATCCAGA AATTTTGACT TGGGCGGTGG GCGTGCCGAA CGTCGTCGGT GGCTTTGTCG CGCTGGCCCT GTCAGACAAA ATGGGTCGTC GACCTCTTCT CCTCACTTCA TTCGGAGGCA TGAGTGCGTG CTTGGGCATT TTGTCTCTCG CCGCGTATCT CACCCCTGCG AACGAGCTAA GCTCGTTTTG CGCCAACCCA GAGCTTGGCG TGCTCGTGGG AAAACTGAGC CAACAAATCG ACGACGGATT GTACTCGTAT CCCACGATGG CGAGCGCTCA GATTTGTGCG GATTTTGCCG CACTGTCGCC CGCGAGTGCG GGACCGGCAC AACCCGAAGC AGCAGTCGCG CTCGTCACAA TCCCATTGTA CGTTTTATTC TTCTCTCTCG GCGCAGGTCC GATTCCGTGG CTCTTGTACA ATGAAGTATT CCCCACCCGC ATTCGCGCTC GAGCCGTTTC GGCGTGCACC GCTCTGAACT ATGTATCAAA CTCAATCGTC GGTGCAACCT TTCTGCCGAT GGTTGGCGCG TACGGATTGA GTGGATCATA CGGTTTCTAT ACTCTGCTGT GCGCTAGCGG ATACGTTTTT GTCGATCGTT TCATCCCCGA GACGAAAGGT TTGCGCTTGG AAGATGTCGA GTCGACGTTG AAGAGACACG CGCGCAAAAG ATCGACGTCG CGCTCAAAGT CCATCGATGA ATAGCTTGGT CATGAATAGC TTTGATGATT CTTGTAATCT TTATTTGTAC TACTAGTT
|
Protein sequence | MTTESRRGAD VGPGLAPSAV AASLGAFLFG YHTAACNAPL SALARDLGFA DDDYVKGAVV SALVIGGAIG GLTVGGLSDK YGRKWALTAT SAPLALGTML SGMAPNAVTM IAGRFICGLG VGASSQIVPL YLSEIAPPAL RGTLNGFRRL AYVFGCLAAF QLAAPLKETG GEGWWRPIFY DAAIPALMLA VGAAFVAQET PVWLLTQSDE KAAEKSRRSL AILQNIRGRA AEQKLSTWSE LISDDKNRLP LSLGLSLCAL AAFSGSNTVI FYASTVFTSV GINNPEILTW AVGVPNVVGG FVALALSDKM GRRPLLLTSF GGMSACLGIL SLAAAGPAQP EAAVALVTIP LYVLFFSLGA GPIPWLLYNE VFPTRIRARA VSACTALNYV SNSIVGATFL PMVGAYGLSG SYGFYTLLCA SGYVFVDRFI PETKGLRLED VESTLKRHAR KRSTSRSKSI DE
|
| |