Gene OSTLU_52000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_52000 
Symbol 
ID5006762 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp287215 
End bp288972 
Gene Length1758 bp 
Protein Length462 aa 
Translation table 
GC content58% 
IMG OID640422183 
ProductMFS family transporter: hexose 
Protein accessionXP_001422705 
Protein GI145356989 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.128216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0380146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGG AATCGCGGCG AGGCGCGGAC GTCGGGCCGG GTCTCGCGCC GTCGGCGGTC 
GCGGCGAGTC TGGGGGCGTT TCTGTTTGGG TATCACACGG CGGCGTGCAA CGCGCCGCTG
AGCGCGCTGG CGCGAGATTT AGGATTCGCG GACGATGACT ACGTGAAAGG CGCGGTGGTG
TCGGCGTTGG TGATCGGAGG CGCGATCGGA GGGCTCACCG TGGGTGGGCT GAGCGATAAA
TACGGGCGGA AGTGGGCGTT GACGGCGACG AGCGCGCCGC TCGCGCTGGG GACGATGCTG
AGCGGGATGG CGCCGAACGC GGTGACGATG ATCGCGGGGA GGTTCATATG CGGACTAGGC
GTGGGAGCGA GCTCGCAAAT TGTGCCGCTG TATCTCAGTG AAATCGCGCC GCCGGCGTTG
CGGGGGACGC TGAACGGATT TCGAAGGTTG GCGTACGTCT TCGGTTGTTT GGCGGCGTTT
CAACTCGCGG CGCCGCTCAA GGAGACTGGA GGGGAGGGTT GGTGGCGACC GATTTTCTAC
GATGCGGCGA TACCGGCGCT CATGCTAGCC GTGGGCGCGG CGTTCGTGGC GCAAGAGACT
CCGGTTTGGC TGCTGACGCA GAGTGATGAA AAGGCGGCGG AGAAATCTCG ACGTTCGCTA
GCGATTTTGC AGAACATTCG CGGCCGCGCC GCGGTGACTT GGCAAAATTC TTTAAGAAGC
GCGTTTAGCA GCTCGCGTTC GTTAGACGAC GACGATAGTG CGTCGACGTC AATCGATGCG
GAAGTTAGCG CGTCCTCAGC GGCGGAGAAA CCTGCGAAAT CAAAGTTGTC TCGCTCGAGA
CGCAAGGAAC AAAAATTATC GACGTGGTCG GAGCTGATTT CAGACGACAA AAATCGTTTA
CCACTTTCGC TTGGCCTATC GCTGTGCGCG CTGGCTGCAT TCTCGGGATC AAACACAGTC
ATATTCTACG CATCCACTGT GTTCACGAGC GTGGGAATAA ATAATCCAGA AATTTTGACT
TGGGCGGTGG GCGTGCCGAA CGTCGTCGGT GGCTTTGTCG CGCTGGCCCT GTCAGACAAA
ATGGGTCGTC GACCTCTTCT CCTCACTTCA TTCGGAGGCA TGAGTGCGTG CTTGGGCATT
TTGTCTCTCG CCGCGTATCT CACCCCTGCG AACGAGCTAA GCTCGTTTTG CGCCAACCCA
GAGCTTGGCG TGCTCGTGGG AAAACTGAGC CAACAAATCG ACGACGGATT GTACTCGTAT
CCCACGATGG CGAGCGCTCA GATTTGTGCG GATTTTGCCG CACTGTCGCC CGCGAGTGCG
GGACCGGCAC AACCCGAAGC AGCAGTCGCG CTCGTCACAA TCCCATTGTA CGTTTTATTC
TTCTCTCTCG GCGCAGGTCC GATTCCGTGG CTCTTGTACA ATGAAGTATT CCCCACCCGC
ATTCGCGCTC GAGCCGTTTC GGCGTGCACC GCTCTGAACT ATGTATCAAA CTCAATCGTC
GGTGCAACCT TTCTGCCGAT GGTTGGCGCG TACGGATTGA GTGGATCATA CGGTTTCTAT
ACTCTGCTGT GCGCTAGCGG ATACGTTTTT GTCGATCGTT TCATCCCCGA GACGAAAGGT
TTGCGCTTGG AAGATGTCGA GTCGACGTTG AAGAGACACG CGCGCAAAAG ATCGACGTCG
CGCTCAAAGT CCATCGATGA ATAGCTTGGT CATGAATAGC TTTGATGATT CTTGTAATCT
TTATTTGTAC TACTAGTT
 
Protein sequence
MTTESRRGAD VGPGLAPSAV AASLGAFLFG YHTAACNAPL SALARDLGFA DDDYVKGAVV 
SALVIGGAIG GLTVGGLSDK YGRKWALTAT SAPLALGTML SGMAPNAVTM IAGRFICGLG
VGASSQIVPL YLSEIAPPAL RGTLNGFRRL AYVFGCLAAF QLAAPLKETG GEGWWRPIFY
DAAIPALMLA VGAAFVAQET PVWLLTQSDE KAAEKSRRSL AILQNIRGRA AEQKLSTWSE
LISDDKNRLP LSLGLSLCAL AAFSGSNTVI FYASTVFTSV GINNPEILTW AVGVPNVVGG
FVALALSDKM GRRPLLLTSF GGMSACLGIL SLAAAGPAQP EAAVALVTIP LYVLFFSLGA
GPIPWLLYNE VFPTRIRARA VSACTALNYV SNSIVGATFL PMVGAYGLSG SYGFYTLLCA
SGYVFVDRFI PETKGLRLED VESTLKRHAR KRSTSRSKSI DE