Gene OSTLU_46693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46693 
Symbol 
ID5004237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp53590 
End bp56669 
Gene Length3080 bp 
Protein Length979 aa 
Translation table 
GC content54% 
IMG OID640419658 
Productpredicted protein 
Protein accessionXP_001420062 
Protein GI145351387 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.649333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGCGCGCGC GATCGGTGAC GGTCGGTGAA CGGGACGGTG ACGATAGACT CGAGCTCGGG 
GAGTCGAAGA GGACAAAGAG TTTACAGATG CCGCGAGGGA CCACGACGAC GACGCGAGCG
ATGAGCGGAG GAGAAGCGGG AGGGGCGACG ATTGAACTCG AAGCCGCCGC TGAATTCGCC
AAGGATGAGT TTTACACGAA AAGTTTCGTC GTCAAGGCGC GCGAGCGCGT GGAAGCCTCT
GTGAAAGTTC GGGTCGAGAC GAATGGTGTT AAGTACAAGG TCACCGTGGA AACGGACATG
GACGCCGAAG GGCAAGATTT GCGCATGCAT TGGGGCGTGG CGACTAGTAA GGAAACGTGG
GATAACATGG AGACACCTCC AGTAAAAATT AGACCGCCGT TTACGATTGA AACCACGGGG
GTATGTCAAA CGCGCATGAC GCCGACGTCG TCACGCTTGA GCCCCACGCT CACGGCTACG
ATCGAAGGTG ACGTAGCGGA CGGCTTTTAC GCAATCAACT TCCTGTTCAA AGAACCAAAG
AAAGATCGCT GGATTCATAA CACGAACGGA AGAGATTGGC ACGTGCCGAT ACCACAGCCG
CCGGAACCTG AGTTAGTGAC GCGTACGATT ACTATGCTGG AAGATTATGA GGAAGAGATT
GAAGAGGACG AAGAATACGA AGAGGAACAA GAGGAGCAAG AGTTGGACGA ATCTCTCGCC
TCACTTTCAT CAGTGGTCGA AGTGAACATC GATTTGAACG CGCGCGCGAG CTCTCCCGTC
AAGGAGGAAG TCGTCTCAGC AGCGTCGTCA GAAATCTCCG AGCCGGCAAA CGCGTGGAAA
CGTCTTCTAA CCCCGCCTGT GGACCCCAAG CCGTTCGACA CGAAAGACGA GAATATTAAC
GCCTTAGTTG AGTCACTTTT TGGCGCTGGT TCGAACGGTT TGACATCTAT CAAGCCAATC
GCCGCGCGAG ATGAAGGACC GCCTGAGGAT GAGGACTCTG AACCGGAACT TGAACTGGCC
AAAGCGGCTC CCATGGAAAA GCCGAAGACG CTCAAAACTG GGATGCGAAA GGTGAAGAGG
ATGGTAACAA AGTCCCGCAC TATCACTAAG TCGATAGATG AACCGCTGCC GGTGAAGTTG
GTGGAGGGTC CTTGGCTCAC TACGAACACC ATTGAAGAAA AGTTGATTCG CGAGAAAGAA
GTGAGTTATA GAGTTGGCGC CGTAGTGGAA AAGAACGTCG AGGGTGGTGG AGTCTTGGTG
CGTGTAGAAG CTGAACTTCC GTGGAATATT GTTTTGCATT GGGGTATTGT TCCTCGCGGC
GCGCGTGCGG ATGTTTGGAC GTTGCCTCCG GAACAATGGC GCCCAGAAGG TTCAGTGGTC
GGGGACACGG GCAAAGCGTG CGAGACTCCC ATGAAGAAAT GCGAGAACCC GCTGTCCGAT
CGGATTCAGA TGAGTTATGC GGAGCTTCAG CTCGGAAACG CACCCACTGC GATGCGGTTT
GTTCTCAAAG AAAATGGTGG AGAAGGACGG TGGCTTGATC GCAATGGCGA CGATTTTGTC
ATTCCCATGC CCGAGCCGGC GTACGCAAGC ACCACGCTCG ATCTCACTGG TGAGCGGACT
AAGGAAGCGA TAGACGTCGC CACAGCCGCT GCGTTTCGTG CGGCGGAGTT GCATCTCGAC
TCCATGGATG AAGTCGAAGA GAAGGAGCTT GATATGAATA TGCAAGTCGA GTACGAGGTG
GCGACTCCAC CACTGAGCGA AGACGACTTT GCTCCGGCGG AGCGAATTCA AAAGCCTCAG
AAATCAGCCG TCGGTAACGG TCGGGAAGTT TTGCTTCAAG GCTTCAACTG GGAGTCTTGC
AAGGCTCCAT GGTACCAAGC CGTCGAGCGA CTTGCACCCA CCATTGCGGA ACTAGGATTC
ACGGTGGTTT GGTTGCCTCC GCCGACATCG AGCGTCAGCG AGCAAGGTTA CATGCCTCTG
GATTATTATA ACTTGGACTC TCGATACGGC ACAAAGGAAG AATTAAAGGG CGCCATCAAG
GCTTTGCACG ATAATGGTGT GATGGCGCTC GGTGATGCCG TGCTTAACCA TCGATGCGCC
CATTTCATAG GCGACGTACC CGGCACGTAC AACAAGTTTG GAGGCAAGTT ACCGTGGGAC
GCGACAGCCA TCGTGGCAGA CGATCCCAAT TTTCATGGCC GAGGAAACAA AGCCGATGGT
GAAATGTTCC ATGCAGCTCC TAATATCGAC CATAATCAAG CTTTCGTCAA GGCTGATCTC
GAAGATTGGA TGAGCTGGCT CATGCGAGAA GTCGGCTACG ACGGTTGGCG ACTCGATTAC
GTGAGAGGTT TCTGGGGTGG TCACGTGAAG GATTACATGG AGGCGACGAA TCCGCAGTTC
GCGGTCGGTG AGTACTGGGA TTCGTTGGCC TATAACATGG ATGCTCTGGA TTACAACCAA
GATGGTCATC GACAACGGAT CGTGAACTGG CTCAACGCCG CCGGCGGAAA CGCCGGCGCG
TTCGACGTTA CCACCAAGGG CATCCTGCAC GCCGTATTTG AACGCCAAGA GTATTGGCGC
TTATCGGATA AAGCAGGCAA GGCTCCGGGT GTCATGGGTT GGTGGCCTAG TCGAGCGGTG
ACTTTCATAG AAAATCACGA CACTGGATCG ACGCAAGGCC ACTGGCGGTT TCCTCGCGAC
AAAGAGCTTC AAGGATACGC GTACATCCTG ACCCACCCGG GCACGCCGAC AATCTTCTGG
GATCACATCT TTGACAACAA CTGGGGACAT TTGCACAAAC CCATCGAAGA CATGATTCGT
ATTCGCAAAC AGTCCGGTAT TCACTGCCGG AGCGAAGTGA AAATCGTCAA GTGCGAGCAG
AGCGTGTACG CCGCCGTCAT CGATGACCGA CTTTTGATGA AAATTGGACC TGGTCATTTT
CATGCAGACG ACGCGTGGGA TTGCGTGCTT TCTGGACAAG ACTTTGCAAT CTGGCGCAAG
AAGAAAGATC GTTCGCAGCC GCGGTGATTT CTCATCTCTA CTTAATATAC TGTAATACTA
CAACTCGCGA GCGAGCAAGC
 
Protein sequence
MPRGTTTTTR AMSGGEAGGA TIELEAAAEF AKDEFYTKSF VVKARERVEA SVKVRVETNG 
VKYKVTVETD MDAEGQDLRM HWGVATSKET WDNMETPPVK IRPPFTIETT GVCQTRMTPT
SSRLSPTLTA TIEGDVADGF YAINFLFKEP KKDRWIHNTN GRDWHVPIPQ PPEPELVTRT
ITMLEDYEEE IEEDEEYEEE QEEQELDESL ASLSSVVEVN IDLNARASSP VKEEVVSAAS
SEISEPANAW KRLLTPPVDP KPFDTKDENI NALVESLFGA GSNGLTSIKP IAARDEGPPE
DEDSEPELEL AKAAPMEKPK TLKTGMRKVK RMVTKSRTIT KSIDEPLPVK LVEGPWLTTN
TIEEKLIREK EVSYRVGAVV EKNVEGGGVL VRVEAELPWN IVLHWGIVPR GARADVWTLP
PEQWRPEGSV VGDTGKACET PMKKCENPLS DRIQMSYAEL QLGNAPTAMR FVLKENGGEG
RWLDRNGDDF VIPMPEPAYA STTLDLTGER TKEAIDVATA AAFRAAELHL DSMDEVEEKE
LDMNMQVEYE VATPPLSEDD FAPAERIQKP QKSAVGNGRE VLLQGFNWES CKAPWYQAVE
RLAPTIAELG FTVVWLPPPT SSVSEQGYMP LDYYNLDSRY GTKEELKGAI KALHDNGVMA
LGDAVLNHRC AHFIGDVPGT YNKFGGKLPW DATAIVADDP NFHGRGNKAD GEMFHAAPNI
DHNQAFVKAD LEDWMSWLMR EVGYDGWRLD YVRGFWGGHV KDYMEATNPQ FAVGEYWDSL
AYNMDALDYN QDGHRQRIVN WLNAAGGNAG AFDVTTKGIL HAVFERQEYW RLSDKAGKAP
GVMGWWPSRA VTFIENHDTG STQGHWRFPR DKELQGYAYI LTHPGTPTIF WDHIFDNNWG
HLHKPIEDMI RIRKQSGIHC RSEVKIVKCE QSVYAAVIDD RLLMKIGPGH FHADDAWDCV
LSGQDFAIWR KKKDRSQPR