Gene OSTLU_30233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30233 
Symbol 
ID5000450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp751210 
End bp754407 
Gene Length3198 bp 
Protein Length949 aa 
Translation table 
GC content61% 
IMG OID640415871 
Productpredicted protein 
Protein accessionXP_001416482 
Protein GI145343768 
COG category[S] Function unknown 
COG ID[COG3781] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.204381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGCGACTCG CGCGCATCAT GCTGCGCCTC GCGCCTACCG TTCGCGCGTC GAGCGCGTCG 
AGCGCTTCGG CGCGCGTTCC ACGGCGCGTT CCACCGCGCG CCGCTCGCAT CGCGCGTCAC
GCGAGCCCGC GCGCGCGCGT TTTCGTTCGC CGTGTCGCGA TTCCACGAGG TGTGCAAATC
ACGCCGCCGC GCGCGCTCTC GAGCGACGAC GACGACGCGC GATCGTCGAG CGAGGCGATT
CGTTCCGATG CCGTCGACGA CGCGGCGCCG GCGCCGCGCG CGAGCGACGC GATGGTGGAG
CCGTGCGACG GCGACGACGG CGCCGGCGTC GTCAGCGGCG CCTCCAAGCC GAACGACTCG
TACGCCATCG AAAAGCGGCG CGCGAACACG ACGGCGATCG TGGGCACGGG ACCGGTGGCG
CAGACGCCGG CGGTGGCGCG GGATGAGCCG ACGAAATCGA CGCCCGCGCG CGCGGAGTCG
ACGACGCGGC CGGTGTTGAG CGAGGAAGAA CTGTTGCCGC CGTCGAAGCG GGAACCGAGA
GGGAAAGAGT ACAAAGAAGA CGGTGGAGAT AAGGAAGGGG GGGACGCCGC AGACGGTGAT
GGGAAAGGAC CGAGCGGAAG CGCGACGAAG GGAGGCGCGA CGACGAAGAC GAAACCGAAA
ACTGACGGAG AGAAGGAGAC GGGGGAGAAG AAGCTGGCGA CGCCGAAGAG CGCGCCGGTG
GTGAGCGGAG GCGCGATAAG GTCCGCGGCG GGAATGGCCG CAGAATCTGG TGCTGATCCC
GTGTTAGCGG GCGTGGGCGG TGGCGATGAT GGCGACGACG CGCGGTCCGG CGGCGGCGGG
GGAGGAGGAG GAGGAGACGA TAAGAATGAC GGAGACGACG AGGACGAAGA GGAAGATGAC
GACGACGAGG TCGTGCTTCC CGATCCCTGG TACAAACTCG CGTGGGAAGA AACCGAGCGC
GAGATTGTTC GCATCATGCA AAGGTTGCTC AATATTTGGA TCACGCGCGA CCCGAGCAAG
ACGACGCGAC TCGTTTTGTT CTCCGTCATG GGTACCGGTT TATTCTTAGC TTCGCTGCTA
CTTTACCCCG AGAATCCTCT CGAGTTTAGC GATAATGGAT TGTTTAGCTC TGTGTTCGGT
CGTAACCCTC GCTGGGTGAT CAATCGTGAA CTCGGTCCTG TGCAGCCGAC GTTGTTGAGC
ACAACGTGCG CGTGTGCCGT GGCTTTCGCC AAGGTTCGTC TCGGTGCGAA CATGTTCAAG
CTCTCACCCT TCATGCACGG CGTCCTGGGC CTCCCCATGG GTTTCATGTT AGTCTTTAGA
TGGAACAACG CGCACGAGCG CTGGTGGTAC GGCCGCACGT GTCTCGGAAA CATTTTGTTT
TACTGCAAAA ATCTTGGCGG CACGTTCTGC ACGTGGGTCG CGCCGGACGA CCCGATACTC
GCCGCGCGAA CGCTGGGTCT GATTGGTGCT TTGAAAGAAA CCGTGGCTGA TCGATTGAAT
GGTACAGTGC TGAACGACGG AGCGATTTTG AGTCAGCTCA CCACACCGCT TGACGCGGGA
GACTTGGAAG GATTATTTCT CGCCGAGAAT AAGGTGTTGT ACTGCCTCGA AAAGATTCGC
GGCTGCGTCC AAGAAGCATT CAACAAAGGT TACGTCCCTC CCGCCATCGC GAGCACGATC
CACAGCGAGG TGGCGATGAT CATGGATAAC TACGGGAGTT GCGAAAAAGT CGTCAACCAA
CCGCCTCCGG GTTGCATCAT CACCCACTTG AAGTCTACGC TCATGGTGTA CGTGTGCTCG
TTACCGATGA TTTTAGTGCA TGAAGTCGGA GTATGGGGCG TCGTCCCCGT CACCACCATT
CTTTCTCTCG CCCTGTTCGG GATTGAGGCT GCGGCTGAGC AAATCGAGCA GCCGTTTGGC
AACAGACCGT ACGATTTGCC CGTGCGCGCG CTGATGAATA GCAACTCGCG AGACCTCGAG
CAGACGAGCG AAAAGGTGAT CGGAATGACG GGCTTCGTAA ACGGGGTGAA GATGCCGTTC
ATTCCAGAAG GCTCAACGCC GTCGAAGGAG GCGATGTCGG CGAAGACGTT GCCACAGACG
TTGAAGACGT CTGAACCGCC GACGCCACCA CCGCCGCCGC CGGCCGCTTT CGTCACGCCC
GCGCCTTCTG CGGCTGCGCT CAAGGCGGCG GCGGCGCCGC CGATTTCGGA GTCATCCGTC
GTTATTCCGA CGGCGCGCAA GGACAACACA GTCTCGTTTT CGAGCGATCG CTTAGATGCC
GCTTCCGTCG CGGCGCTGAG CAAGGAGAGT CCGTTCGCGT TCACATCGAA CGAGCCTGCA
ACGAGTTCTG CTGAGCCGGC GAAACAGCCG CAGCCCCCGT CACCAAAGGT AGAGACCCGA
GACGGTGAAG GCGTGCCTTC GCGTCCATCG ACGCCTACGG CACCGATGAC GCCCCCTCGA
GGAAAGAGCG CGTTGCAAGC TGCGATGGAT GCGGTTTCAG AGACGGCGAA TTCGCGCGCG
CCGGTGAACC CTTCGTCCTA CTTGTCGCCG TTACGAACGG GTCGGATGTC CGATGCATCG
AGTTTATCGT CGTCGGCGCC GGCGTCGACC ACCAAGGGTG AGCTTCCGGC TTGGAAAGCG
CAGAGCCCAA TTGAAATTTA CGATCAAGTG CTCGATGAGC GACCAAGCTC GACGATTCCG
GACTCGCCTC GATACCACAC GCAAAATTCA TTGCGACGAC GAGACTCCTA CGGGCAGCAA
TTCTTCGATA TGTTCACTCA GCGACCGCAA GCCAACGGCG CGTCCGAGCC GAACAGTGCA
AAAGCTGGTT CGAACTTACA GCGATCGAGC TCGGTGGGCC TACCGCGATC GAGTAGCGTC
AGCAACTTGT CGGGCCCGTC GTATCGAACG AACGCCGAGG GTGAACGCGA CATTCGCGAC
ATCCCATTCT CGCGCAATGC ATTCGGACGA TCGCCGTCGA GATCGGATCT TATCTCCGAG
GAAGACGGCG GCGTGCCCAT GCGACGCGTG GGCGCGGTGA ACCGCTCGAC GTCCGTCACG
GACATGAGCG CTCTAGAGGA GGCCATCAAG AGAGCGCGCG AGGCGCGCAA GCGAACGTCC
GGGGGCTCGG GCTCACCGTG AAAAGAAGGC ACACGATAGA AAGACGCACA GTAGAGCCTT
CAAATGTATT GTATAAAA
 
Protein sequence
MVEPCDGDDG AGVVSGASKP NDSYAIEKRR ANTTAIVGTG PVAQTPAVAR DEPTKSTPAR 
AESTTRPVLS EEELLPPSKR EPRGKEYKED GGDKEGGDAA DGDGKGPSGS ATKGGATTKT
KPKTDGEKET GEKKLATPKS APVVSGGAIR SAAGMAAESG ADPVLAGVGG GDDGDDARSG
GGGGGGGGDD KNDGDDEDEE EDDDDEVVLP DPWYKLAWEE TEREIVRIMQ RLLNIWITRD
PSKTTRLVLF SVMGTGLFLA SLLLYPENPL EFSDNGLFSS VFGRNPRWVI NRELGPVQPT
LLSTTCACAV AFAKVRLGAN MFKLSPFMHG VLGLPMGFML VFRWNNAHER WWYGRTCLGN
ILFYCKNLGG TFCTWVAPDD PILAARTLGL IGALKETVAD RLNGTVLNDG AILSQLTTPL
DAGDLEGLFL AENKVLYCLE KIRGCVQEAF NKGYVPPAIA STIHSEVAMI MDNYGSCEKV
VNQPPPGCII THLKSTLMVY VCSLPMILVH EVGVWGVVPV TTILSLALFG IEAAAEQIEQ
PFGNRPYDLP VRALMNSNSR DLEQTSEKVI GMTGFVNGVK MPFIPEGSTP SKEAMSAKTL
PQTLKTSEPP TPPPPPPAAF VTPAPSAAAL KAAAAPPISE SSVVIPTARK DNTVSFSSDR
LDAASVAALS KESPFAFTSN EPATSSAEPA KQPQPPSPKV ETRDGEGVPS RPSTPTAPMT
PPRGKSALQA AMDAVSETAN SRAPVNPSSY LSPLRTGRMS DASSLSSSAP ASTTKGELPA
WKAQSPIEIY DQVLDERPSS TIPDSPRYHT QNSLRRRDSY GQQFFDMFTQ RPQANGASEP
NSAKAGSNLQ RSSSVGLPRS SSVSNLSGPS YRTNAEGERD IRDIPFSRNA FGRSPSRSDL
ISEEDGGVPM RRVGAVNRST SVTDMSALEE AIKRAREARK RTSGGSGSP