Gene OSTLU_51391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51391 
Symbol 
ID5005426 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp621526 
End bp624847 
Gene Length3322 bp 
Protein Length645 aa 
Translation table 
GC content60% 
IMG OID640420847 
Productpredicted protein 
Protein accessionXP_001421518 
Protein GI145354493 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.644512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGTC GCGGTCGCGA CGGCGATGCG TCGCCGGCGC GTCGGACGCG CGCGTGGACG 
CTGGCGACGG CGATGAAGTT TACGGCGCTG CGACTGTTCG CGTCTGGGTT TCTGCTCGCG
CGCGTGGAAT CGCCGTCGAG GGCGACCGCG AGGCCGGACG CGGCGCGCGC GATCGTCGAT
AAGGCGGTGG TGCTCGTCGT CGACGGCGCG CGGCACGATT GGACGACGGC GACGAGGGAC
GAGGGCGACG AGGCGCGGCG GCGGTTGAAA CTGCCGAGCG CGAGACGATA CGGGGGGGGG
AGGCGGTGCG AAGACGCGAC GAACGAACGA GGACGAGGGA TGGTGTTTAA ATTCATAGCG
GACGCGCCGA CGACGACGCA GCAGCGGTTG AAGGGATTGC TCACGGGAGG GTTGCCGACG
TTTATCGACG CGAGCGCGTC GTTCGGCGGG ACGACGCTCG GGGAAGATAA TTTGATCGAA
CAGTTGAGCG CGAATGGACG ACGGATGGCG ATCAGTGGGG ACGATACGTG GAGCGAACTT
TTCGACGTGA ACGCGACGTT TCGGGCGGGG GCGGCGATGT ATCCGAGCTT TGACGTGAAG
GATACGGAGA CGGTGGACGC CGGCGTGCGC GCGTCGATGG CGGCCGCGTT GCGCGCGCCC
GATGACTGGG ACGTTTTGAT AGGGCACATG CTCGGTGCCG ATCACGTCGG CCACACGCAC
GGGGCGACGA CGGATTTCAT GCGCGCAAAG TTAGAGGAGA ATGATCGAGA TATCGAGAAC
GTAGTCGAGG CGATGCGAGC GGACGAAAAG TACGCCGACG CGATGGTGTT CGTGTTCGGC
GATCATGGGA TGACAGACAA CGGAGACCAC GGCGGTGGGA CGCCCGAGGA GGTGGAGTCT
TTCATGCTCG CGTATCACCC CTGGGCGAAA GGTGAGAACT GTGGGAACGG CGACGGCGAA
GACGACGATG ATTTCCCACA GATTGACTTC GCGCCGACGA TGGCGACGCT TCTGGGCGTG
CCCATACCGC ATGGAAATTT AGGAAAGGTG AACGAAAAAG TATTCAATCT CGCGCACGAG
GGGAAACGCG CGAGTGGGCG AGGCGATGTG TTCGCCGCTT ACGTGCGCGC GATGCACGCG
AACGCGGAGC AAATTTGGAC GTACGTTCAA TCGTACGGCG ACGGCAAGAC GAGTCCGTTT
GGTGCCGAAC ACACCACGCG CCTCGCGGCG CTCATGAAGG TGGTGCGAGC GAATAAATCT
GTAGACACGA CAGAGTTTGT GCTGGATTTC ATGAATGAGG TGGCAGAATT GGCGCGCGCG
AAGTGGGCAC AATTCGGATT GTTGAGCATG ACGGTTGGTT TCATCGCGCT CGTCGTCACG
CTCACGGCGC ACGCCGTGCT CGCGTACGAC AAAGTCGACG ACGCGCGCGA TGGAGATTTA
GATCTTATGA TCGCGCGCGT CGGTGTATTT ATGGTGATAT TAGCGTCCGT GGCACGATTA
TCGAATAGTT TCGTGGTGCA AGAGCGCGAG ATGATGCAAT TTCTCTTCGC CACGTTCATC
GTCGCCGCGA TGTTTGGGAG ATTCACACGA GGTCAAGCGG GGGTCTTGCA AAGTGGATGC
AAGTGCCTTT TTGCTAACGG TGCGCTTTAT GTTTTAGGAG TGTCGTGGGT GAAGAGTGAC
TCGACGGCGA TCGCGTCGCC CGCGATCACC GTCGTCATCG CAACGTGCGG GCTCGTCGTC
GTCATCGTCG CTTTGAACGC GATTCGTCGC CATGCGTCGA TGGCGTCGTA CAATGGTAAG
TTACGCGTCG TCGACATCGC ATCGTGCGCC TGGCTCACCG TGGCGATTCG GAGCGTGCAA
ATTTTAGTTT TCAAAGGCGA AGGAATCGCG CTCGCCCGTG CGACGTACGC GCTTTCAATC
GCAGGCGCTG TGGCGAACGG GATCGAGACG TCGACGCGGT CGCCGTCGGC GAACGTCGCA
CGTCTTCTGC GAGTTTTCAT ACTCTCAGTC GCACCGACGA TCGCCATGCT CGCCGGACCA
ATCCTAGGCG TCGCGTACGT AGCGCTGACA TACGTCTTGT ACGACGGTTT GCTCAGCCTC
TTGGTCGACG CGTCGCCGCG CTCGAAAGGC ACGGAAACTG TCGTCGCCAG TGGACTATGG
CTCGCGAGCA CGGTTGTATT TTTTGGTGGA GGACACACGT GCTCGTTTGA CGGTTTACAC
TTCGCCGTCG CGTTCACCGG CTTTCGCAAA TTCAACTTTT ACGGCATGGG ATTTCTCCTC
GGCTTTGAAA CGTGGAGTGG TGAAATCATT CTCGCCGTCG CCATTCCGCT CTTCGCGTTC
GCTATGACTC AAAATGAACC ATACGAGTCT TTCCAACGAT TGACCGTGCG CGTATCGATG
AAAGTCGCGC TCTTCCGCGC ATTCGCGGCG ACGTGCGCCG CCCTGTGCGC CTTCATCCAC
CGTCGACATT TGATGGTTTG GGCGATTTTT GCGCCAAAAT TCGTCTTCGA CGCCATCGGT
TCCACCGTCG CCGACGTTTG CGCCATCGTC GCCGTCGCTT CATCTTTCTC TAGGCATCCT
TTAGAGCGCG TCAAGCGCGA GTGATGCGAC GTTCGCTGTA TTATTATGTT CTCTCGCGAG
TCCCGACGCC CCGACGGCTG CCCAAAACCT CAACCTCATT TCGGGACACC GATTTTCGCG
ATGTTTAAAC CGATCACCCG CGAGCTCAAG CTCGCGCACG GCTCGCGACG CGAGCGTCGC
CGCGTCTCGC GCGCGACGGA ATCGCGCCTG CGCCCCGGGC GACAGGGCGT GCAACGAATC
TATCGCCTGA CCCAGGAAAA GGAAATGATT GGCGAGTACG AATACCTCGA GTCCATCGGC
GTCCCCAGGG CGCAGGCGCT GCAAGTCATG TCCCGAGCGT CCACAGCGTT CGAGCGCGAG
GCGGTGCGAC GAGGCCAAGA CCCGAAAGCG ATGAAATTCG GCGCGGAAGA GATGCGAGAG
GTGGTGGAGT TTCTGAAGGC GAGCGGCGTG AAGGAGGACG CGGTTGGATT TTTAGTCATA
CGTAATCCCG CGGTGTTGGC GTACGACGTG GAGAAACGAT TGAGACCGTT GTTTGAGTAC
ATGGAGGCGA CGTTCGAGCG GACGGCGGAG ATGTTTGTGG ATGACGTGAC GAAGCGGCCG
AGCTTGCTCG GGTTGGACGC CAACGAAAAC GCGAAAAAGA TGGTGGACTT TTTGTTATCC
ACGGGAAGCA CGAAGGAAGA GGCGGTGGAG TATTTATTGC GAACGCTTTA GGACTCGATG
TCGAGCGATT GATGATTAGC AG
 
Protein sequence
MRRRGRDGDA SPARRTRAWT LATAMKFTAL RLFASGFLLA RVESPSRATA RPDAARAIVD 
KAVVLVVDGA RHDWTTATRD EGDEARRRLK LPSARRYGGG RRCEDATNER GRGMVFKFIA
DAPTTTQQRL KGLLTGGLPT FIDASASFGG TTLGEDNLIE QLSANGRRMA ISGDDTWSEL
FDVNATFRAG AAMYPSFDVK DTETVDAGVR ASMAAALRAP DDWDVLIGHM LGADHVGHTH
GATTDFMRAK LEENDRDIEN VVEAMRADEK YADAMVFVFG DHGMTDNGDH GGGTPEEVES
FMLAYHPWAK GENCGNGDGE DDDDFPQIDF APTMATLLGV PIPHGNLGKV NEKVFNLAHE
GKRASGRGDV FAAYVRAMHA NAEQIWTYVQ SYGDGKTSPF GAEHTTRLAA LMKVVRANKS
VDTTEFVLDF MNEVAELARA KWAQFGLLSM TVGFIALVVT LTAHAVLAYD KVDDARDGDL
DLMIARVGTE TVVASGLWLA STVVFFGGGH TCSFDGLHFA VAFTGFRKFN FYGMGFLLGF
ETWSGEIILA VAIPLFAFAM TQNEPYESFQ RLTVRVSMKV ALFRAFAATC AALCAFIHRR
HLMVWAIFAP KFVFDAIGST VADVCAIVAV ASSFSRHPLE RVKRE