Gene OSTLU_25712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25712 
Symbol 
ID5006247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp53202 
End bp56122 
Gene Length2921 bp 
Protein Length967 aa 
Translation table 
GC content60% 
IMG OID640421668 
Productpredicted protein 
Protein accessionXP_001422190 
Protein GI145355912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.260732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.855505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGG AGGAGGAGGG AGAGGCGACG ACGTCGTCGA GCGGGACGGG ACGCGGAGGC 
GGCGCGGCGC GATGGTTGAT GAACGCGGAG ACGAGCGGGG CGTACGGGGA AGCGGCGTCG
CGAGGAAGGA CGGGGGGGAG AGGAAGGGGT GGGGGCGCGC GGGGGAGGGG CGCGGCGCGA
TCGAAGTCGG CGGACTCGAA GAAGCGGACG CGAGGGGACA AACCTACGAA CGTGCGACAG
CGCGAGCGGA GCGACATGTA CGTGAAGAAT AATCAAATCA TCGCCAAGTT TGACGACGTC
GAGGACGTGT TAGATTTCGC GAGCGAGAAT TTGGAAATCA TGAACGTGGT GAACTTGGCG
ACGGCGGCGC ATCGGGTGGG GAAATTGAAC TCGACGCGGA CGAGGAACGA AGCCGGAGCG
CCGGCGACGG CGACGAGGCA TCCCGCGGTG GTGGAAGACG CGCGATTTCG CGCGTTGTTT
GAGAAATTGC GCGAGTATCT CGTCGCTTCG CAAAACGGAT CGCCGCTCGG GAAGGGGTTG
GGGCGGTTCA ACGCTCGCGA ACTGTCCGCG ATTTTGTGGG GCTCCGCGCA CTGCGGCATC
ACGACGTCGG ACGACGACCC AACCGTCGCT CTCGTGGTGA GGCGAATCGC AAACTTGGAC
GACGACGACC CACCGGCGCA GAACGTGAGC AACGTATTGT GGGCGTACGC GACGATGCAC
ACGTCGAAGA AGATAGACGT GGAGCTGGTG CAAAAGTGCG AATATTGGTG CGACCTCATC
ATGGACGACT TCGCCCCGCA AGGGATTAGC AACTCTTTGT GGGCGTTCGC CACGCTGGGG
TACACGCTTA AACCGGAGAC CATCGCCAAG TTTTCGCAGG CGATCAGGCG ACAGCTGAAA
GATTTTAAAT CCATGGAGTT CTCAAACGTC GTTTGGGCGC TGGCGACGAT GAAGACGCAT
TTAGACCCAC TAGAAGTCTT CGACGAACTC TTGGACGAGA TGCACGCGAG CATCAAGGCG
GTTCCAAACA TGTGGAGCTC ACAGAGCGTG AGTAATACTT TATGGGCGAT CGCCACGCTC
GACGGAGAAC CGCACAAATT AAGAGCTCGC CACGGCGATT ACTTGAACAC GCTGTGCATG
TACGTAGAGC GTAAGGCAAA CGCGTTTGTT TGTCAAGGTT TGGCGAACAC GCTGTGGGCG
CTGGCGACGC TCGAGTACAC GCCTTCGATG AAGATGCTCG AAGCCGCCAC GGCGCGTTGG
TCCGCGTTAG CGACGGACGT GTACATCAGT GAGTGTAGCA ATTTGCTTTG GTCGTACGCC
AGCCTGCGGT TCAACCCAGG AAATGAAGTG CTCACGCAAG TCGCAGAGTT GTACCTTCGC
GTCGGGCAGC ACGACGAAGT GGCGTTGACG CAAGTCTCGA ACACCTTGTG GGCGTGGGCA
AATTTCGGTT GGCTTCCCGA GGATCCAAGC ATCGTGGAGT GCGTCCTTCA AGTGGCGATC
AAACACTTCA AGAGCGATCC AGATTTGCAA ACGCAGAGCT TGGCGAACAT CTTGTGGTCG
TTGGCGACGC TGAGGTTCGT TCCCGGGGAT GAATTTCTCC AAGCCTTTAG AGAGCGCGCG
CTCATAGAAT TGCGCGAGGA CGAAAGATTC TCCGATCAAG GGTTGTGCAA CACGGTTTGG
GCGTACGGTC AGCTCGGAGT GAATCCAGGG ACGGAGTTAA TGAGTGAAAT CGCGAGTCAG
CTGGGCGCTC GCGTGACGAA TTTCCCCACC CAAGGCGTGA CGAATTCGAT TTTAGCCTTT
GCCACGCTCG GGTTTTGGCC GGATGAATGG GTCGTAGACA ACTACAGGGC GAAGATCGTG
GAAATGTACT ACTCCACCAC GATTTCGGAC ATCGACTTGA CGCAGTTTTT CCAAGCGAAT
TACTTGTTTG AAAAGTGTTC GCCCTACGGA CCGCTCGTCA CCGACCCGCA GATGATTGAG
GACATGTTAT CGGCGTGGAA GCGCGGATCG AGCAAGGTTG TCATCAGTCA GTTTCATCGC
GAGGTGAGCG ATACGCTGAC GAACATGGGC GTGCCACACG AAATTGAATA CATCACCGAA
GACGGTTTGT TCTCCCTCGA CATCGCACTC AAGGGTAAAA AGCTCGCCAT CGAGGTGGAC
GGTCCGTCGC ACTTTGCGAG AAACATCCAA AACCGCCGCA TGTCGGGGAA GCGACCCGAC
GGCACGGGGA CGTATAACAT TCGTTATCAC TACCTCGACA CCAACGGTTG GACCACGGTA
TTCATACCGT GGTACGATTG GAAACAGGTG TGCGACGAGG AGTCCGCGAC GAGAACCACC
GGCAGACGCG CCGCGTTTTT AGCCAAGACG CTCTACGACG ACGCCGGTCT CACGCTCATG
GACGTCGCCT CGGACGAAGA CATGTCCGAC TCCGGCATGT CTGGTTTTCA CATCCGCGCA
CTCGCCGACG ACGACGTCTC CGTCGCCCAA GACGGCTCGC GTCTCGTCAT TCAAGGCGCC
GAAAGCCAGC GACACCCCGA CGGCACGTTC AAGCCCGAGA TGAAATCCGT CGGCGCCTCC
GTCCCCAAAC CTCCCGCCCC CGTTCGACGC ACCGGATACG GCATGCTCGA ACCATCACCG
TCAACACCGT CAACACCGTC AACACCGTCA ACACCGTCCC CGCCCCCCCC CGTCGTCGCG
CGCGTCGCCG GCGCTCGCGT CGCCGCGCGT CCGCCGTCTC GCGCGTCTCC GCGCGCGCGC
GCGCCCGCCG ACGCCGCCGA CGCCCCCCCC GACGACGACG ACCCATCTCC GGCGCGTCGA
TCGCGTAAAT CTCTCGCCAC TCAACGCGGC GCGGGCATTC GTCGTCGTCG TCCGCGCGCA
CCGCCGAGCG TCGAGTCCGA TTAAACCGCG CGCGCGGCCG C
 
Protein sequence
MDEEEEGEAT TSSSGTGRGG GAARWLMNAE TSGAYGEAAS RGRTGGRGRG GGARGRGAAR 
SKSADSKKRT RGDKPTNVRQ RERSDMYVKN NQIIAKFDDV EDVLDFASEN LEIMNVVNLA
TAAHRVGKLN STRTRNEAGA PATATRHPAV VEDARFRALF EKLREYLVAS QNGSPLGKGL
GRFNARELSA ILWGSAHCGI TTSDDDPTVA LVVRRIANLD DDDPPAQNVS NVLWAYATMH
TSKKIDVELV QKCEYWCDLI MDDFAPQGIS NSLWAFATLG YTLKPETIAK FSQAIRRQLK
DFKSMEFSNV VWALATMKTH LDPLEVFDEL LDEMHASIKA VPNMWSSQSV SNTLWAIATL
DGEPHKLRAR HGDYLNTLCM YVERKANAFV CQGLANTLWA LATLEYTPSM KMLEAATARW
SALATDVYIS ECSNLLWSYA SLRFNPGNEV LTQVAELYLR VGQHDEVALT QVSNTLWAWA
NFGWLPEDPS IVECVLQVAI KHFKSDPDLQ TQSLANILWS LATLRFVPGD EFLQAFRERA
LIELREDERF SDQGLCNTVW AYGQLGVNPG TELMSEIASQ LGARVTNFPT QGVTNSILAF
ATLGFWPDEW VVDNYRAKIV EMYYSTTISD IDLTQFFQAN YLFEKCSPYG PLVTDPQMIE
DMLSAWKRGS SKVVISQFHR EVSDTLTNMG VPHEIEYITE DGLFSLDIAL KGKKLAIEVD
GPSHFARNIQ NRRMSGKRPD GTGTYNIRYH YLDTNGWTTV FIPWYDWKQV CDEESATRTT
GRRAAFLAKT LYDDAGLTLM DVASDEDMSD SGMSGFHIRA LADDDVSVAQ DGSRLVIQGA
ESQRHPDGTF KPEMKSVGAS VPKPPAPVRR TGYGMLEPSP STPSTPSTPS TPSPPPPVVA
RVAGARVAAR PPSRASPRAR APADAADAPP DDDDPSPARR SRKSLATQRG AGIRRRRPRA
PPSVESD