Gene OSTLU_31751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31751 
Symbol 
ID5001873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp474258 
End bp476414 
Gene Length2157 bp 
Protein Length709 aa 
Translation table 
GC content61% 
IMG OID640417294 
Productpredicted protein 
Protein accessionXP_001417783 
Protein GI145346618 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0545641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAC CGCACGAGTA CGACGACTGG TTCGCGCACG CGGACGCCGA TGGCGACGGC 
CGCGTCTCCG GCGCCGAGGC CGTGCACTTC TTCATGCGCG CTGGGCTCCC GAAGACCGAT
CTCGCCAAGC TCTGGGACGC GGCGGACCAC GAACGGGAGG GATCGTTGGA TCGACGGGCG
TTTTCGTTGG CGTGCGCGCT GATAGGAGCG TTGCAGCAGT ACGGAACGAT CACGAGAGAC
GTGTTCGATC GCGCGCTTGC GGGAGATACG CGAGGATTTC CAAAGCCGAA GATGCAAGGG
TTGGAGTTAC CGGCGGCGCC GACGGCGACG ACGAGTCAAC CGCCGGTTGC CGCGGCGACT
GGCGGGACGT TTGGGAACGT TGCGCCGTCG CCCACGATGG AATTTACGTC GCCGCCGAAG
GACGATTTAT TCGCTATATC GAGTGGGGTC GATGATTTCG CGCCGGTGGC GCCGGCGCCG
ATGGAGCGAG CGCCGGCGCC CGTCGCGCCG ACGAGTCTCG CGTTTGACGC GCCTCCGCGC
GCGACGTCGG TAGAGCACAC GGTGCAGGCG TATCAAGCGC CTGCAGTTGC CGTACCCGTG
GCGCCGGAGG CGAACGTTGA TTGGCCGGTG ATCGGTCCGA ATGATTGGCA GCGATATCAA
CAGATTTTCC TTTCGAACAC GAACGGGAAC CCGGAGGGAC GGTTGAGCGG GCAGCAAGTG
GCGCCAATTT TACTCGGTAT GAACGCGCCT AAGCAAGTAT TGAAAGACGT GTGGGAACTT
AGTGACTCCG ACAAAGATGG GTCTTTGGTT TGGACAGAAT TCGTCGTGGC GGCTTACCTC
ACGGAACAGG CGAGGAACGG CTTGATGCCT CCCAAATCGC TTCCGCCCGG GCAATTTCCA
CCATTTAGCA TGACGGCGGG TGAGCAACCC GCGCCAGTAG CTCCCGTCGT TCCAGAAGCC
GCGCCAACAT CAGTGCTACA GGTGAACGCG GTCATGACGG ACGGGTTAAT GACACCTAGC
ATCGCGCGAG AGCAGTTACA AAACATCACC GCACCCGCGC AAGCGGCACC GCAAGTGAAT
GAGGCTTACA CATACAGAGG GCCAATGGCA AACATCGACG CCATCCCTGA ACAAGATCGT
GATTTGGCGG GAAAGGTCAA GGAAAACGCG GAGAAGAGCG ATCGGCAATT GTGGGAACAA
GAAATGAACG AGCGACAGAA CGTTCTGAGC GCGCACGCGG CTCAAGAAGT GTTGGCAAAT
CTCGCATTGT TTGTTCGTAA ATGCGAAGCC GGAATGACAG AAGCGTCCTA CAGGGCACAA
GTTGCCGAAT CGCAAGTCAT CGAACTTAGG CAGAAATGTG AGGTGATGGA AGGACGTGTG
ACGCAGCTCG TCGAACAACT GGCCGGACCC ATTGAGCGCA TCGAGGCGAG CAAGAAGGAA
CACGAGGAAT TGAGTGCGCG ATACCAGCAA CTCGAAGAGC GTCACGCCGA GTTGTCGCAG
AACGCGTCGC AGCAAAATCA TTCGCAGATG ATGCAAGATA ACGTGAGTCT GCGTGCGAAA
GTCGAGGCGA AATCCACGCA AATAGTGATG GAGGAGACGC GCGCGAGTCA AGCCGCGACA
ACGTCGTTGA GCGCTCAGAT GCGAGAAACG CAGCTCACAG CGACGCAACC GCCTGCGACA
GCGGCGTTGA TGGATTTCGG CGGCGTTTCG GCGGCGTCCA CAAATATATC ACCGGCCTCA
GCCAATGCGA AGAGCACTTT CGAAGAGTGG GACAATTGGG GCACGACGGC GCGCGAAGAG
GCGTCGACTC AAACGCGAGC GTCGTCGATG CCCGCGCAGC GGCACCGAAA AGTCCCATCC
GAAATCCCCG CCGTCACCGC CGCGCTCATC GAAGGCGACG GCGGCTTTTT CGATAGCATC
GACTCCGACC CGTTCGGCCA ATCGAACGAG TCGACGTCGC CCGCTCCCGC CATTCCTCCG
TCGTCCTTCG GCGACGACCC CTTCGGCGCG CCGCCGCCGA CGCCGCCCGG CGACTACGAC
GACCCTTTCG GCGCGCCACC ATCCCCCGGC CCGACGCCGC CGCAGAGCGC GCGCGCATCT
TTCATAGACA TCGATCCATT CGCGATGTGA GTGCTCGCCC TAGCTCGGCG CGAGCGA
 
Protein sequence
MQQPHEYDDW FAHADADGDG RVSGAEAVHF FMRAGLPKTD LAKLWDAADH EREGSLDRRA 
FSLACALIGA LQQYGTITRD VFDRALAGDT RGFPKPKMQG LELPAAPTAT TSQPPVAAAT
GGTFGNVAPS PTMEFTSPPK DDLFAISSGV DDFAPVAPAP MERAPAPVAP TSLAFDAPPR
ATSVEHTVQA YQAPAVAVPV APEANVDWPV IGPNDWQRYQ QIFLSNTNGN PEGRLSGQQV
APILLGMNAP KQVLKDVWEL SDSDKDGSLV WTEFVVAAYL TEQARNGLMP PKSLPPGQFP
PFSMTAGEQP APVAPVVPEA APTSVLQVNA VMTDGLMTPS IAREQLQNIT APAQAAPQVN
EAYTYRGPMA NIDAIPEQDR DLAGKVKENA EKSDRQLWEQ EMNERQNVLS AHAAQEVLAN
LALFVRKCEA GMTEASYRAQ VAESQVIELR QKCEVMEGRV TQLVEQLAGP IERIEASKKE
HEELSARYQQ LEERHAELSQ NASQQNHSQM MQDNVSLRAK VEAKSTQIVM EETRASQAAT
TSLSAQMRET QLTATQPPAT AALMDFGGVS AASTNISPAS ANAKSTFEEW DNWGTTAREE
ASTQTRASSM PAQRHRKVPS EIPAVTAALI EGDGGFFDSI DSDPFGQSNE STSPAPAIPP
SSFGDDPFGA PPPTPPGDYD DPFGAPPSPG PTPPQSARAS FIDIDPFAM