Gene OSTLU_18023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18023 
Symbol 
ID5005339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp162826 
End bp163935 
Gene Length1110 bp 
Protein Length369 aa 
Translation table 
GC content60% 
IMG OID640420760 
Productpredicted protein 
Protein accessionXP_001421222 
Protein GI145353870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.150148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTACG AGACGCTGCG GCGGCGCGCG GCGGAGGCGG CGTTCGACGA GACGTCGACG 
ACGCCCGAGT ATCGTGGGTT TTTGAATCAC GCGAGACGGT GCGTCGATGG GAGCGGAAAG
ATCGACTGGG GTTCGGTGAC GCGCGTGGGG GGGCTGGCTG GGTCGAGAGA GGTGCGGCGG
GCGCGCGGGG TGGACCCTCG GGACGTCGAG GCGGCGTGTT TGCGAGGCGA AGGCGAACCT
TTGATCGTCG AGGACGGAGG GAAAGATTGG GCCAAGTGGG ATTTTGAAAC GCTTCAAAAC
GAAATCGGTG ATTTTGAGGT GTTGTGTAAC GATCGAGCGC CCGCGCGGCG ACGAGAGATC
GATGGATCCA AGCAGCGGTC GCACCTGATT CCGTTTCGGG CGTACGCCGA CTACGTACGA
AAGCGCGACG GCGTCGCCGG CACAGTCTTT GACGATCGCC GCACGCCGTT TTATGCAAAC
GGCATGCGCG TATTCAGCGA GTGCAAGCGC GCCGACGCCC TCTCGCGGGC GTTTCCACGA
CCTTATTTCA CGCACGAGTG CGATAATACG GAGACGCTGC TCATGGCGAC AACGAACGAG
CTCGGGTCGA TACTCAAATT TGACTCGGAA ATCGCGCTCA GAATGCGTGA CAGCGTCTCA
AAATCGCTCG ACAAGATGTT TGTCGGGCCT CGAGGCGCAC TCACGCGCCT TCATTACGAC
GCCGGGGACG CTCACGGTTG GCTGGGACAG GTCGAGGGGC GGAAGCTCTT CGTGTTTTAT
CCGCCCAGCG CGTCGCCGAT GCTTTATCCG ATTGAAGACT CGCACGCCAG CGTCGATCCA
CTGGAGCCAG ACTACGATCG ATTTCCACTA TTTCGCGAAG CGCAATCGCG CGCGCGAGTG
TGCGTGCTGA ATCCGGGAGA AGTCGTACTG TGTCCTCGAC GATGGTGGCA CTACGCCGTG
GCCCTGGACA CTAGCGTCAC GGTTATGAGG AATTGGTACA ACGTCAATAC CAACGCCCAG
GCGTTGGTCG AGCAGATATG CTCCACGATT AAACAAACAG TAGACAATAG AGCGAAAGGA
TCCGTGCCTC GAAACGAAGC GTCTCGATGA
 
Protein sequence
MDYETLRRRA AEAAFDETST TPEYRGFLNH ARRCVDGSGK IDWGSVTRVG GLAGSREVRR 
ARGVDPRDVE AACLRGEGEP LIVEDGGKDW AKWDFETLQN EIGDFEVLCN DRAPARRREI
DGSKQRSHLI PFRAYADYVR KRDGVAGTVF DDRRTPFYAN GMRVFSECKR ADALSRAFPR
PYFTHECDNT ETLLMATTNE LGSILKFDSE IALRMRDSVS KSLDKMFVGP RGALTRLHYD
AGDAHGWLGQ VEGRKLFVFY PPSASPMLYP IEDSHASVDP LEPDYDRFPL FREAQSRARV
CVLNPGEVVL CPRRWWHYAV ALDTSVTVMR NWYNVNTNAQ ALVEQICSTI KQTVDNRAKG
SVPRNEASR