Gene OSTLU_50546 details


Gene Information

Locus tag: OSTLU_50546
Symbol:
ID: 5003913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 99682
End bp: 100817
Gene Length: 1136 bp
Protein Length: 334 aa
Translation table:
GC content: 57%
IMG OID: 640419334
Product: predicted protein
Protein accession: XP_001419490
Protein GI: 145350171
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00915074
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0604025
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ACGTCGACGA TGGGTTTGAT GAAGAGATCA GCAATCAGAA TGGCGATAAA ACCGGTCTTG 
ATCGCGTGGG CGTGAAGATT ACGGCGTCTT CGTTGTACAT TCTTCAGGGC ATCGTCGAGT
ACACGGCGCT GATGCAAGTA ATGCGGCCGA GCGTGCCGGT GATTTTCAAC GGTATCTGTC
AACTCTTCGA GCTCGCACTA GTGAAAACTT TTAACGCTTT CGGGCGAACG GAAGCTTTGC
TTCCCGAATC GCACGACATG ACGCCGCGGC TGCGAGGTAC GTTGTCTAGA CTCGGGAACT
CGGGTGGCGC CATGGCGATT CGACCTACGA TTCGGGGACA AGGGGACGTA GACATCTTGT
CCAGCGGTAA TTTGTACGGT TTGAAGGAGC GCGGCATCGC GCTCGAGTCG CTCTCGCGCG
TCGCCGATGA GTTCAAGCGC ATCAAGGCGC GTGTGAAACG CTCGCTCCCG CTCAAAGACG
CCGCGCTGGC GGATCGCTTT TACTCGCACA CCGTCGCTGC GGTGGACGAT TTGCGCGAGC
ACGTGTACAA GAACGTTGCT TCATTATTGC TCAACATTGA ATTTTGCGCA GAAGCGATCG
GAGACGGCGA TCCAAACTTT GCGTTGGTGA ACTCTACGTT CGTGGGCAAG TACAACATTC
GCGAGACGCC GTCGAGGCAC AACAAGTGGG TGGACGACGT CCAGGCCGAA CTCTTACAGT
TCACCACCAA ACTCGCCTGC GCCGACGTCG CACCTGAAGC TCTGGATGTT CTGTGGCAGC
ACGCCGCGAG TGTGATCCAA GACTCTCTCG TCGATGGTTT CAGCAAGGTG AAAAAATGCA
CCGACGCCGG CAGAGCGCTC ATGGCGCTCG ACGTCGAAAC CCTGCGCGGC GAATTCGCGA
AACTCGCCCC GTCCCAGTCC CTTGCCTTCG ACTGGCGCTA CGTCAGCACC TACATCAACG
CCTTCTACGT CCCAGAAAAA GACGTCGAGA AATGGATGCA AATCCATCCC GAGTTTTCCA
AAAAACAAAA GCTCGCCTTG GTCGCCCACA ACGCCTCGGC GGATCGCGAC AGGCGCTGGA
CCGCGAAATT CCGCCAAGAC TTGCTCAACG CCATCGAAAC CGACACGTTA CTCTAG
 
Protein sequence
MQVMRPSVPV IFNGICQLFE LALVKTFNAF GRTEALLPES HDMTPRLRGT LSRLGNSGGA 
MAIRPTIRGQ GDVDILSSGN LYGLKERGIA LESLSRVADE FKRIKARVKR SLPLKDAALA
DRFYSHTVAA VDDLREHVYK NVASLLLNIE FCAEAIGDGD PNFALVNSTF VGKYNIRETP
SRHNKWVDDV QAELLQFTTK LACADVAPEA LDVLWQHAAS VIQDSLVDGF SKVKKCTDAG
RALMALDVET LRGEFAKLAP SQSLAFDWRY VSTYINAFYV PEKDVEKWMQ IHPEFSKKQK
LALVAHNASA DRDRRWTAKF RQDLLNAIET DTLL
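
The record lists a 1136 bp gene but a 334 aa protein, so the coding region can only cover part of the gene sequence shown. The first ten residues of the protein (MQVMRPSVPV) correspond to the codons ATG CAA GTA ATG CGG CCG AGC GTG CCG GTG in the third line of the gene sequence. The following is a minimal Python sketch for checking the GC content and locating the coding region from the two sequences above. It assumes the standard genetic code (NCBI translation table 1), because the "Translation table" field is blank in this record. The helper names (`clean`, `gc_percent`, `translate`, `find_cds_start`) are illustrative and not part of any database tool.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record leaves "Translation table" blank, so table 1 is a guess.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def clean(seq):
    """Drop the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def find_cds_start(dna, protein):
    """Return the 0-based offset whose translation equals `protein`, else -1."""
    dna, protein = clean(dna), clean(protein)
    for start in range(len(dna) - 2):
        if dna.startswith("ATG", start) and translate(dna[start:]) == protein:
            return start
    return -1
```

To use it, paste the gene sequence and protein sequence blocks into two strings and call `gc_percent(gene)` and `find_cds_start(gene, protein)`. The grouping spaces and line breaks can stay in, since `clean` strips them.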