Gene OSTLU_17433 details


Gene Information

Locus tag: OSTLU_17433
Symbol:
ID: 5004483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 525222
End bp: 526592
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table:
GC content: 63%
IMG OID: 640419904
Product: predicted protein
Protein accession: XP_001420537
Protein GI: 145352400
COG category:
COG ID:
TIGRFAM ID:
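
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(525222, 526592)
print(length_bp, protein_length(length_bp))  # prints: 1371 456
```

Both values match the Gene Length and Protein Length fields listed for this locus.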


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.614267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGC TGGCGCGCGA GCGCGAGGCG CGACAGCTCG AGGCTCGAGA GCGCGAGCGA 
TGGAATCGCG ACGTGCGCGA GGCGCTGCGA CGCGGGACGC GGGAGGAGGC GCTGCAGAGA
CGATTGGAGG CGCTCGACGC CGAGTTTGGT GACGACGCGG CGCGCGCGAC GACGGCGGCG
ACGCGCGGCG ACGCGACGGA AGGCGGTCGA GACGGCGCGC GAGACGATAC GATGCGACGG
GCGTCGACGC TGGAGCGAGC GTCGAGGGCG AAGGCGAACG AGGAACGGCG GGCGAGTCGA
GAGGCGTCGA CGGCGGCGCG CGCGCTGGAG ACGGACGCGA TGCGAGCGCT GCGAGCGAGG
CTGTCGGCGA TGGAGACGCA CGCGATTCGA AATAACCAGA TCGAGGAGAA GAAAGCGTTG
GAAAAGTTGG AGATGGAGCG GCAGCGCGCG CTGGATGCGG AGATGGAACG CGAGAGATTG
GAAGGGGTGG CGCGCGATGA GAGAGAGAGA GCGGCAAAGT TGGCAAAGAA AATCGCGGCG
CGCGAGGCGC TCGACGCGCA GCACGCGGAG CGGGCGCTTC GCATGAGCAA ACGAGCGATG
GAGAAGGAGG ACGAGCGACG GAAGATGGCG CAGTTAGTGA AAAAGATTGA GGCGGAAGAT
GAAGAGGAAA GACGAGAAGC CATCGCGAGA AGAGAGTGCG CGAAACGCGA CGCGGCGAAG
GCGCTGGAGG AGAAACGAGC GCGAGTCGCG CGCGAAGCTG AGGCGGTGAA GGCGTTGGAA
GAGCAAATCA TGCGACACGA TGAGATGGTG GCGCGGCGAG AGGCGAAAGA GGAAGAAGAG
AAGCGTAAAG AAGAGAAGCG TAAAGAACGC GCTCGCGTTT CCGTCGAGAC TTCGCAGACG
CGCGCCGCGC AAGAGCGCGC AGACTTTGAC GAACTGATTT CGCGATTGCG ATTTGAAGAA
TTTGAAGAGG CGTGTCGCGT GAAAGAAGCC GAAGAGCGCG AAAATGCTGC GCGAAGACGC
GATGAGCAGC GTCGCGAATA CGAAGCCGCC GTCGCCGCGC GAGAAGCGAG AGAGGCTCAA
GCTTTGGAAG AAAAGAAACA ATTTCAGCGA GATCTCGCCG CAAAGCTCGC TGAAGACGAT
CGCGTGGAGC AACTGAGCGC ACAAAAGCGC CGCATGAAAC TTCTCGAACA CTCCAGAGAG
GTGCGTCGTT TGGCGGAAGA AAAATTCGCC GAGCGCGAAG CTCGGCTCGA GCTCGAGCGC
CGCGAACTCG ACGCCTCTCG ACGCGAGCGC GAAGCCATGG ATGCGGCCAT CGAATGCGGG
CGCGAAAAGC TGCTCGCCGA ATTCCACAAC AAAACCGCCA GTCTCGCGTG A
 
Protein sequence
MRALAREREA RQLEARERER WNRDVREALR RGTREEALQR RLEALDAEFG DDAARATTAA 
TRGDATEGGR DGARDDTMRR ASTLERASRA KANEERRASR EASTAARALE TDAMRALRAR
LSAMETHAIR NNQIEEKKAL EKLEMERQRA LDAEMERERL EGVARDERER AAKLAKKIAA
REALDAQHAE RALRMSKRAM EKEDERRKMA QLVKKIEAED EEERREAIAR RECAKRDAAK
ALEEKRARVA REAEAVKALE EQIMRHDEMV ARREAKEEEE KRKEEKRKER ARVSVETSQT
RAAQERADFD ELISRLRFEE FEEACRVKEA EERENAARRR DEQRREYEAA VAAREAREAQ
ALEEKKQFQR DLAAKLAEDD RVEQLSAQKR RMKLLEHSRE VRRLAEEKFA EREARLELER
RELDASRRER EAMDAAIECG REKLLAEFHN KTASLA
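
The gene and protein sequences above can be related programmatically. The following is a minimal sketch, assuming the standard genetic code (the Translation table field is empty in this record, so that choice is an assumption) and hypothetical helper names. It strips the whitespace used to group the sequence into 10-character blocks, computes the GC fraction, and translates up to the first stop codon.

```python
# Standard genetic code (assumed), indexed in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGCGCGC TGGCGCGCGA GCGCGAGGCG"
print(translate(first_blocks))  # prints: MRALAREREA
```

The translated prefix agrees with the first ten residues of the protein sequence shown in this record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.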