Gene OSTLU_35581 details

Gene Information

Locus tag: OSTLU_35581
Symbol:
ID: 5002719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 424056
End bp: 425216
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table:
GC content: 65%
IMG OID: 640418140
Product: predicted protein
Protein accession: XP_001418699
Protein GI: 145348528
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
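
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows the arithmetic in Python. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(424056, 425216)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both values agree with the Gene Length and Protein Length fields of the record.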


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0000902697
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.474336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCG CGACGCCGGC GCCGGCGCCG GCGCCGGCGA CGGCGCGCGC GCGCGCGGAG 
GCGCGCGGCG ACGCCGGTCG GCGGTCGGTC GCGGCGCGCG CGCGACGCGC GCGGGCGCGC
GCGCCGGTCG GCGCGGTGGT GGACGACGCG ACGACGCGCG CGCGCGGCGC GCGCGGCGTC
GACGGCGCGC GGCGAACGCG AGACGCGCGC GACTATTGGT TTCCGGTGTG CTTCTCGGGG
AATCTGCGAG ATAAGGACGC GCTGGTGGCG TTTGATTTGT TCAACGTGCC GTGGGTGCTG
TTTCGGGGGC GCGACGGCGA GGTGGGGTGC GTGAAGGATG AGTGCTGTCA TCGCGCGTGC
CCGCTGTCGC TCGGGAAGGT GGTGGACGGA CGGGTGCAGT GCCCGTATCA CGGGTGGGAG
TACGAGACGA ACGGCGAGTG CGCGAAAATG CCTTCGTGTA GTTTTTTGAA GAACGTCTTC
GCGGACGAGC TCAGGGTGAT CGAGAGGGAT GGGATGATAT TCGTTTGGGC CGGAGAAAGC
GATCCCGCGG ACTTTGTTGG ACCGGAGGCG GCGTGCGAGT CTTGGGACGA GGACGTGTAC
GAAGCCAACG AACCGGGGAT GTTCACGACG GGCGAGGGAT TCGTGACGAT GGCGGAAGTC
ATCGCGGACG TGAAACTTGA CAGCGATGTC GTGGTGGAAC GGTTGTTAGA CATCACCGAG
CGCGCGCGGC GGGAACCGGT GTCGGTGAAG AATCGCGGTC GAGGCGCACT CTTTCCGGTG
GACGGCACGC GGTTGATTTC AAAAGTCTTG CGGATCGGTT ACGACGCGGT ACCGCAGAGC
GTGGTTTTCA AACCGTCGTG CGTGATCGCG AGCACGATCG CGCTGAGACC GCGCGTAGGG
GGTGGAGATG GGACTTCGAT GCAAGTAGAG CAGTTGCACG TGTGCTTGCC CGCCAAGCCG
GGGCTCACGC GCGTCCTGTT CCGCATGGCT TTCGATTTCG TCCCGGAGGG TGCGCAAAAC
GCGGCGGGTG ACGTGTGGAA GAACTTAGCA ATGCAGGTCC TACAAGAAGA GCTCGAAGAC
GTGCGAAGCG CGGGGCTGAA ATCGGAAACG ACATCCATAG CGGTTGAATC GTTTCGCGTC
TTCAAGAGAG GAGACAAGTA G
 
Protein sequence
MARATPAPAP APATARARAE ARGDAGRRSV AARARRARAR APVGAVVDDA TTRARGARGV 
DGARRTRDAR DYWFPVCFSG NLRDKDALVA FDLFNVPWVL FRGRDGEVGC VKDECCHRAC
PLSLGKVVDG RVQCPYHGWE YETNGECAKM PSCSFLKNVF ADELRVIERD GMIFVWAGES
DPADFVGPEA ACESWDEDVY EANEPGMFTT GEGFVTMAEV IADVKLDSDV VVERLLDITE
RARREPVSVK NRGRGALFPV DGTRLISKVL RIGYDAVPQS VVFKPSCVIA STIALRPRVG
GGDGTSMQVE QLHVCLPAKP GLTRVLFRMA FDFVPEGAQN AAGDVWKNLA MQVLQEELED
VRSAGLKSET TSIAVESFRV FKRGDK
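
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below does this in Python and also computes GC content. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record's "Translation table" field is empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCGCGCG CGACGCCGGC GCCGGCGCCG"))  # MARATPAPAP
print(translate("TTCAAGAGAG GAGACAAGTA G"))           # FKRGDK
```

The two fragments reproduce the first ten and last six residues of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the whole protein sequence and the reported GC content.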