Gene OSTLU_40637 details

Gene Information

Locus tag: OSTLU_40637
Symbol:
ID: 5005782
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 456404
End bp: 458029
Gene Length: 1626 bp
Protein Length: 510 aa
Translation table:
GC content: 63%
IMG OID: 640421203
Product: predicted protein
Protein accession: XP_001421803
Protein GI: 145355088
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
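
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the protein length does not count the stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
gene_len = span_length(456404, 458029)   # genomic span of the gene
cds_len = coding_length(510)             # 510 codons plus one stop codon
non_coding = gene_len - cds_len          # bases in the span that are not coding

print(gene_len, cds_len, non_coding)     # prints: 1626 1533 93
```

Under these assumptions the genomic span is 93 bp longer than the coding sequence needs to be, which is consistent with the record flagging the gene as spliced.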


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAGA GGACGACTCG GTCCGCGACG GTTCGACGAC ACCGAAGCGC GGTGCGACTC 
GAAGCGACGT CCGAGGCGGC GACTGCGGAG ACGAACGCGC CGCCGGCGGC GGCGGGGGAG
TCGTTCGATT GGTCCAGCGC GTGGTATCCC CTTCGTCCGG TGAGCTTTTT GGATGCCAAC
GAACCGAACG AGCTGCGCGT GCTCGGTAAA AAGCTCGTCG CCTACCTCGA CCCGACGTGC
AAGGAGTGGC GAGTGTTGGA GGATAGCTGT CCGCACAGAC GTGCGCCTTT GAGCCTGGGA
TACGTTCAAA AAGACGGCAC GCTGGCGTGC CGGTACCACG GCTGGGCGTT TGACGGCAAG
GACGGCTCGT GCGTGTCCAT CCCGATGTCC GTCGATGAAG CCGCGGAAAA GACCGCGTGC
GCATCGCCGC GATCTTGCGC TACGAGCTAT CCGAGCCGCG TCGAAGACGG AATTCTCTGG
GTGTGGCCGA CGGCGGGCGC GGATGGCCTT CTCGCCAGCG CGGGCGCGCC GTGCGCGACG
TCTCTGGCGC GAGAAGGCAC GCTTCCCGGG GAGTGGGGCA TGGTGGAGCT TCCCGTGGGT
TACGCCCCGG CGCTCGAGAA TCAGTTTGAC CCGTCTCACG CGGAGTGGTT GCACGCGAGG
TACGACGCCG AAGGACAGCT CGACGAGCGC GCGAACGCGG GTTTCGTGGC CATGACTGAG
TTCAGCGTTC GCGAGGGGAC GATGCAAAAG GATGGATTCG TGGTCGAGCA CGGTGGATAC
AACAAGTCGA ACGTCGGCGT ATCGGCATCG CGCGTGTTCA CCGCGCCGTG CTCGAGTCGA
AGCGAATACT TGGACGCCAA GGGTAAAAAG TACCTCTCGG CGGCGATTCT CTACGCGCCG
ACGGAGCCCG GACGAACGCT CATGTTTACA AAGTTCCAAG CCCACCAGGC GAGCGCGGTG
CAGGGCGCGG GCGCGCGTAA GGTTTCACCC GCCGATCGCA TCAACTCCCT TGTCACCGCT
CCCGCGACGT CGCTGTTTGA CTTTTACGTC GACAACTTTA CGAGCGACCC AAAGCTCGTG
CGCGTAGGGC TGTCGCACGG CACGCCGCCG GGATCGAGCG CGTACAACTT GGGGGACCAG
GATATCTTAG CCATGCACGG AGTTGAGGTC GAGATGGAGC TTCAAAACAA ACCGTGGAAA
CAATCGTATT ATTTGCCGAC GCCCGCCGAC GCTGGAGTGT CGGCGTTTAG AAATTGGATG
GACAAGCACG CCGGAGGCAA AGTCGCGTGG GCGCCGGGCG TCGTCGACGA CGCGTCGAAG
GTGAAATCCG AAGCCGAACA GCTGGATCGG TACCATCGTC ACACCAAGCA CTGCGTGGCG
TGTAAGACGG CGCTCAGCGA ACTCGGCGTG CTCGAAGAGC GATGCGTCGC CGCGAGCAAG
TACTTGCTCG CCGGCGGGTT GTTCCTCGCC GTCACGGGTG CAGCGTTCGA TCAAGAAGCG
CCAGCCATCA TCGCCACCTG TCTCGCCGGC GCCTCTCTCG TCGGCGCCGA AAAGGTTCGC
GACATGCAGC ACGAATTCCT GTCGAGCGTG CCTCGAAGAG GCGTGCCGAA ACCGAAACTT
TGGTGA
 
Protein sequence
MDKRTTRSAT VRRHRSAVRL EATSEAATAE TNAPPAAAGE SFDWSSAWYP LRPVSFLDAN 
EPNELRVLGK KLVAYLDPTC KEWRVLEDSC PHRRAPLSLG YVQKDGTLAC RYHGWAFDGK
DGSCVSIPMS VDEAAEKTAC ASPRSCATSY PSRVEDGILW VWPTAGADGL LASAGAPCAT
SLAREGTLPG EWGMVELPVG YAPALENQFD PSHAEWLHAR YDAEGQLDER ANAGFVAMTE
FSVREGTMQK DGFVVEHGGY NKSNVGVSAS RVFTAPCSSR SEYLDAKGKK YLSAAILYAP
TEPGRTLMFT KFQAHQASAV QGAGARKLVR VGLSHGTPPG SSAYNLGDQD ILAMHGVEVE
MELQNKPWKQ SYYLPTPADA GVSAFRNWMD KHAGGKVAWA PGVVDDASKV KSEAEQLDRY
HRHTKHCVAC KTALSELGVL EERCVAASKY LLAGGLFLAV TGAAFDQEAP AIIATCLAGA
SLVGAEKVRD MQHEFLSSVP RRGVPKPKLW
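
The sequence blocks above are printed in groups of ten residues with line breaks, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence, split across two lines on purpose.
example = "ATGGATAAGA GGACGACTCG\nGTCCGCGACG"
seq = clean_sequence(example)

print(len(seq), round(gc_fraction(seq), 2))   # prints: 30 0.6
```

Applying the same two helpers to the complete gene sequence gives a length and GC fraction that can be compared with the gene length and GC content listed in the record.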