Gene OSTLU_43564 details


Gene Information

Locus tag: OSTLU_43564
Symbol:
ID: 5006739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 196543
End bp: 198207
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table:
GC content: 64%
IMG OID: 640422160
Product: predicted protein
Protein accession: XP_001422516
Protein GI: 145356601
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID:
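
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both are assumptions, not statements made by the record. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 196543
end_bp = 198207
gene_length_bp = 1665
protein_length_aa = 554

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
coding_codons = codon_count - 1

print(span_bp == gene_length_bp)           # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with the protein length
```

Under these assumptions all three checks print True for the values listed here.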


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.079373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0235879
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGC GCGCCGCGCT GTCGACGCGG CCGGCGCCGG CGCGCGCGCG CGCGCGTCCC 
GCGCGGCGCG CGACGATCGC GCGCGCGCGC GACGCGGACG TCGTCGTCGT CGGCGCCGGT
CTCGGCGGCC TGAGCGCGGC GGCGCTGCTC GCGTCGCGGG GACTGCGCGT GACCGTGCTC
GAGGCGCACG TCGTCGCGGG CGGCGCCGCG CACGCGTGGA CGCGCGGTGG ATACACGTTC
GAATCCGGAC CGTCGCTGTA CTCGGGGCTG GAGGCGCGAC CGACGACGAA CCCGATGGGA
CAGGTGCTGC ACGCGATCGA CGAACGCGTG GCGTGCGCGC GGTACAACAC GTGGACGTGT
CACCTGCCGG AGGGAACGTT CGTGACGGAG GTCGGGAACG ATCAGTTCTT GGAGGTGCTG
CGAGAGTACG TGGATGAGGA GGCGTGCGAG GAGTGGACGC GGTTGAAGGC GCTGATGGAA
CCGCTGGCGA CGGCGAGCGC GTCGCTGCCG CCGGCGGCGG TGCGAAGCGA CGTCGGAGGG
GCGTTCACGC TGGCGCGATT CGTGCCGGGA TTGGTGCGAT CGGCGCCGTA CATCGGGAAG
ATTATGGCGC CGTATAGCAA GTTTATGGCG GAGAACGACA TCAAGCACCC GTTCATTCGT
AATTACATGG AGATGCTCTG TTTTTTGCTC AGCGGGGCGC CCGCGAGCGG AACGATGGCG
GCGGAGATCG GATACATGTT TGACGATTGG TACAAGCCAA ACTCGATGCT CGAGTTTCCG
ATGGGCGGGA GCGGAGCGAT CGTGGACGCG CTCGTGCGAG GGCTGAAGAA GTTTGGCGGC
GAGCTAGAGC TCGGGACACA CGTCGAGGAA ATCATCGTCG AAGGCGAAGG CAAGGAGAAG
CGCGCGACGG GCGTGCGGAC GAGAGACGGA AAGGTTTACA AGGCTAACAA GGCTGTGATA
TCGAACGCAA CGCTCTGGGA CACGGTACCG ATGCTACCAC CGGATGCGAT GCCTCGAGAG
TGGATAGAAG AAGCGACATC GACGGCGATG TGCGAGTCGT TCATGCACTT GCATCTCGGC
ATCGACGCCG CCGGCCTTCC CGATGACTTA GAGATTCATC ACATTTACGT CGAAGATTGG
GATCGCGGCG TCACGGCGGA GCAAAACATG GTGCTCGTGA GCATTCCTTC CGTACTCGAC
CCCTCGATGG CGCCCGAGGG CAAGCACGTG ATTCACGCGT ACACGCCCGG AAACGAGCCC
CTCGATATTT GGGACGGGCT CGACCGTAAC TCGGACGAGT ACAAAAAGTT GAAGGAGGAA
CGCTCGCAAG TGCTTTGGAA AGCTGTAGAA AAGATCATCC CAGACGTGCG CAAACGCTGC
GAAATCACGA TGGTCGGCAC GCCCGTGACG CAAAAGAGAT TCTTGCGAAG GGCGAAGGGC
ACGTACGGGG GCACGGGTTG GATATCCGAG GATCAAGACA GCATTCCGAT CACGAGCGCG
TCGACGCCGC TTCCAGGCCT GCTTGTCGTC GGCGACTCTA ACTTTCCCGG CCCTGGCGTC
CCGGCCGTAG CCGCGGGCGG TTGGAGCGCG GCGAATGAAT TGATTTCGCC GCTTCAAACC
GCCGCTTTGC TCGACAAAGT GTGTCCTCCG GGGACGTCAA AGTGA
 
Protein sequence
MATRAALSTR PAPARARARP ARRATIARAR DADVVVVGAG LGGLSAAALL ASRGLRVTVL 
EAHVVAGGAA HAWTRGGYTF ESGPSLYSGL EARPTTNPMG QVLHAIDERV ACARYNTWTC
HLPEGTFVTE VGNDQFLEVL REYVDEEACE EWTRLKALME PLATASASLP PAAVRSDVGG
AFTLARFVPG LVRSAPYIGK IMAPYSKFMA ENDIKHPFIR NYMEMLCFLL SGAPASGTMA
AEIGYMFDDW YKPNSMLEFP MGGSGAIVDA LVRGLKKFGG ELELGTHVEE IIVEGEGKEK
RATGVRTRDG KVYKANKAVI SNATLWDTVP MLPPDAMPRE WIEEATSTAM CESFMHLHLG
IDAAGLPDDL EIHHIYVEDW DRGVTAEQNM VLVSIPSVLD PSMAPEGKHV IHAYTPGNEP
LDIWDGLDRN SDEYKKLKEE RSQVLWKAVE KIIPDVRKRC EITMVGTPVT QKRFLRRAKG
TYGGTGWISE DQDSIPITSA STPLPGLLVV GDSNFPGPGV PAVAAGGWSA ANELISPLQT
AALLDKVCPP GTSK
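
The relationship between the two listings above can be illustrated by translating a few codons. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper and the table construction are illustrative, not part of the record. Only the first 30 and last 45 bases of the gene sequence are used.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 45 bases of the gene sequence listed above.
first_residues = translate("ATGGCGACGC GCGCCGCGCT GTCGACGCGG")
last_residues = translate("GCCGCTTTGC TCGACAAAGT GTGTCCTCCG GGGACGTCAA AGTGA")

print(first_residues)
print(last_residues)
```

The two printed strings can be compared with the start and end of the protein sequence listing; the final TGA codon is read as a stop and contributes no residue.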