Gene OSTLU_16384 details

Gene Information

Locus tag: OSTLU_16384
Symbol:
ID: 5003375
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 171930
End bp: 174713
Gene Length: 2784 bp
Protein Length: 317 aa
Translation table:
GC content: 64%
IMG OID: 640418796
Product: predicted protein
Protein accession: XP_001419298
Protein GI: 145349764
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases
TIGRFAM ID: [TIGR01476] bacteriochlorophyll/chlorophyll synthetase
            [TIGR02056] chlorophyll synthase, ChlG
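
The length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the stop codon; the record states neither convention, and the function names are illustrative only. Under those assumptions the gene span comes out at the reported 2784 bp, while the coding sequence needed for a 317 aa protein is much shorter, which fits the record's "Is gene spliced: Yes".

```python
# Values copied from the Gene Information fields of this record.
START_BP = 171930
END_BP = 174713
PROTEIN_LENGTH_AA = 317


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)    # span on the replicon
cds_length = coding_length(PROTEIN_LENGTH_AA)  # coding nucleotides incl. stop
noncoding = gene_length - cds_length           # remainder of the gene span

print(gene_length, cds_length, noncoding)  # 2784 954 1830
```

The 1830 bp difference is the part of the gene span that does not encode protein under these assumptions; the record itself does not say how that remainder is divided.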


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.960686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.418422
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCGT CGAAGTAATC GGCGGGAGAT TCGAAGTCTT CTCCAATCGA CATTCCGCCG 
TGGAAATCGG CGAAGACGCG TTCGCACGCC GCGCGTCGGT CGCCCGCTTG TACGGCTAAC
TCGACGGCTT CGCGGCCGTC AAACGAGAGC GCTTGGTACC AATCCCGCGA GTTCGATGAC
TGACGCAAAT CTTCCACGGA ACCGACGAAA GGGCACGTGC GCACGCCTTC CGTGAGTTCT
GCTAACGAAT CGCACTTGTG CGGGTTGTAG TCTTTCATGG CGCCCTCCAA GCCGTAGTGG
TGTCGAATCG CGTACGTGTG CTCGCCTTGG TACGCCTTCA TTCGCCCCTT CGGAAACACG
CCGCGACGCC AGAACTCGAG AGCTTCGTTC ACCGTGAACC CGATGCCCTT GAAGAACAGG
TTGAGTTGAA ACCTGTCGTG CCACTTTAAG TGTGCATCTC GGCGTAAATC TTCCATCTTG
TTTCGAATGC ACGGAGGAAA CGGTCTCGTG GCGACGTCGC GATTGACGTT TGAGTCAAGC
GCTTTGCGCG CGCGACGCAA TTCCGCCCCA GGTCGCGTCT CGATGATCGC TTCGACTTCA
CCGTCGGCCT CGATCGCAAT AGCGTGTTGC GCCACGCCAC CCACACGCTG CGCGCGACGT
CGCTCGCGCC TCGCCTTTGC GAAATTGAAC CCGTTGGGTG CCGATATTTT CGGCAACTCG
CTCGCATCCG GCTGATACGT GCGCACGATT CGCAGCGCGT TCGATCGACA CTTCGGATCG
AACGCCATCG GCGCCTTCCC CGTCGCCAAC CACGCGTGCA TCTTCCGCAA CACCCGATCG
CAATCTTCCC GTATCTCTCG CGCCACCGCC TCCGCGTACG TCTCCGCCAA CTCTCGCTTC
GGCACCAACG CGTACCCGTC TCGCATCCGT CGCACCGCTC GCCTCGCCAC CGCCCGCCCC
GCCCGCTCGA ACGTCATCCA CACGTCTCCC TCCTCCTCGA TCGCGTAGTC GCAATCCGGC
TTCAAAAACC GCGCCGGTAT CTCGCGTTCG CCGCATTCGA CGCGCGCCTT CGTCAAGTCG
CGCTCCGCGC GCGCCAACCA CTCCGCCCCG TACTCCCACC GCTCCAGCCT CCGCTTCCGC
TCCACCGCGC AGAACGCCAT CTTCACCATC GCCAGCGACG TCCGATCGCC GACCTCGTCG
TCCGCGAACA GCCCGCGCGC GACCTCGGCC GCGTCCGCGT CGCGAATCGC CCACGCGTCG
TCGCCGTTTG CCCCGTCGCT CGCGCGCGCG TTCGCGCGCA CGTGCTGAAT CCCGTACAGC
GCGTTCAACC GCGTCTTCGC GCGTCGCACG AGCGTCTCCA CGTCGACGTC CGCGCCCGCG
CGCGGCATCG CGCGCGGCGC GCGCGCGCGC GCCGAGGTCT GCGGCGAACC CGCGCGCGCG
TCGATCGACG ATCGCGCGTC GACCCTCGCG CGAGGTTCGA ACGCGAACCT CGGATTCGGG
TTCGATCGCG GGCCGACGCC GTGGATAACG ATCGCGCGCG CGCGACGTCG CCAGACGACG
ACGCGCGAGC GCGCGCGCGC GATGCGAGCG ATGCTGTCGA CGCGGACGAC GGCGACGCCG
AGAGCGACGC CGCGCGCGGC GCGGCGCGGC GCGACGGCGA CGCGCGGGCG CGCGATGACG
CCGACGATGG CGACGGCGAC GTCGAGGATG ACGACGACGG CGGTGGCGAC GACGCGCGGA
CGACGACGCG CGCGAACGGC GGCGTCGAAC GATGACGACG CGAACATCAA GGCGGAGAAC
ACGGCGGAGG ACGTGACGAC GACGGAGCTC GGTGGGAACG TGAGACAGCT GCTCGGGTTC
AAGGGCGCGG CGGAGACGGA CGACGTGTGG AAGATTCGCG TGCAGCTGAC CAAGCCGGGG
ACGTGGGTGC CGCTGATTTG GGGCGTCATG TGCGGGGCGG CGGCGAGCGG GCACTACGAG
TGGAACCTGG ATAACGTCGG GAAGGCGCTG CTGTGCATGA CGATGAGCGG GCCGTTTTTG
ACGGGGTACA CGCAGACGAT CAACGATTGG TACGATAGAG AGATCGATGC GATCAATGAG
CCGTACAGGC CGATCCCGAG CGGGTTGATT TCGGAGAATG AAGTCAAGGC GCAGATTGCG
GTGCTGTTGG TCGGTGGATG GTTGTGCGCG CTGCAACTCG ACCGATGGTG CGAACACGAT
TTTCCAATCG TGTTGGCGCT GTCGCTCTTC GGGTCGTACA TATCGTACAT TTACAGCGCG
CCGCCGCTGA AGCTCAAGGC TGAGGGCTGG AAGGGGTGTT ACGCTCTTGG CTCGTCGTAC
ATCGCACTGC CGTGGTGGGC GGGTATGGCG ACTTTCGGGC AATTGACGCC GGACGTCATG
TTTTTAACCG TCCTCTACTC GATCGCCGGT TTGGGCATCG CCATCGTGAA CGATTTCAAG
TCCATCGAAG GTGACCGAGA GCTCGGCTTG CAATCGCTCC CCGTGGCGTT CGGCATCGAG
AAGGCGAAGT GGATCACGGT GTCTACGATC GACATCACGC AGCTCTTCGT CGCGTGCTAC
CTGCGCGCCA TCGGCGAAGA GACGTACTCA AACGTCTTGT TTTGCTTGAT CTTCCCACAA
ATCTTCTTCC AGTTCAAGTT CTTCTTGCCG GATCCCATCA AGAACGACGT CAAGTATCAA
GCAAGCGCGC AGCCGTTCTT GGTGTTTGGT CTGCTCACGA CGGGCTTAGC GTGGGGTCAT
CACATCAACG CGCTCGGCAT GTAA
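
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence written in this blocked layout. The helper below is a minimal sketch with a hypothetical name, `gc_percent`; it ignores the spaces and line breaks between blocks and counts only G and C. Whether the record's 64% was computed over the full gene span or over some other region is not stated, so a recomputed value may not match it exactly.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 5 of 10 bases are G or C.
print(gc_percent("ATGGATGCGT"))  # 50.0
```

Pasting the full gene sequence, blocks and line breaks included, into the function gives the figure for the whole span.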
 
Protein sequence
MDASKQLLGF KGAAETDDVW KIRVQLTKPG TWVPLIWGVM CGAAASGHYE WNLDNVGKAL 
LCMTMSGPFL TGYTQTINDW YDREIDAINE PYRPIPSGLI SENEVKAQIA VLLVGGWLCA
LQLDRWCEHD FPIVLALSLF GSYISYIYSA PPLKLKAEGW KGCYALGSSY IALPWWAGMA
TFGQLTPDVM FLTVLYSIAG LGIAIVNDFK SIEGDRELGL QSLPVAFGIE KAKWITVSTI
DITQLFVACY LRAIGEETYS NVLFCLIFPQ IFFQFKFFLP DPIKNDVKYQ ASAQPFLVFG
LLTTGLAWGH HINALGM