Gene OSTLU_47343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_47343 
Symbol 
ID5005033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp173579 
End bp175705 
Gene Length2127 bp 
Protein Length363 aa 
Translation table 
GC content69% 
IMG OID640420454 
Productpredicted protein 
Protein accessionXP_001420924 
Protein GI145353232 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0770421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000226051 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
CTAATTCCTT CGCCCTCTCG TTCGCCCTCT CCAGCGCCTC CGCGTCCGTC GTCGACGTCG 
TCGTCCTCTT CGACGCCTCC GCCCGCGCCA GCGTCCTTCG CAGCGTCGCC CGCAGTTCGT
CCCGCAGCGC GTCCCTTTCC GCCCTCGTCT TCGCCTCTCC CGCCGCCGCT CGATCCCTCG
CTTCCTCCAT CTCCCTCGCC GTCGCTCGCG CCTCCTCCAG CGCGCGCTCG CGCGCCTCCA
GCGCGCTCGT CAGTCGCCGC ACGTCCTCGT CCACGCTCGC GTACGCCTTG CTCAGCCGTC
CCGCCTGCGC CGCCGCCGCG CGCGCGCGCT CCAGCCTCGC CTCGCTCGTC TCCAGCGCCG
CGCGCATCCC CTCCAGCTCC CCGCGCAGGC CGTCCACCGT GTCGTCGAAC GCGCGCGCGT
CGCCGGGCGA CGACGCGCCG CTCGCCGGCG CATCCATCGC GCCGCGGTGT GCCGTCGATC
GATCGCTCGG CGCGCGCGCG CGTCGACCGC GCGAGGTTCG CGGCGCGCGC GCGATCGACG
ACGCGAGGGT CGCGCGATCG CGCGTCGATC GATCGCGCGC GCGGCCGCGC GACCGCGTCG
ATCGATCGCG CGCGCGCCGC GTCGATCGAT GCGTCGTCGA TCGACGCGCG CGCGCGATCG
ACGACGCGGC GGTCGCGCGA TGGCGCGCGC GATCGATCGC GCGCGGTGGT GACGTCACCA
CCGGGCGCGG CGGCGACGCG CCGGCGCGCG GCGCGGCGCG GCGCACACCG ACGCGCGCGC
GCACGACCGC GCGCGATGAC GACGAGCGCG ACGGCGACGA CGGCGACGAC GGCGACGGCG
CGACGCGCGA CGGCGCGCGC GCGCGACGCG GGGTCGAGAC GGGCGCGAGG GGGGGCGCGA
TCGCGCGCGC GAGCGACGAC GCGGACGACG ACGGCGACGC GGGCGACGAC GGAGGGCGCG
GGGGAGGACG CGAGCGCGGA GGGGAAGATG CCGCGACGCG CGCTGGCGGT GAAGACGGCG
ACGCTCGTCG CGGCGCTGAG CGCGCTGCCG ATGGATAAGC GCGCGCTCGC CGAGGGTTCG
ATCGAGAGCT CGTACTGGGA GCAAGTGGAG CTGCCGCTGG AGCCGGGAGT GATTCTGCTG
GACATCGCGT TCAGTTCGAA CGATCCCAAG CACGGGTTCT TGCTCGGGAC GAGGCAGACG
GTGCTCGAGA CGAAGGACGG AGGGAAGACG TGGGACGTGC GCGACTTGAG CGGATTGTTG
GACGACGACG TGAATTATCG CTTTAATAGC GTGTCGTTTT GCGGCGACGA GGGATGGATC
ATCGGTAAGC CGGCGGTGTT GTTGCACACG ACCGATGGTG GCGCGAACTG GGAGCGCGTC
GGGTTGAGCC CGCGACTTCC GGGGGCGCCG GTGTTGATCA CGGCGGTGCA AGATAACGGC
ACGGCTGAGA TGGTGACGGA CGAGGGGGCG ATTTACTTCA CCAAGGACGC GGCGCGCAAC
TGGAAGGCTG CGGTCGAGGA GACCGTCTCC GCGACGTTGA ACCGCACGGT GAGCTCTGGT
ATCACCGGCG CTTCGTATTA CACGGGCACG TTCTCCACGA TCTCGCGCAA CGACAACGGC
GAGTACCTCG GTTTAAGCTC TCGCGGGAAC TTTTACATGT CTTGGGCGCC GGGTCAGGCG
TACTGGCAAC CGCACAACAG AACGTCCGCG CGTCGGGTGC AAAGCATGGG CTGGCGCCCG
GATGGCGGGA TTTGGGAGCT TACCCGCGGC GGTGGCATCT TCTTCTCCGC CGAAACCGGC
CTCCCGGAGG AGGATTCTGA ATTCAACGAA GGTAGAATCG GCTCTCGCGG CTTTGGTCTG
CTCGACTTGG GTTACACCCC GAGCGGCAAG ACGTTCTGGA CCGTGGGTGG CTCTGGAAGC
GTGTTTTACT CTACCGACGC CGGTAAGTCA TGGAAGCGCG ACCGCGGCAC GGACAACGTC
GCGGCGAACC TCTACAACGT CAAGTTCCAA AGCGAAGATC AAGGATTTAT TCTCGGCAAC
GACGGTATTC TCTTGCGTTT CACCGGCGCC AAGTTGTAAC GGGCTGCCAC AATTGAGCAG
CACCCCGCGT CGCGCGTGAA ATTGCAT
 
Protein sequence
MPRRALAVKT ATLVAALSAL PMDKRALAEG SIESSYWEQV ELPLEPGVIL LDIAFSSNDP 
KHGFLLGTRQ TVLETKDGGK TWDVRDLSGL LDDDVNYRFN SVSFCGDEGW IIGKPAVLLH
TTDGGANWER VGLSPRLPGA PVLITAVQDN GTAEMVTDEG AIYFTKDAAR NWKAAVEETV
SATLNRTVSS GITGASYYTG TFSTISRNDN GEYLGLSSRG NFYMSWAPGQ AYWQPHNRTS
ARRVQSMGWR PDGGIWELTR GGGIFFSAET GLPEEDSEFN EGRIGSRGFG LLDLGYTPSG
KTFWTVGGSG SVFYSTDAGK SWKRDRGTDN VAANLYNVKF QSEDQGFILG NDGILLRFTG
AKL