Gene OSTLU_31872 details

Gene Information

Locus tag: OSTLU_31872
Symbol:
ID: 5001997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 676844
End bp: 677992
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table:
GC content: 56%
IMG OID: 640417418
Product: predicted protein
Protein accession: XP_001418082
Protein GI: 145347241
COG category: [R] General function prediction only
COG ID: [COG0384] Predicted epimerase, PhzC/PhzF homolog
TIGRFAM ID: [TIGR00654] phenazine biosynthesis protein PhzF family
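
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 676844
end_bp = 677992

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1149 0 382
```

Under these assumptions the computed values agree with the reported Gene Length (1149 bp) and Protein Length (382 aa).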


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.888244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.28032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA GGCACGCGTA CGCGATCGCG AACGCGTTCC CGTCGGCGAC GAAAGGCGAC 
GACAGTGGGA ATCCCGCGGC CGTGGTGCTT TTGCCGGACG ATGGACCGTG GCCGCCGGAC
GCGGTGATGC AGGCGGCGGC GAGCGAATTA GGCTTGAGCG AGACGGCGTT CGCGAAGCGG
GCAGACGTGG AGGTCGTCGG CGCGATGGTG CATCGGATGT ACGACATACG GTGGTTTACG
CCAAAGTGCG AGATCAATCT GTGCGGGCAC GCGTCGATGG CGACGGCGCA CGAGATTTTT
CGAGCGCCGG GGAACGAGCA CGTCACTAAG ATTGGATTTT TATACGGCAA GCCGCGTGAT
GTGTGCGATG GAACACAAGT GAAAGCCGCT TACGAGTCGT TGTTCGTGTG GAAAGATTTG
GACGGCGGTG TAGAGTCGTC CTACGCGATG GCACTTCCGG AGGAACGCGC GAGATCCTTT
CCGGACGCGC TTGGTGAGGA TGGGTTCTTA CACCCTCAAG AAATCGTGAA CATGATTCGA
GGCTGCTTTG GCGACGACCA CGAGCGCTCG AACTCGCAGA GCTCTGCGAC TTCCAAGGCG
AAGTACGAGC TTCGGTACAA CTTCATCGGC GACTTGTTTT TCATAATCGA CACGGATGAT
GCACCCGACG AGGTGTTTGA GGCGTGTTTC GACCGCTTCA TGAATCACGC GCCCGATCTG
AAGGCGATAT CGGAAGTGGG GAGGTGCTTC ATCGAGGAAA TGAAGTACGA CTTTGAAATG
GTTCAAGGTT TTCGTGGTTT GTGCGTGCTC TTGACGGTGA AAAATCGGCC GAATCATTCG
TACGATTTCT ACACGCGATG GTTTGGGCCG GATGTCGGTA TCGACGAAGA TCCCGTGACG
GGTAGCGCGG CGAGCGGCTA TGCTAGATTT CTTGATGACA AGTTGCCCGA GGTCGTCGGT
AAAAAGCGCG GTTGTCAAAT GTCGAAACCC AGGGGCGACA TCACGGTGAG TCTCTCGGAC
AACCCCTACC CCGACACGGA TGTTCATGTC GAGGTTGTCG GCAAAGTTGC GACGAGATCG
AGTGGTGTTC TCGAGGTTTC TGTTCTAAAG GATGGTGATA TATCAGTCGT GAATCGACGA
AACTCCTAG
 
Protein sequence
MTTRHAYAIA NAFPSATKGD DSGNPAAVVL LPDDGPWPPD AVMQAAASEL GLSETAFAKR 
ADVEVVGAMV HRMYDIRWFT PKCEINLCGH ASMATAHEIF RAPGNEHVTK IGFLYGKPRD
VCDGTQVKAA YESLFVWKDL DGGVESSYAM ALPEERARSF PDALGEDGFL HPQEIVNMIR
GCFGDDHERS NSQSSATSKA KYELRYNFIG DLFFIIDTDD APDEVFEACF DRFMNHAPDL
KAISEVGRCF IEEMKYDFEM VQGFRGLCVL LTVKNRPNHS YDFYTRWFGP DVGIDEDPVT
GSAASGYARF LDDKLPEVVG KKRGCQMSKP RGDITVSLSD NPYPDTDVHV EVVGKVATRS
SGVLEVSVLK DGDISVVNRR NS
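
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that formatting, translate the DNA, and compute a GC fraction. It is illustrative only: the record leaves the Translation table field empty, so the standard genetic code is assumed here, and the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical.

```python
# Assumption: standard genetic code (the record does not state a table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons with characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGACGA GGCACGCGTA CGCGATCGCG AACGCGTTCC CGTCGGCGAC GAAAGGCGAC"
dna = clean_sequence(first_line)
print(translate(dna))  # prints: MTTRHAYAIANAFPSATKGD
```

The translated fragment matches the first two blocks of the protein sequence (MTTRHAYAIA NAFPSATKGD). Passing the full cleaned gene sequence to the same functions would be the natural way to check the reported protein and GC content.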