Gene OSTLU_39472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39472 
Symbol 
ID5004914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp296605 
End bp298542 
Gene Length1938 bp 
Protein Length645 aa 
Translation table 
GC content61% 
IMG OID640420335 
Productpredicted protein 
Protein accessionXP_001420815 
Protein GI145352989 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000845209 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.555705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCG CGACCGCGCG GATAACGAGC GCGACGAGGA AAGACGCGCG CGGGCGACGA 
ACGGTGACGC GCGCGAGCGC GCTGGACGAC GCGAGCGAAA CGCACGACGT CGTGGTGATC
GGAAGCGGGA TCGGTGGGTT GTCGTGCGCC GCGCTGTTGG CGAAGTACGG ATACGAGGTG
AAGGTGTTTG AGAGTCACTA TCTCGCGGGC GGGTGCTGTC ACATGTTCGA TCATCGAGCG
CCGGATGGGG CGTTGTATAA GTTTGAGGTC GGACCGAGCA TTTGGGAGGG GTTGGATCGA
CCGACGGGGA ACCCGCTGAG GATGGTGCTG GACGCGCTCG GGGAGACGGT GCCGGTGAAG
ACGTACGACG GGATTTCGAT GTGGACGCCG GAGGGGCACT GGAGGTTTCA AACCGGGGAC
GATGACGCCC CGGGTGGGTT CTGCGATTTG TTGCGCGAGA AGGCGACGGA TCCGGAGTTG
GCGATCAAGG AGTGGAAGGC GCTGAAGGGG CGGTTGGAAC CGTTGTACGA CGCGCTGGAC
GCGTGTCCGC TCACGGCGCT TCGCCAGGAC GCCGGGTTGT TGGTGTCCAC GGTGATTGCG
ATTCCGTTTT ACCTCACCCA TCCGAACGTG ATGTTGGATA TTCCTTATAT TTTGGACTCA
TTTCATAAGT TATCGCGACA GTACGTCACC GAACCGTTCT TGAAGCAGTG GATCGACATG
CTGGCGTTTT TCAGCGGGTT CCCGGCGGAG GGCACGATGG GGGCGACAAT GATTTATAGT
ATTCCAGGAT TCCATCGTCC CGGGGCGTCG TTGTGCGCGC CCGAGGGCGG TACGCAGGCA
GTTGTGGATA AGCTTCAATA CTGCTTGGAA AAGTATGGTG GTGAATTGCA GCTCAAGTCG
CACGTGGAGG AAATCATCGT CGAAGACGGA GAAGCCAAGG GTGTGCGTCT TCGAAACGGT
AAAGTCATCA AGGCGAACGT CGCGGTCGTT TCGAACGCGA CTATCTGGGA CACTGTGCCG
ATGTTGCCCC AGACAGACGA ATTAATAGCA CAAGGTCTCG ACAGGGCGGT GGAGTGGAAG
GACGAAATGT CGGAAATCCC CGCCCTCGGA AGCATCATGC ACTTGTTCCT CGGCATCGAC
GCCACGGGCT TACCCGATCT GGATCCGTCG CATCTGTGCG TTTTGGACTG GGACCGACCT
CTCGGTGATC CCCAAAACGT CATCACAATC TTCATTCCCA CCGTGCTCGA TCCTGAAGTC
GCACCCGAAG GTAAGCACAT CATTCACGTG TACACAGCCG GAAGCGAACC GTACGACATT
TGGGAAGGCA AGGATCGAGG GAGTCAAGAG TACAAGGATT TCAAGCGCGA ACGCGCTGAA
ATCCTGTGGA ATGCCATCGA ACGCATCATA CCGGACATTC GTAGCCGCGT CGAAGTTGAA
GTGTACGCGT CTCCGCAGAC GCATCAACGG TTCCTCCGAC GGCACCGCGG CACGTACGGC
CCGGCGCTCC CAGCCGGCGG CAGCCTCTTC GGCTTCTTAC CTCTGCCCGA AGTTCCGCAA
CCGGGCGTAT TATCCCCTAT CCCCAAGTTA TTACGTTGCG GCGACTCCGT GTTCCCGGGC
GTCGGCGTTC CCGCGGTCGC CGCCTCTGGC GCCATCGCCG CGAGCACGCT CGCCCCGCTC
CCCAAGCACC TCGGTCTCAT GTGGGACGTT TCCCAAACTC AAAACACCTT TTGGCGCGAG
TGCGGAGGCA AGGAAAAGTG GCTCGCGTCC GCGCCCGCGC CTTTCCGACC CGCCGCGGGC
GGCGGCGCCG AATTCATGAC CCCAGACCAA TACTACAAGC CCAAGCCCGA GGCCGAAAAG
TCCTTCGCCG AAATCGGCAA GCGCGGCGTC GTCGCTCAGT CCCGCGAGCG CGGCGTTGCG
GATCGCGAGT CGCGGTGA
 
Protein sequence
MQRATARITS ATRKDARGRR TVTRASALDD ASETHDVVVI GSGIGGLSCA ALLAKYGYEV 
KVFESHYLAG GCCHMFDHRA PDGALYKFEV GPSIWEGLDR PTGNPLRMVL DALGETVPVK
TYDGISMWTP EGHWRFQTGD DDAPGGFCDL LREKATDPEL AIKEWKALKG RLEPLYDALD
ACPLTALRQD AGLLVSTVIA IPFYLTHPNV MLDIPYILDS FHKLSRQYVT EPFLKQWIDM
LAFFSGFPAE GTMGATMIYS IPGFHRPGAS LCAPEGGTQA VVDKLQYCLE KYGGELQLKS
HVEEIIVEDG EAKGVRLRNG KVIKANVAVV SNATIWDTVP MLPQTDELIA QGLDRAVEWK
DEMSEIPALG SIMHLFLGID ATGLPDLDPS HLCVLDWDRP LGDPQNVITI FIPTVLDPEV
APEGKHIIHV YTAGSEPYDI WEGKDRGSQE YKDFKRERAE ILWNAIERII PDIRSRVEVE
VYASPQTHQR FLRRHRGTYG PALPAGGSLF GFLPLPEVPQ PGVLSPIPKL LRCGDSVFPG
VGVPAVAASG AIAASTLAPL PKHLGLMWDV SQTQNTFWRE CGGKEKWLAS APAPFRPAAG
GGAEFMTPDQ YYKPKPEAEK SFAEIGKRGV VAQSRERGVA DRESR