Gene OSTLU_18539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18539 
Symbol 
ID5006053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp235333 
End bp237474 
Gene Length2142 bp 
Protein Length713 aa 
Translation table 
GC content60% 
IMG OID640421474 
Productpredicted protein 
Protein accessionXP_001421885 
Protein GI145355265 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0118293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000205789 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACGCGC GATTGGCGGC GATGCATCCG GCGAGACGGG CGCTGCCGCT GCTGGCGGTG 
ATCGCGCAGT GGAGGGTGGA CGGCGCGCGG TGGCAGGTGG TGCCGGCGTA CATCGGGACG
GGGGTGTGCA TGAGCGTGGG GGCGGCGGGG AGTAAATTGC CGCTGTGGGC GCGGCGCGCG
GCGGCGCTGA GCGGAGGCGC GACGACGTTC GCGAGCGTGC TGAGTGGATT GTTGATACCG
GTGTTTCGAA TGCCCAAACC CACGGGACCG CATAAGGTTG GAAAGAGGAC GCTGATGTGG
ACGGATCGAT CGAGGAAGAG TTGGTTGTTG AACACGAAGG GACGCGGGGC GTTTCCCGAG
CACAGAAAAT TGATGGCGAA CATTTGGTAT CCGGCGTGCG AGCGCGGGCT CAAGGAGGTG
AAGAAGCCGC GCAAGGCGAA CTGGCTCGAG CCCTTGCTCG CGAAGTCTTT AGCGGTGAGC
TTTTGGGCGC CAGAGTGGTG CGTGAATTAC TTTCGATTGG TTCGCATGGA GGCGCTGGAG
GACGTCCCCG TCGCGGGTGG ACGAGGAGAG CAATTCCCGG TGGTCGTGTT TTCGCACTCA
TTCACGGGGA TGAAGGAGCA AAACAGCGCG CTGTTGCAAG AGCTCGCGAG TTGGGGGTAC
ATCATCGTCA CCGTCGATCA TCCGCACGAC GCGGCGCTGG TGTTGTATCC GGACGGGTCG
ACGGCGGATT TCAGAGGATA CGACATGCCG AACGACACCG TGCCGTTCAA CTGGTGGAAA
TTTAGAAATC AGCACTTGCG TTGGCGCGCG CTGGATGTGG CTTTCGCCCT GGATCAGGTG
GTGGCGATGA ATGCCGATCC TGAAAGCGAC TTTTACGAGC GATTGGATTT GACGCGCGTC
GCCGCCATGG GACACTCGTT CGGCGGCGCC GCGGTGGGGA TGCTCGCGCA AATGGATTCT
CGAATCCAGT GCTGCATCAT GCTTGATCCT TGGATGTGGC CTTTCGGGTA CGAGCGCATG
AAACAAGGCA TCCCATGCCC AGTGTTGGTC TTCGAAGCAC CGCGATTTCT GGGTAATCGG
GACATCTTTT GCATTTCCAA CGCCGAGATG ACGTCGGACT TGTGCGCCAA CACGGCGCCG
GCGGCGTGCT CGAAAGAGGT CGAGCCCCGA CGTTTGTCCG AAGTTCTCGA AGCCCCCGAC
GCGAGTGCGA CAATAGATGT CGCCGGCGAG CAATCGCCGA GCGATGAGAT CGATTACAGA
GAAGATGATG CGTCGAACGA AGAATCGGCT CGCGCCGCTT CGCCGGGACG TCCGCCGCGT
CCGCCGCGTC GCTCGAGTGG TGCTGATTCG ACAGATAGGC GTGCGACAGA CGCCCGCGCG
TTATCACAAA GTAGAAACAG TAGCTGGGGA TCTTTCATGT CGGTGGATGA GCAACGCGAG
CCGAGCGGTG TGGCGTTCAA GGCGGTGATC GAAGAGACGA TGCACTTTGA TTTCACCGAT
CTCGCCATGG TCGCCCCGTT GACGACGCGC TTACTAGGCG TCGTCGCGGT CGGAGGATAC
GAAGTTCATC ACATCACTTC GGCCGCGATT TTGCGTTTCT TGCACGGGTA TAATCATCCG
CACAAGTTTG AATCGATCTT CAGCGCGGAG GAACTCGACG AGCAAGCGAG AGCGTTGTAC
CCGGGGCCTT GGTTGGGTTG CCCGTTTGAC GAAAAGGCGA AGCGTGTCGC TGAAAAAGAG
TCTTTGGACG CCCTCGATGG GCGTGCGACA CCACTCAAGA CGCTTCGCGA GGAATTGTCG
AAACCGCGAT CGAGCGGCTG GAAGGAGAAG GGCGGTATTT GCCGATGGAT CCCCACCGAA
GAGTTTACTC TCGACAAGAG ACGACCGTGG ATCGACGAAC AAAATCGCGA GATCCGATGG
TTGTGCGCCG AGACGGAGGA CAAAGGTGTG CCACTAACGG ACCGCGATTT GCACTGCATG
TTCCCCACGC GTTCGTTGAA GAGTACGCGA GACGCCATCC GGGCGTACCG AGACAGCGAT
GAAGTGTTTC CCGAGCCGAC CAAGTTAGCG TGGTTGTCGC AGGCGCTTCT CGACGGCGAT
GCCGCGGATT CAGGTCCAGA CTACGCATCA CTACGACCGT GA
 
Protein sequence
MDARLAAMHP ARRALPLLAV IAQWRVDGAR WQVVPAYIGT GVCMSVGAAG SKLPLWARRA 
AALSGGATTF ASVLSGLLIP VFRMPKPTGP HKVGKRTLMW TDRSRKSWLL NTKGRGAFPE
HRKLMANIWY PACERGLKEV KKPRKANWLE PLLAKSLAVS FWAPEWCVNY FRLVRMEALE
DVPVAGGRGE QFPVVVFSHS FTGMKEQNSA LLQELASWGY IIVTVDHPHD AALVLYPDGS
TADFRGYDMP NDTVPFNWWK FRNQHLRWRA LDVAFALDQV VAMNADPESD FYERLDLTRV
AAMGHSFGGA AVGMLAQMDS RIQCCIMLDP WMWPFGYERM KQGIPCPVLV FEAPRFLGNR
DIFCISNAEM TSDLCANTAP AACSKEVEPR RLSEVLEAPD ASATIDVAGE QSPSDEIDYR
EDDASNEESA RAASPGRPPR PPRRSSGADS TDRRATDARA LSQSRNSSWG SFMSVDEQRE
PSGVAFKAVI EETMHFDFTD LAMVAPLTTR LLGVVAVGGY EVHHITSAAI LRFLHGYNHP
HKFESIFSAE ELDEQARALY PGPWLGCPFD EKAKRVAEKE SLDALDGRAT PLKTLREELS
KPRSSGWKEK GGICRWIPTE EFTLDKRRPW IDEQNREIRW LCAETEDKGV PLTDRDLHCM
FPTRSLKSTR DAIRAYRDSD EVFPEPTKLA WLSQALLDGD AADSGPDYAS LRP