Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18539 |
Symbol | |
ID | 5006053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 235333 |
End bp | 237474 |
Gene Length | 2142 bp |
Protein Length | 713 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421474 |
Product | predicted protein |
Protein accession | XP_001421885 |
Protein GI | 145355265 |
COG category | [R] General function prediction only |
COG ID | [COG4188] Predicted dienelactone hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0118293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000205789 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACGCGC GATTGGCGGC GATGCATCCG GCGAGACGGG CGCTGCCGCT GCTGGCGGTG ATCGCGCAGT GGAGGGTGGA CGGCGCGCGG TGGCAGGTGG TGCCGGCGTA CATCGGGACG GGGGTGTGCA TGAGCGTGGG GGCGGCGGGG AGTAAATTGC CGCTGTGGGC GCGGCGCGCG GCGGCGCTGA GCGGAGGCGC GACGACGTTC GCGAGCGTGC TGAGTGGATT GTTGATACCG GTGTTTCGAA TGCCCAAACC CACGGGACCG CATAAGGTTG GAAAGAGGAC GCTGATGTGG ACGGATCGAT CGAGGAAGAG TTGGTTGTTG AACACGAAGG GACGCGGGGC GTTTCCCGAG CACAGAAAAT TGATGGCGAA CATTTGGTAT CCGGCGTGCG AGCGCGGGCT CAAGGAGGTG AAGAAGCCGC GCAAGGCGAA CTGGCTCGAG CCCTTGCTCG CGAAGTCTTT AGCGGTGAGC TTTTGGGCGC CAGAGTGGTG CGTGAATTAC TTTCGATTGG TTCGCATGGA GGCGCTGGAG GACGTCCCCG TCGCGGGTGG ACGAGGAGAG CAATTCCCGG TGGTCGTGTT TTCGCACTCA TTCACGGGGA TGAAGGAGCA AAACAGCGCG CTGTTGCAAG AGCTCGCGAG TTGGGGGTAC ATCATCGTCA CCGTCGATCA TCCGCACGAC GCGGCGCTGG TGTTGTATCC GGACGGGTCG ACGGCGGATT TCAGAGGATA CGACATGCCG AACGACACCG TGCCGTTCAA CTGGTGGAAA TTTAGAAATC AGCACTTGCG TTGGCGCGCG CTGGATGTGG CTTTCGCCCT GGATCAGGTG GTGGCGATGA ATGCCGATCC TGAAAGCGAC TTTTACGAGC GATTGGATTT GACGCGCGTC GCCGCCATGG GACACTCGTT CGGCGGCGCC GCGGTGGGGA TGCTCGCGCA AATGGATTCT CGAATCCAGT GCTGCATCAT GCTTGATCCT TGGATGTGGC CTTTCGGGTA CGAGCGCATG AAACAAGGCA TCCCATGCCC AGTGTTGGTC TTCGAAGCAC CGCGATTTCT GGGTAATCGG GACATCTTTT GCATTTCCAA CGCCGAGATG ACGTCGGACT TGTGCGCCAA CACGGCGCCG GCGGCGTGCT CGAAAGAGGT CGAGCCCCGA CGTTTGTCCG AAGTTCTCGA AGCCCCCGAC GCGAGTGCGA CAATAGATGT CGCCGGCGAG CAATCGCCGA GCGATGAGAT CGATTACAGA GAAGATGATG CGTCGAACGA AGAATCGGCT CGCGCCGCTT CGCCGGGACG TCCGCCGCGT CCGCCGCGTC GCTCGAGTGG TGCTGATTCG ACAGATAGGC GTGCGACAGA CGCCCGCGCG TTATCACAAA GTAGAAACAG TAGCTGGGGA TCTTTCATGT CGGTGGATGA GCAACGCGAG CCGAGCGGTG TGGCGTTCAA GGCGGTGATC GAAGAGACGA TGCACTTTGA TTTCACCGAT CTCGCCATGG TCGCCCCGTT GACGACGCGC TTACTAGGCG TCGTCGCGGT CGGAGGATAC GAAGTTCATC ACATCACTTC GGCCGCGATT TTGCGTTTCT TGCACGGGTA TAATCATCCG CACAAGTTTG AATCGATCTT CAGCGCGGAG GAACTCGACG AGCAAGCGAG AGCGTTGTAC CCGGGGCCTT GGTTGGGTTG CCCGTTTGAC GAAAAGGCGA AGCGTGTCGC TGAAAAAGAG TCTTTGGACG CCCTCGATGG GCGTGCGACA CCACTCAAGA CGCTTCGCGA GGAATTGTCG AAACCGCGAT CGAGCGGCTG GAAGGAGAAG GGCGGTATTT GCCGATGGAT CCCCACCGAA GAGTTTACTC TCGACAAGAG ACGACCGTGG ATCGACGAAC AAAATCGCGA GATCCGATGG TTGTGCGCCG AGACGGAGGA CAAAGGTGTG CCACTAACGG ACCGCGATTT GCACTGCATG TTCCCCACGC GTTCGTTGAA GAGTACGCGA GACGCCATCC GGGCGTACCG AGACAGCGAT GAAGTGTTTC CCGAGCCGAC CAAGTTAGCG TGGTTGTCGC AGGCGCTTCT CGACGGCGAT GCCGCGGATT CAGGTCCAGA CTACGCATCA CTACGACCGT GA
|
Protein sequence | MDARLAAMHP ARRALPLLAV IAQWRVDGAR WQVVPAYIGT GVCMSVGAAG SKLPLWARRA AALSGGATTF ASVLSGLLIP VFRMPKPTGP HKVGKRTLMW TDRSRKSWLL NTKGRGAFPE HRKLMANIWY PACERGLKEV KKPRKANWLE PLLAKSLAVS FWAPEWCVNY FRLVRMEALE DVPVAGGRGE QFPVVVFSHS FTGMKEQNSA LLQELASWGY IIVTVDHPHD AALVLYPDGS TADFRGYDMP NDTVPFNWWK FRNQHLRWRA LDVAFALDQV VAMNADPESD FYERLDLTRV AAMGHSFGGA AVGMLAQMDS RIQCCIMLDP WMWPFGYERM KQGIPCPVLV FEAPRFLGNR DIFCISNAEM TSDLCANTAP AACSKEVEPR RLSEVLEAPD ASATIDVAGE QSPSDEIDYR EDDASNEESA RAASPGRPPR PPRRSSGADS TDRRATDARA LSQSRNSSWG SFMSVDEQRE PSGVAFKAVI EETMHFDFTD LAMVAPLTTR LLGVVAVGGY EVHHITSAAI LRFLHGYNHP HKFESIFSAE ELDEQARALY PGPWLGCPFD EKAKRVAEKE SLDALDGRAT PLKTLREELS KPRSSGWKEK GGICRWIPTE EFTLDKRRPW IDEQNREIRW LCAETEDKGV PLTDRDLHCM FPTRSLKSTR DAIRAYRDSD EVFPEPTKLA WLSQALLDGD AADSGPDYAS LRP
|
| |