Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32403 |
Symbol | |
ID | 5002200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 816433 |
End bp | 818433 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417621 |
Product | predicted protein |
Protein accession | XP_001418342 |
Protein GI | 145347785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTAGGT GCTTTCCCAC GTGGAGTCGC GGCGTCGGTG CCGCGGAAGA GGCGCGAAGA GGAGAGTCTG ACGTGAAAAA GACGAAGAAA GAGCCGCGCG CGCCCTTGTT GTCTTTGAAA GAATACAAAG AGAACATGGA AGAGAAGATG GCGCAGAAGC AGCGCGAGAA GGAGGCAAAA CAGCGCGAGA AAGAAGAGAA GGAGCGCAAG CGCAAGAAGG AGGAAGATGA GGCATTGCGA CGTTTAGTAG AGGTGAATGT GACGCACGGC GCCGTTTCCG AAAACGAAGA CGCAGAGACA AAAGGCGAGA CGTTGGAACC GAATAGCACG GAGACGACGA CGGTGGACGA AGAACCCGCG CCAAGCGAGG TTTCGATCGA AGTAGAAGGC GGGCAGCAGC AGGCGGAGAC GATGGACGGA GCGTCGACGG CGGAGACTGG CGAGCTCGCT CGCGACGCCG TGAGCGAAAC GTCGCAGGCG GCGCCCGCGC CGGTCGAACC GCCCGTGATG GAAGAAGAAG ATACCGACGG TGAAGCAACT TTCGCTGAGC TCGTGATCAA ACCTGAACGC CTCACGGAAG CAGACGCGGA GATGTACAAT TACGCCGCGA GCTTCAACGG AGCAAAGGTG GTGGCGAGCG ATAAGGATTC AAAACACGCG AGCGCCGCTT TGAAGGAAGA TAAAGATGTT TATTACATTT CTCCATGCGC TTCGGAAAAG TTTGTCACTG TTGAGCTGAG CGAGGAGGTG ACGGTGACGA GCTTGGTTTT AGGAAACTTC GAGTTTCATT CGTCTCGCGT CAAAGATTTC GAAGTTTGGG GCACGGACGG GCACCACGCT ATTGAGGAAG GTTGGAAGAG ACTGATGATT GGACGTGCGG ATAACACGCA AAACTATCAA AAGTTTGCCG TGCCTTCGCC AGCGTGGGTG CGTTACGTAC AAATTCGCAT GACTGGTCAT CACGATCAGC AACACTTTTG CACGTTGAGC CTGCTGCGCA TCCACGGTAA AGACGCCAAG GAGACGTTGA AAGAAGAGAT GGAGCGTTTG CAAGCGGAGG TGCAAGAGGT AGAGTCGTTA TTGTCAGACG AGGACGAGGA CGAGGACGAG GACGAAGACG TGGATGTTCG CGAAAGTTCT GCAGAAGTTG TGCTAGACGT TGAGGAGCAA AATCGTGAGG AAACAAACGC GAGCGCTGTG GTGGGCGAAG AAAACGAGCG CGCGTCGACT GGTGACGACA GAGATGTCTC TACTTCGAGC GAAACAGATC ACTCGGCAAA TGTCAACACA TCGATCGCGG AGGGTGCACC GAGCGAAACG ACGTCGAACT CTGATGAGGA TGACAACGCC GCACAGGAGA GAGCGACGAA GATTGATGCG TCGACAACGT CTCGTCCAAA TGCGACCGCC GCGGGCGCGA CCGCCGTGAA CGCGACGAAC TCGAACGCTA CGGGCGTCGC GACGGCTAAA CCCAAGATGG CGACATCGAC GAATGAACTC GCCAAGGGTG GCGGCGATGC GAACGTGTTT CGGTTGCTGG CGCAGAAGAT CAAGGATTTA GAGCTCAACC AATCGCTTTT GTCGCGGTAC GTGGAGTCGC TCAACGTGCG ATACGGCGAA ACGTTGGAAG ACTTCGGGAA AGAGATTGAC GAGATTGAAG AATCAGTGTC GAATTCCACT GGCAAGCTCG ACGAAGCCAG TCGCCAAGCG CGAGCGAGCT CGAAAGCGTG CGATGACGCC GTCGCGCGCG TCAACGATAG CTCCGAAAAG CTCGTCGCCG CAGCCGTGTC TGAGTTGGAC GCGTATCGCA CGACTGTCGC GAAGCGGGAC ACCGTTCTCG CACTCGCGCT CGCGCTCACA GCAGGCGCGC TCGTGGCGTC GCGCAGATCG TCTGGTGCGA TCGAACGCGT CTTGAGCGCG CTCTCATCAT TCGCTTTGCT CGTCATCGTC GTGGCGAACA TCGTCCTCAT AGCACAAAAT TTCTTGTTAA AGTCGATGTA A
|
Protein sequence | MCRCFPTWSR GVGAAEEARR GESDVKKTKK EPRAPLLSLK EYKENMEEKM AQKQREKEAK QREKEEKERK RKKEEDEALR RLVEVNVTHG AVSENEDAET KGETLEPNST ETTTVDEEPA PSEVSIEVEG GQQQAETMDG ASTAETGELA RDAVSETSQA APAPVEPPVM EEEDTDGEAT FAELVIKPER LTEADAEMYN YAASFNGAKV VASDKDSKHA SAALKEDKDV YYISPCASEK FVTVELSEEV TVTSLVLGNF EFHSSRVKDF EVWGTDGHHA IEEGWKRLMI GRADNTQNYQ KFAVPSPAWV RYVQIRMTGH HDQQHFCTLS LLRIHGKDAK ETLKEEMERL QAEVQEVESL LSDEDEDEDE DEDVDVRESS AEVVLDVEEQ NREETNASAV VGEENERAST GDDRDVSTSS ETDHSANVNT SIAEGAPSET TSNSDEDDNA AQERATKIDA STTSRPNATA AGATAVNATN SNATGVATAK PKMATSTNEL AKGGGDANVF RLLAQKIKDL ELNQSLLSRY VESLNVRYGE TLEDFGKEID EIEESVSNST GKLDEASRQA RASSKACDDA VARVNDSSEK LVAAAVSELD AYRTTVAKRD TVLALALALT AGALVASRRS SGAIERVLSA LSSFALLVIV VANIVLIAQN FLLKSM
|
| |