Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18354 |
Symbol | |
ID | 5005696 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 249852 |
End bp | 252651 |
Gene Length | 2800 bp |
Protein Length | 855 aa |
Translation table | |
GC content | 58% |
IMG OID | 640421117 |
Product | predicted protein |
Protein accession | XP_001421608 |
Protein GI | 145354684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.683916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0555178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTA TGAAGCGCGT CGGCGGCGTG GATTTAGTGG TGAATTTTGC ACAGCAAGTG GCGGCAGACG ACGAGAAATT TGATGCACTT ATTCGTGAGG GCGCGTTTGA GGTGTTCACG CACGCGTTGC TCGCAGATGA GGACGAGGTC ACCGTTCGCG GGCTCATCGG ACTCGCTTGC GCGCTGCCGA GACGACGGGC GCTGCGCGTC AAACTCGCTG AAGACGGCGA GTGCGTGCGA CGGCTCGCGA CCCTGATGGG ATCATCCACG GATGAAAGTC TCAAGGGCTT CGCCGGGGGT TTGTTCCGAG CCTTAGCGCT CGATCCCGAG ACGAAAGGTT TAGTCGAGAA GGCGCTGCGA GAAGGCGGTG CCGGCGTGCT GCAAACAGCT TGATAAACAT CGATCCCGGC TCGCCGCCGC CCGATACCGC TCGTATTTGA TGTAGTATTC ATTTGCCATT GTCTAGGATG TTTCCCGTCG TCGCGCGCGC GTCATCGCCT CACTTCGTCT CTCTCTCGCG GCTTCACTTC ACTTCGCGCC TCGCGCTCCG CGCTCGATGC CTCCTCGACG ATCGACCCGA TCGCGATCGC CCTCGAAAGA GGCGCGAGAC GCGATCAACA ACGCCGCCGT CGCGTCATCC GCGCGCGCGC CCGCGGAAAT GCGTCAAGAC GACGCGCAGA ATGAAAAAGA AGGCGGCGGC GCGGTGGAAT CCATGTCAAT CGAAGCGACG AATGCGATGC GTTTAAAGCT CGGTTTAGCG CCCTTGCGCG AAGGAGCGTC GAAGAAAACC CATGACGCGG ACGCTTTACG TCGTGATGAG GCGAAAGCGG CGGAAACGGC GGCGCTGGCG GAGAAGATTG CTGCGAGGAA GCGTCAGAGG GAGATTGAAA AGTTGAACGC GGCGACTACA AAGCTCGGCG ACGCGGACGA TGAGGAGGAA GACGCGGGAG CTTGGTTGGC GAAGAGCAAA ACGAAGATGG CGACGAAAGC GCAGGAGTTG GAACGCGCGA AAGCGGCCAA AGTTGCGGCG ATGTTCGCGG AGAGGGATGA GGACGCGGAG GCGAGCGCGT CGGAGGAGGA GGACGAGGGG GCGAAGAAAT CAAAGTCGGC GGCGTACACG TCGAAGGATT TGCGCGGACT CAAGGTGAGG CACACCGCGG ATGAAATTAA TGAAGGTCAA GAGGTGGTGT TGACGCTGAA GGATACGAGC GTGTTGGATG ATGAGGATGA TGAATTAGAA AACGTCTTAA TCGCCGAGCG CAAGTCGCGT AAAAAGGCGA GAAAGGAGTC AACGAAGAAG AGTGACGACC CTTTCGGGGA GGGCAAGGAT GTGGAGGCCA AGAAAACGGT GTTAGGAAAG TACGACGCTC ATGATGAGGA TGCGGCAATG GAACTCGACG GTGAGGGTGG CCTCGACGCG GCTGAGGAGA AGCGAAAGGC AGAAATCAAG GCGCGACTCG CCGCCGAACT CTCGGGATTG AAGGGTAAAG CGGAAACCGC GGAAGTTGTC AAGGGTGAGC AAGCTGATTT TCACACCCAA GAAGAGATGC AAGCTAAGTT TGTGAAGCGC GAAAAGAAGA AGAAGATGCG CAAAAAATTG AGAAAGAAGC ATATCGATGC TGCGGAGCTC GAGCAAGACG CGCTGGCGCC AGAGTCGAGC GATCTCGGTT CACGACGCTC GCGCGGCGAG TCTAGTGCCG AAGCTAAGGC CACGACTAAT GAGAAAGACG CGAAGTTCGC GAATGCGTTG CAAAAAGCTC GTGAAGTGAC CGATAAGAAG ATTCTCGCTG AACTCGCCGG CGAAGCAGAG GAAGAGGAAG ACGACGAGCT CGCTCGTGCG CTCGCAAGGA GCCGGAAGAT GGCAAATATT TCGAACAAAG CAGCGCGCTC GGCGCCGGCA GACGTGGTCG CGCAAGTGGC GGCGCGTCGT CAAGCAGATG AGGCGAAGGC TCGAGAGAAC GCCGCGAATG TCGCCGACGA ATCTCTAGTT TTTACAGACA TGTCTGAGTT TGTTCAAGGC ATCAACACTC AAGACGGCGC GCTCGATGAG GTCTACGACG AAGACGCAGA CACTGAGGAA ATGCCAGACG TACCGCCGCC GCCACCACCG GGCGGTGAAG ACGAGCCGAT GGACGATGAA ATGCCAGATG TGCCGCCACC GCCACCGCAA GAGGAAGTCG ACGCGCACGT TAACATGCCC GTACTCGCGG AAAAGCACGT CGTGCAAAAA GGGTTGGCGA GCACACTCGC TCTTCTCAAA GACAAGGCTC AACTCGATGA TGCGCAAAAC ACGCGCTGGA GTGGGCGCGC CAACGACATG AAGGATCGAT TCGACAGACA GCACGTGCTC GAGGCGCACG CCATAGAAGA AAAAGCCGTC GACGGCTACA AGTTCGGTTT CAAGCTCGAT AAGTTGGACG AGTTCGGACG GAAGCTCACC CCGAAAGAGG CGTTTAGAGA GCTTTGCCAT CGTTTCCACG GCATCGAGCC TGGCAAAATG AAGCGCGAAA AGCGCCTGCG ACAATTCCAG GAGGAGCAAC AGCGTCTCAA GGCGTCAAGC GTCATGGACG ACCGCATCAA GGACGTACAG CGCGATCAAG CGACGCCTTA CGTCGTACTT AGCGGTCACG TGAGGGCTGG ACAGGCGAGA AACGCTGATC CGGTGGCGAC GATGAAGCGC GAACAGGAAT CCGCGCGCGC CACCCCCGCC GCCGCGTCGC GCGGTCCATC ATCGGCATCC GGTCTGAAGG CTTCCAACGC GACAAAAGTC TCGTTTGCGA TGAAACCATC GAAGAAGTAA
|
Protein sequence | MDFMKRVGGV DLVVNFAQQV AADDEKFDAL IREGAFEVFT HALLADEDEV TVRGLIGLAC ALPRRRALRV KLAEDGECVR RLATLMGSST DESLKGFAGG LFRALALDPE TKGLVEKALR EGEARDAINN AAVASSARAP AEMRQDDAQN EKEGGGAVES MSIEATNAMR LKLGLAPLRE GASKKTHDAD ALRRDEAKAA ETAALAEKIA ARKRQREIEK LNAATTKLGD ADDEEEDAGA WLAKSKTKMA TKAQELERAK AAKVAAMFAE RDEDAEASAS EEEDEGAKKS KSAAYTSKDL RGLKVRHTAD EINEGQEVVL TLKDTSVLDD EDDELENVLI AERKSRKKAR KESTKKSDDP FGEGKDVEAK KTVLGKYDAH DEDAAMELDG EGGLDAAEEK RKAEIKARLA AELSGLKGKA ETAEVVKGEQ ADFHTQEEMQ AKFVKREKKK KMRKKLRKKH IDAAELEQDA LAPESSDLGS RRSRGESSAE AKATTNEKDA KFANALQKAR EVTDKKILAE LAGEAEEEED DELARALARS RKMANISNKA ARSAPADVVA QVAARRQADE AKARENAANV ADESLVFTDM SEFVQGINTQ DGALDEVYDE DADTEEMPDV PPPPPPGGED EPMDDEMPDV PPPPPQEEVD AHVNMPVLAE KHVVQKGLAS TLALLKDKAQ LDDAQNTRWS GRANDMKDRF DRQHVLEAHA IEEKAVDGYK FGFKLDKLDE FGRKLTPKEA FRELCHRFHG IEPGKMKREK RLRQFQEEQQ RLKASSVMDD RIKDVQRDQA TPYVVLSGHV RAGQARNADP VATMKREQES ARATPAAASR GPSSASGLKA SNATKVSFAM KPSKK
|
| |