Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50969 |
Symbol | |
ID | 5004775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 324889 |
End bp | 327129 |
Gene Length | 2241 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420196 |
Product | predicted protein |
Protein accession | XP_001420817 |
Protein GI | 145352993 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0627207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000441065 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATAA AAACCATCTC GCGCGTCGAG GAGGACTACA CGCGCGAGCG AAAGTCGGAC GCGCTCAGGG TGCATCGAAA TCTCGCGCCC GAGCTGCGGC CGATGGGACG GGCGACGGAG TACAAGCGCG CGCTGAACGC GACGAAGCTG GACAAGGTGT TCGCGAAGCC GTTCGCGGGA CAGATGAGCG GACACGCGGA CGGCGTGCTG TGCATGGCGA AGTCGCCGGC GTCGCTGACG GAATTGGTGA GCGGCGCGGC GGATGGAGAG ATACGAGTGT GGGACGTGCC GAGCCTGAAG ACGGTGCGGG TGCTGAAGGG ACATCGAGGG GCGTGCCGAG GCGTGAGCGC GTCGAACGAC GGCGGCGCGG TGGTGTCGTG CGGCGACGAC GCGACGATTC GGTTGTGGAC GATGCCGAAG GCGGGAATGG GGGAGATGAA CGATCCGACG CGGAAGATTC CGGTGTTGGA GACGTCGGAG ATGTACGTCG AGAGCAACGG TTTTAGGGAC TGCGACGCGC ACTGGGGGAA AAAGGAGTTC GCCACCGCGG GGGCGAACGT GCAGGTGTGG AGCATGGAAC GGAGTCATGC GCTGCATACG TTCGAGTGGG GTTCCGATAC GGTGCTTTCA GTGCGATATA ATCCGGTGGA GACGGATATT TTTGCGTCGT GTGGGTCGGA TCGATCCATC GCGTTGTACG ACGTTCGAAT GCAGACGCCG TTGAAGAAGA TCATCATGCA GACAAAGTCG ACCAAACTGT GCTGGAATCC GATGGAGGCG TTTAATTTCA CCGTTGCCAA CGAGGATACC AACTTATACT CGTACGACAT GCGAAAGCTG GATATCGCGA CGTGCGTTCA TAAGGATTTC GTGAGCGCCG TGATGGATAT CGATTACTCG CCCACGGGTC GGGAATTCGT GGCGGGGAGT TATGACAGAA CCGTGCGCAT GTTTGATTAC AACGCTGGAC ACTCTAAAGA TTGCTACCAC ACCAAACGCA TGCAGCGCGT GTTCTGTACG CGCTTTTCGA TGGATGGTTC GTACGTCTTC AGCGCCTCGG ACGACATGAA CGTGAGGTGT TGGAAGGCGG ACGCGAGCGC GCAAATGGGC ACGCTCTCGG CTCGTGAAAA GCGCAAGCAC GCGTATAACG CGTCTTTAAA GGACCGTTTT AAGCACATGC CCGAAATCAG GCGCATTGCG AATCACCATC ACGTACCCAA GGCTATTCAC AAGCAAACCA AGTTGCGGCG GACGATGCAA GAAGCCGAGA CTCGCAAGGC GAAACGTCGC GTCGCGCACG CCGCGCCTGG CGCGGAGAAG AAGGAATTCA AACCCGCGCG CAAAAAGAAG ATCCTCGCAG AAGTGGAGTA GGGGACGATA TTAGCTTGCC GCACGAAACG AACCTATGTA ACTGGTGTAC AAACAATGAT TTTACTGCTC ACTTGCGAAG AGCGTCGAGG CGCGCCTGTA AATCGTCGTC AAGCCCGCCA CCCCCGGTCG CGGGCTCAGC CGAACCACCC TCTAGGACCG CGGTTGGCGC CACTCGTTCG GCATCGACGG CGGCGCCAAC CTTACCCGCG GGCGCCGAGA TTAATTCCGC CCCGACGTTG CATCCCAGTT CGTCCAACAC CGCGTTCACG AGTTCATCCG TCTCCTCCTC TTCATCCTCA CCCTCGAACG CATCGTCGAT TGCGTCCCCC ATCACCTCTG TCGTCATCTC CATCTTTTCA TTCTGTCGCT CAAACTCCTT CAATATATTT TGCAACGAGG GTAAATTCAG TTTGGTGTTC ATCGACTTCA TCGCCGTCGT CACGCCTCGC ATCGCGTCCG CCATAGCCTG CGAGCTCTTC AACGTTTGCA TTCGCAGCGA CACCCCTTGC AACTGGGATT TAAGCGCATA GAACTTCGTT ATCGAGTGTC TCGTGCGCAC TAAATCCTTC GCCATCACCT ACATCGCGCG ACGCACGTTC GTTTCGTTCC GTCAGTCGCC GTGGTCAATT CCGACCGCCG CGTCGCGCTT CCACCTTCAT CCCCTCCAAC GCATCATCGC GCCGACACCG CGCGTTTTCG CCCTCGCGCA CGATCGTCCA CGCGCGCACC TTCACCGCGC CCATCTGATT CGCCTTGGCC ACGCGCTTAA TCTCCGCTAT GAGCTTCTTT TCTTGCGACA TCATGGCTGA CCGTTCGCGA TCGATTTCTC GAATGGATCT GTCGAGCATG CGCTTGTTCT CGCGCAACAG C
|
Protein sequence | MKIKTISRVE EDYTRERKSD ALRVHRNLAP ELRPMGRATE YKRALNATKL DKVFAKPFAG QMSGHADGVL CMAKSPASLT ELVSGAADGE IRVWDVPSLK TVRVLKGHRG ACRGVSASND GGAVVSCGDD ATIRLWTMPK AGMGEMNDPT RKIPVLETSE MYVESNGFRD CDAHWGKKEF ATAGANVQVW SMERSHALHT FEWGSDTVLS VRYNPVETDI FASCGSDRSI ALYDVRMQTP LKKIIMQTKS TKLCWNPMEA FNFTVANEDT NLYSYDMRKL DIATCVHKDF VSAVMDIDYS PTGREFVAGS YDRTVRMFDY NAGHSKDCYH TKRMQRVFCT RFSMDGSYVF SASDDMNVRC WKADASAQMG TLSAREKRKH AYNASLKDRF KHMPEIRRIA NHHHVPKAIH KQTKLRRTMQ EAETRKAKRR VAHAAPGAEK KEFKPARKKK ILAEVE
|
| |