Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31219 |
Symbol | |
ID | 5001255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 511162 |
End bp | 512668 |
Gene Length | 1507 bp |
Protein Length | 493 aa |
Translation table | |
GC content | 63% |
IMG OID | 640416676 |
Product | predicted protein |
Protein accession | XP_001417266 |
Protein GI | 145345542 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.00425878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCGACGCG CGCGCCGACG ATGGGCAAGA CGCAGAAGCG CGCGCGAAAG AAATCGTCGC AAGCGCCCGG CGTGCGCGAC GGCAGCGCGA TCGCGAAGCC GACGCACGCG ACGGCGGACG CGACGACTGA TGTTCAAGTC GCGGTGGCGG CGGCGGCGAT GGCGGTGGAC GACGCGCGCG ACGACGGCGC GCGATCGAGA GACGTGAGCG AAGGGACTAT TTTACGTGCC CTGCTCGCGC AGTCGTGCGT GGACGCGTGC AAGGCGCTGG ATGCGGCTCT GGCGCGCGGC GTCGCGGTGC GGGCGGATAC GTGCGCGAGA GTGCTGGCGT TGTGTCAGGC GCGAGAGAAG GCGAAACCGG CGCTGCGATT GCTGCAAAGA ATGGAAGCGA ACGGTATGGC GCCGTCGCGA GAGGCGATGC GGTGCGCGTT CTTCGCGTGC GCGAAGCGGG GAATGTTGCG CGAGGCGCTG GTGTTGATGT CGCGGCTCGA CGGCGAAGGA AAGCGTTTCT TAGGGAAGGA TGTCCTCGTG CGAGCGTGCG CGATGTGCCC AGGGGGCGTC GATGGGGATT TAGGTTTAGC ATTATTGGAA AGCGCGCTGC GCGGTATCAT GTGCGGGTCT TGGGAGCCGG GATCTACCGC GACGATTCGC GCGCAAATGC CGTTCGCGGT GCCGCGCACG AAGGACGACG ACGACATTCC CGCGCCATTG GTGCGTCGCG ATATTGAATA CGCGCCGGGT AGCTTTAAAA TGGTTTGCGA TAACGATGGA CGACGTCGGC CGGTGGATAG GTTGCCGCTT GAGCTGTACG CGCCCGCGCA CCCGGGCGTA ATCCCATTTG CGCCTTCAGG ATTGACAGAG GCGGCGCGTT ACGACGTGCC GCACGTTCCG GGGGCGTTCG TCGTGACGAA TCTTCTCACG AAAAGCGAAT GCTCAGCAAT CATAGCCGCC GGTCACGCGA TTGGGCTTCG TACCGATCCC GGCGACGTCG ACGGTGTCAC GGGCGCGTCT CGTTTGCAAT ACTGCGAGTT CATGGTTTGG CCGCAAAATA TTCAAGGTCT GTGGCGACGA ATCGCAGACT TAATGCCTCC CGGTGCTGTG GGTATTAACG CGCGCTGGAG GTTCTTCCGT TACGGTCCTG GTACGATTTA CCGCCGTCAC GTGGATGGGT CCTGGCCCGA GGGCGCACTC AACGAGGAAT GCGAATATGT CACCGACGTC TCCGATGGCA AAGTCCGCTC GAGGTTGACT TTTCTGATTT ATCTCACCGA AGGCTTCAAC GGTGGATCGA CGACGTTTTA CACCGCTACG CCGAGCGAGC CTGGAGTCTT GAGCGCGCGA GGTGTCGTGC CACAACTTGG CGCCGCGCTC TGCTTCCCGC ACGGCGAAGC CGAGGAGTCG CCCGTGCACG AAGGCAGCGC CGTGACCGCG AGCTTGAACG GAGGCGAAAC GTTCAAGTAC GTCATTCGCA CAGATGTGCT GTTCAATGTG TCGGATATCT AGACTAG
|
Protein sequence | MGKTQKRARK KSSQAPGVRD GSAIAKPTHA TADATTDVQV AVAAAAMAVD DARDDGARSR DVSEGTILRA LLAQSCVDAC KALDAALARG VAVRADTCAR VLALCQAREK AKPALRLLQR MEANGMAPSR EAMRCAFFAC AKRGMLREAL VLMSRLDGEG KRFLGKDVLV RACAMCPGGV DGDLGLALLE SALRGIMCGS WEPGSTATIR AQMPFAVPRT KDDDDIPAPL VRRDIEYAPG SFKMVCDNDG RRRPVDRLPL ELYAPAHPGV IPFAPSGLTE AARYDVPHVP GAFVVTNLLT KSECSAIIAA GHAIGLRTDP GDVDGVTGAS RLQYCEFMVW PQNIQGLWRR IADLMPPGAV GINARWRFFR YGPGTIYRRH VDGSWPEGAL NEECEYVTDV SDGKVRSRLT FLIYLTEGFN GGSTTFYTAT PSEPGVLSAR GVVPQLGAAL CFPHGEAEES PVHEGSAVTA SLNGGETFKY VIRTDVLFNV SDI
|
| |