Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19906 |
Symbol | |
ID | 5006723 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 146764 |
End bp | 148631 |
Gene Length | 1868 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 61% |
IMG OID | 640422144 |
Product | predicted protein |
Protein accession | XP_001422502 |
Protein GI | 145356572 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.722622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.566407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCGA CCGATGGGAC GAGCGCGCGC GAGGCGCGTC GAAAGGCGCG CGAGTTGGAG GAGGGTCGAA AGGTGCGCGC GCGCTCGCGC GCCGAGCGCG TCGACGCGTC GCGCGAGCGA CGCGCGGACG ACGCGCGAGG ACGATTTCGT CGCGCGCCGC CCGCCATTCC TCGCGACATT CGTTTCGTCG CGCCTCGACG CGACGCGACG CGCGCGACGG CGGTGGACTG ACGGCGACGA CGCGTTCGCG CGCGCTCCAG GCTGGGTTGA TCCCGCACGA GATCGATGAG GACGGGAACG CGATCAACCC GCACATTCCT CAATTCATGG CGGCGGCGCC GTGGTACCTG AAGCAAGACG GGCCGGGATT GAAACATCAA AAGGCGCCGA AAAAAGCCGA AGAGAGCGCG GAGTGGTACA AGAGAGGCGT GACGACGACG AAGGCGACGA AATTTCGTAA GGGGGCGTGC GAGAATTGCG GAGCGATGAC GCACAAGAAG AAGGATTGCA TGGAGCGGCC GCGCGCGAGA GGCGCGAGCA AGACGCAAAA GGACATCGCG GCGGACGAGT ACGTGCAACC GGAGTTGAAG CTTGGGTTTG AGAGTAAGCG CGATCGGTAT AACGGATTCG ATGTGGATGA TTACGTCAAG GTGGTGGAGC GATACGAGGC GGCGGACGCG ATGAAGCAAA AATTGGCCAA GCAAAAAGAG TTGGAACGCG CGTTTCGGCG GGCGAATAAA AAGGAGGACG ACGCGGCGAG CGACTCGGAT TCGGACGATA CGAGCTCCGA CGACGACGAC GACGACGACG CGAAGGTTGC GGATAAGGCG GCGACGGGGT TTGCAAACAT CAAACGCGCG GTGCGCGCGC CCGGAGGGGG CGCTTCCGGC ACGGTGCGTA ACTTGCGTCT TCGCGAAGAC ACGGCGAAAT ATTTGCGCAA CCTGGATGTG GATTCGGCGT ACTACGACCC AAAGACGCGC TCGATGCGCG AGAATCCGAC GCCGAACGCC GATCCCAAAG ACAACTTCTT CCGCGGTGAT AACGCGGCGC GAAATGACGG GCAAGTGGTG GAGTTTGAGC GTTTGAATCG TCACGCATGG GAGCAGGCGG AAGCCGGCGG CGCGAGCGCC ATTCACATGC AAGGCGCGCC GTCGCAAGCC GAGGCGCTGT ACAAGCAATT CAAAGAAAAG AAGGAAAAGC TCGCGGGAAT GAATAAAAAG AACATCATGG AAAAGTACGG CGACGCGAGC GCGGGCAAAG AGCTTCCCGA CGGTTTGGCG CTCGGTCAAA CGGAGCAATA CGTCGAGTAC GACCGCGCGG GCCGTCTCAT CAAGGGAACC GAAAAAGCCA CGGTGAAGAG TTGTTACGAG GAGGATGTCC TTTTGCAAAA TCACACCAAG GTTTGGGGCT CGTACTGGAA CGCCGGTCAG TGGGGTTACG CGTGTTGTCA AAGCATGGTG AAGAACTCGT ATTGCACGGG CGAGCGCGGC GTCGAAGCCG CGCTCGCGAG CGAGCAACTC ATGGTGGACA ACATGGAGAA CAAGCGCGCG ATGGACGAGG CGAACGAAGC GCGAGCGAAG TCGCAGCTCA ACGCGACGAC GAAACCGAGC GATCTGTGGG GTGGTGATGT CAAGGATGAC GTCGAGATCG ATCCTCAAAA GCTCCTCGAA GCCTTGAAGC GGCAAGACGA ACGCGAGGAA GCGCTCAAGC GCGGCGGCGA CGGGAAGAAC AAGCGCGGGT ACAACGTCAC GCACGATTCG CAAGTCACGG CGGAAGACAT GGAGGCGTAT AGGATGAAAA AGCGCGCATT CGAGGACCCG ATGAAAAAAG CGTCGGGCGC GGGGACCGAT GGTTACGATC TAGTGTAG
|
Protein sequence | MGATDGTSAR EARRKARELE EGRKAGLIPH EIDEDGNAIN PHIPQFMAAA PWYLKQDGPG LKHQKAPKKA EESAEWYKRG VTTTKATKFR KGACENCGAM THKKKDCMER PRARGASKTQ KDIAADEYVQ PELKLGFESK RDRYNGFDVD DYVKVVERYE AADAMKQKLA KQKELERAFR RANKKEDDAA SDSDSDDTSS DDDDDDDAKV ADKAATGFAN IKRAVRAPGG GASGTVRNLR LREDTAKYLR NLDVDSAYYD PKTRSMRENP TPNADPKDNF FRGDNAARND GQVVEFERLN RHAWEQAEAG GASAIHMQGA PSQAEALYKQ FKEKKEKLAG MNKKNIMEKY GDASAGKELP DGLALGQTEQ YVEYDRAGRL IKGTEKATVK SCYEEDVLLQ NHTKVWGSYW NAGQWGYACC QSMVKNSYCT GERGVEAALA SEQLMVDNME NKRAMDEANE ARAKSQLNAT TKPSDLWGGD VKDDVEIDPQ KLLEALKRQD EREEALKRGG DGKNKRGYNV THDSQVTAED MEAYRMKKRA FEDPMKKASG AGTDGYDLV
|
| |