Gene Information |
Locus tag | OSTLU_18877 |
Symbol | |
ID | 5006442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | + |
Start bp | 83284 |
End bp | 84630 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 62% |
IMG OID | 640421863 |
Product | predicted protein |
Protein accession | XP_001422384 |
Protein GI | 145356327 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
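The coordinate and length fields above are mutually consistent, which a short calculation can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 83284
end_bp = 84630

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1347 448
```

Under these assumptions the result matches the reported Gene Length (1347 bp) and Protein Length (448 aa).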
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 0.00280001 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00000163751 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGTCGG GAGATGCGAC GAAGATTTTT TTCACCGCGC TCGCGGCGCT CGCGCTCGTC GCGGCGCGCG CGGCGGCGCG CGCGCTGACG ACGCGCGAGG CGAAGAAACG CGTGGTGATC ATCGGTGGTG GATTCGCTGG GATGCAAGCG GCGCTGGATC TGCGGAAAAC GTGCGAGGTG ACGCTGATTG ATAAGAAGCG GTATTTCGAA TACGTCCCGG GGACGCCGGC GGCGCTCGCG GGCGCGGCGC CGTTGAGTCC GGCGTCGCGC GGAGACTCGG CGAAGAAACG CGAAAAGACG CTGACGGTGC CGTATAGCAA GGCGCTCGGA AAGAGCGTGG CGTTCGAATG CGCGGCGGGA CGAGACGTGC GGGTGATGAG CGCGTACGTG GACGTCGGCG GCGACCGCTT TGAATACGAC GAATTGATAC TCGCCACCGG GAGTCATTAT CCGGGCGTGT TGAAAGCGGA GTGCGACGGT GAGGCAGGAG AAGACAGAGA TGCGAGAATG CGAGAGATCG CCGAGGCGCG AGAGGACGTG ACGAAGGGGA AATCGGTCGT CGTCATCGGA GGCGGCGTCG TCGGCGTCGA GGTCGCGGGC GAACTCGCGG CGAGAAACGC CAAGATGAAA TCGGGTGCGC GCGTGATTCT CGTGCACAGC GGCCCAAGGT TGCTCGATAC GTTGCCCAAG TTTGTCAGTG AGTATGTGTC GAAAACGCTC GTGAGATTTG GGGTGGAGGT GTACGTCGGG CAAACGTACG ACCGCGTCGG CGACTCGTTC GTCGGGCGAA TGAACGAAAA CGTCATCGAG GGCGATAGGG TCATGATGTG CGTCGGCGCC AAGCCCGCGA CTGAGTTCTT GGATCGGGAG AGCGTGAACT TCAGCGAAGA TGACGATGAC TCGCCGCTAA ATTTTCCGCT CGACATGATT GGGCGCGTTC GAGTGGACGA GGCGACGCGT CAAGTCATCG GCTACGACAA CGTCTACGCC GTCGGCGACT GCGCGTGTAA ATTACCGGAC CAGATGCTCG CTTCGTACGC GCACTGGGAG GCCGAATACG TGTCAAAGCG AATCATGTGC GACGGCGACG CTCGCGCGCT CGGGAAACTC GGCAGATATC GTCTTCCGCC GCGGTTGATG GCGATTTCTC TGGGCCCGTT CGCGGGCGTC GTCGTCTGGG GCGACAGAAT TTTATGCCGC GGATGGTGCG CCGCCGTGTT CAAGGCGTTG ATCCAGTTTT GGTTCATTCG CTTCCTTCCC GCGCCGTATT CCATCATGAA GAGATTCCCT CGCATGCGCG TGAAACCGCC CGCGAGTGTG ATTTTGCGCA AACCGCTCGC GAATTAA
|
Protein sequence | MTSGDATKIF FTALAALALV AARAAARALT TREAKKRVVI IGGGFAGMQA ALDLRKTCEV TLIDKKRYFE YVPGTPAALA GAAPLSPASR GDSAKKREKT LTVPYSKALG KSVAFECAAG RDVRVMSAYV DVGGDRFEYD ELILATGSHY PGVLKAECDG EAGEDRDARM REIAEAREDV TKGKSVVVIG GGVVGVEVAG ELAARNAKMK SGARVILVHS GPRLLDTLPK FVSEYVSKTL VRFGVEVYVG QTYDRVGDSF VGRMNENVIE GDRVMMCVGA KPATEFLDRE SVNFSEDDDD SPLNFPLDMI GRVRVDEATR QVIGYDNVYA VGDCACKLPD QMLASYAHWE AEYVSKRIMC DGDARALGKL GRYRLPPRLM AISLGPFAGV VVWGDRILCR GWCAAVFKAL IQFWFIRFLP APYSIMKRFP RMRVKPPASV ILRKPLAN
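The GC content and the protein sequence can in principle be recomputed from the gene sequence. The sketch below is one hedged way to do that. It assumes the standard genetic code (the Translation table field in this record is blank, so that choice is an assumption) and that the spaces in the sequence are display formatting only. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the record does not state its translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record shows 10-base blocks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGACGTCGG GAGATGCGAC GAAGATTTTT"
print(translate(prefix))  # MTSGDATKIF
```

Passing the full Gene sequence value to `gc_content` and `translate` allows a comparison against the GC content and Protein sequence fields reported in this record.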