Gene Information |
Locus tag | OSTLU_30877 |
Symbol | |
ID | 5001011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 907823 |
End bp | 909193 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416432 |
Product | predicted protein |
Protein accession | XP_001417091 |
Protein GI | 145345164 |
COG category | |
COG ID | |
TIGRFAM ID | |
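The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene length includes a terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS optionally ends in one stop codon that is not translated.
    """
    coding_bp = gene_len_bp - 3 if has_stop_codon else gene_len_bp
    if coding_bp < 0 or coding_bp % 3 != 0:
        raise ValueError("coding length is not a whole number of codons")
    return coding_bp // 3


# Values taken from the record for OSTLU_30877.
length_bp = gene_length(907823, 909193)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

The strand field does not enter this calculation: a feature on the minus strand spans the same number of bases as one on the plus strand.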
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.242987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACG CCAAGGTGCA CGCGATGATG GACGACATCG CGGGATGGAA GCGATCGACG GGGAAGATGC GACGCTTGCT GCGACAGATG CGCGCGCTTC GAGCGGCGAT CGCGGGAGAG GTGGAGACGA CGTGGGGGAA GCTGACGCAC AGGACGCGCG GGACGTTTCT GAGGGAGAGC CTGCCGCGGC TGTGCGAGTT TAAGGAGCGA GAGGCGCAGA TCGCGGCGTT CGGGCTGGTG CAGACGCTGG CGCTGCACGA TGATTGCGCG ACGGCGCTGG CGACGAAGGA GATGTTTAAA CTGTGCGTGG CGACGATCAA GGGGAAAGAT AAAGAGCGCG CGGTGGCGGC GAGCGGGGCG CTGACGACGC TGGTGAATCA CGACGATACG CGGTTTTTAG CGAGCGACGA GGGGCTGGAT AAGAGCATGA CGGCGCTGAT CACGGAAAAG GGGTTAGGGG TGAGGGTTAA AAGAAATTGC GTGGTGACGT TCGCGAGAAT CGCGGATGAT CCCGAGGTGG CGTCGCTGAT GAGCGCGAAG GCGCCGGAGC AATTGATTAA AAACTTTCTC GACTTCGTCG ATAAGACGGA CGACACGGAC ACGGAGAAGT GGGCGCTCAT CGCCATCGCG CGTTTGGCGA TGAACGACGA ATTTAGTAAT TTGATGGAGA AAAAAGGTTA CGTGCCTTTT TTGTTCGAAC TCTCGAGAGA TAAGATTCCG GCTCGGAAGC TGGCGGCGGC GCTCGTCATC GCGCACATGG CGCGCAACAA GGATTTGCGC GAGACGCTCG TCAAGTATCG CGCCATTCAG TTGTTTTGCA CGATTGCGAT GAACACGTCG GAACGAATCG ATATGGCGGA GATGCAATTA GTCGCCGCGC TCGGGTTGAA AAATTTGGCG TCGAATTTCG ATTTGCGCGC GCTCGCCGGA AAAACGGGCG CCATTCAGGC GTGCATCTTC ATGTTGCGCA GTCCGCAGCA GGAAGTAAAG CGGTTCGCCG CGCTCGCGAT CGCAGAATTA GCGCTGTACG AGCCAAACGG TGAGCGCTTT TGCAAGCAAG GGGCGTTAAA ATGGATCATT CAGCTCGCTC GGACCGGGGA CGTGCGCTCG GAAACCGCCG CCATCACCGC GTTATCCAAC TTGATGTTAT CGCCCGGAAA TCAGTCCATC ATGATTGTCG AGGACGGCAC TAAGGTGGTT GATTATTTGC AAAACTCGCG CAACCCTCGC GTGGCGCACC TCGCCAAGCA GCTTTTGAAG CGTTTGCGCA TGGCAAAGCT CCGCGCGGCG TGCAAGTTCG CCGCGCGAAT GAAAGCGACT GGGAACGCAC TCATCGACGC AGGCATCGAA ATCGGCGAAG GTTACGAGTA G
|
Protein sequence | MTDAKVHAMM DDIAGWKRST GKMRRLLRQM RALRAAIAGE VETTWGKLTH RTRGTFLRES LPRLCEFKER EAQIAAFGLV QTLALHDDCA TALATKEMFK LCVATIKGKD KERAVAASGA LTTLVNHDDT RFLASDEGLD KSMTALITEK GLGVRVKRNC VVTFARIADD PEVASLMSAK APEQLIKNFL DFVDKTDDTD TEKWALIAIA RLAMNDEFSN LMEKKGYVPF LFELSRDKIP ARKLAAALVI AHMARNKDLR ETLVKYRAIQ LFCTIAMNTS ERIDMAEMQL VAALGLKNLA SNFDLRALAG KTGAIQACIF MLRSPQQEVK RFAALAIAEL ALYEPNGERF CKQGALKWII QLARTGDVRS ETAAITALSN LMLSPGNQSI MIVEDGTKVV DYLQNSRNPR VAHLAKQLLK RLRMAKLRAA CKFAARMKAT GNALIDAGIE IGEGYE
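The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the database's own method. The Translation table field in the record is empty, so the standard genetic code is assumed here. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumption: the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order for each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
prefix = clean_sequence("ATGACGGACG CCAAGGTGCA")
print(translate(prefix))  # MTDAKV
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared with the GC content and Protein sequence fields of the record.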
|