Gene Information |
Locus tag | OSTLU_88888 |
Symbol | |
ID | 5005066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 44646 |
End bp | 45794 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420487 |
Product | predicted protein |
Protein accession | XP_001421036 |
Protein GI | 145353472 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0000824445 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0739348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCGCGG TCGCGGCGGA TGCGGAGGCG AAGGAAGCGG CGGAGCGCGC GAGAGTTGAG GAGGAAAAGC GGCGGCAAGA GGCGGAGGAG GCGGCGAGGA AAAAGGCGGA GGAGGACGCC GAGGCGGCGA AGAAGCGAGC GGAGGAAGAG GCGAAGGAAA AGGCGCGCGA AGAGGAAGCG AAGCGCAAGC AGGCGGCGAG CGATTACGTC AGTGGTAAAG CCACGGGAGA AATCCCGCGC GTCGCCGCGG CGGCGGAGGC GTTGGAGCAG GAAACGAACT TGGCCAAGAC GCTCGTCGAG GCGCGCGCGA TGGTGGCCGA GTATCAGTCG CATCCGACGG CGAAATTGGA ACGCCGGAAA CTCACGAACA CCATCGTGGT GCACGTGCAA CAAATCGCGG CGACGAAGGA GCAAATCAAT AAGAAGAGTC GAGACATCAT GATGCTGCTG GTGCAATTGC AGGAGCCTCA AAAGACGTTC GCGTTGATGA GCATCGCGAA AAAGATGCTC TCGCAGTGCG ACGTGCAGGT GGCCAAACTC AATCGCTACG CGTTTGCGCT CGCCGAAGTC GCGGTGAGTA TCGCGATCGA CGTACCGAGG TTTGGTGTCT TGCTCGTCGC CCTCATACAC GAGGTTTGCG TCAACGCGGT GCCGAAGTAT TACCCGTTTG TTCCGGGACG TTACGCCACC GACGACGAAT ACTACAGTCT CATGGGGTAC GTCAAAAACG ATGAAGGCAC GGCGTTCGAA ACCACGGATT CCTACGTCGA TCGCATGACG GGTAGCATGC TCTTTTACGC CGCGTTTTTA CAAGTCGACG CGCCGAATCA CCCACACGGC GTCGACGCCG CGTGGCGATG GCTCGCGCGT CTGTTGAACA GATGTCCGCC CAATCGCCAC ACCGCGGTGG CTCTGGACTC ATTCCTCAAA ATCGCCGGCT TTCGCATGTA CGCGGCGTAT CGCGGTCAGT TCGTCAAAGT CCTCGAACTC ATCCATCGAG AGTTTCTTCC AAAGTTGGAC GCCAAGAACG ATCCCGACAT TCGGCCCGTG TCGTCGCGCA TCGCGACGTA CCTACAGGAG AGTCTGTACA CGAAATCTCC CGAAGGCCGC GACATGCCCA ACACCGACAC CAGCTCGCAC ACGTTTTGA
Protein sequence | MRAVAADAEA KEAAERARVE EEKRRQEAEE AARKKAEEDA EAAKKRAEEE AKEKAREEEA KRKQAASDYV SGKATGEIPR VAAAAEALEQ ETNLAKTLVE ARAMVAEYQS HPTAKLERRK LTNTIVVHVQ QIAATKEQIN KKSRDIMMLL VQLQEPQKTF ALMSIAKKML SQCDVQVAKL NRYAFALAEV AVSIAIDVPR FGVLLVALIH EVCVNAVPKY YPFVPGRYAT DDEYYSLMGY VKNDEGTAFE TTDSYVDRMT GSMLFYAAFL QVDAPNHPHG VDAAWRWLAR LLNRCPPNRH TAVALDSFLK IAGFRMYAAY RGQFVKVLEL IHREFLPKLD AKNDPDIRPV SSRIATYLQE SLYTKSPEGR DMPNTDTSSH TF
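The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon, both of which are consistent with the values listed here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(44646, 45794)
    print(length, protein_length(length))

    # First two blocks of the gene sequence, used only as a short demonstration.
    prefix = clean_sequence("ATGCGCGCGG TCGCGGCGGA")
    print(len(prefix), gc_fraction(prefix))
```

With the coordinates in this record, the two length functions give 1149 bp and 382 residues, matching the Gene Length and Protein Length fields. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a similar comparison with the GC content field.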