Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_46799 |
Symbol | |
ID | 5004140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 110121 |
End bp | 111239 |
Gene Length | 1119 bp |
Protein Length | 311 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419561 |
Product | predicted protein |
Protein accession | XP_001419904 |
Protein GI | 145351058 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.970391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACGCGACCCT ACGCCGCGGC CGCGGACGCC GCGCGCGCGA CGGCGCCGCG GCGCGAGCTC GGCGCGGTGT ACAACGATTG GACCAAGGAC GAGGTGCGGG CGCTGTACGC GCGACCGCTG CTGGAGCTGG TGTTCGACGC GGCGAAGACG CATCGCATGC ACCACGACCC GAGGCAGGTG CAGCAGTGCA CGCTGCTGAG CATCAAGACG GGAGGGTGCC CGGAGACGTG TAATTACTGC GCGCAGAGCT CGTCGTGGAA GGGGGAGACG AAATTGAAGG CGGAAAAACT GATGGGCGTC GAAGAGGTGA TCGAGGCGGC GAAGAGGGCG AAGGAGGCGG GGAGCACGAG GTTTTGCATG GGGACGGCGT GGCGAGGGCC GAGTCAAGTC GGCGCGGGAC AGTTTGAACG CGTCCTGGAG ATGACGAAGG AGGTGAGGGA CATGGGGATG GAGGTGTGCG CGACGCTGGG GATGTTGACC CCGGAACAAG CGCTGAAGCT CAAGGATGCG GGGTTGACGG CGTATAATCA TAACTTGGAT ACGAGTCCGG AATATTACGA CAAGGTGACC TCGAGCCGCA AGTACGAGGA TCGGTTGAAC ACGATCGCCG CCGTGCGCGA GGCTGGTATT TCCGTCTGCT GCGGTGGAAT TCTTGGTTTG GGCGAAGAGG AGTCGGATCG GGCGAGTCTG ATGACGGTGT TGGCGACGCT TCCCGAGCAT CCGGAGAGCG TTCCCATCAA CGCGCTGGTG CCCGTGGAGG GAACGCCGTT CAAGGACATG ACTCCGCCCA GCGGTCTCGA GATGGTGCGC GCCATCGCCG TGGCGCGCAT TTTAATGCCC GCCACCGTCG TTCGATTGAG CGCCGGGCGT GTGAACATGA GCCCGGAGAC GCAAGCGTTG TGCTTCATGG CTGGCGCCAA CAGCGTCTTC ACGGGCGATA AACTCTTGAC CACGCCGAAT AACGAAAAGA GCGAAGATTC TTTCTTGTTC GAAGAGCTCG GTCTCGAGGG CCGTCCGGCT TTCGTGCCGT ACGCAGCGGG CGCGGCTTCG AGCGATGGAA GCGAGTGGAA ACACATGAAA CACGAGTTGT AATTTTCTGA ATTTCTATCT GAGAGTTCA
|
Protein sequence | MHHDPRQVQQ CTLLSIKTGG CPETCNYCAQ SSSWKGETKL KAEKLMGVEE VIEAAKRAKE AGSTRFCMGT AWRGPSQVGA GQFERVLEMT KEVRDMGMEV CATLGMLTPE QALKLKDAGL TAYNHNLDTS PEYYDKVTSS RKYEDRLNTI AAVREAGISV CCGGILGLGE EESDRASLMT VLATLPEHPE SVPINALVPV EGTPFKDMTP PSGLEMVRAI AVARILMPAT VVRLSAGRVN MSPETQALCF MAGANSVFTG DKLLTTPNNE KSEDSFLFEE LGLEGRPAFV PYAAGAASSD GSEWKHMKHE L
|
| |