Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_50765 |
Symbol | |
ID | 5004053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 159641 |
End bp | 161678 |
Gene Length | 2038 bp |
Protein Length | 633 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419474 |
Product | predicted protein |
Protein accession | XP_001419920 |
Protein GI | 145351091 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0523138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.66186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCTCGGTCG CGAGCGCGCG CGCGTCGACC TCCGGACGCG AGCGAACGCG CTCGAGCGCG CGGCGCGTCG AGCGACGCGA GGGCGCGTCG AGCGACGCGA GGACGCGGCG CGAGGGATCG CGGGACGACG ACGACGACGA TGGACGCGGC GCGAGGGACG GCGACGACGG CGACGACGGT GAGGCGCGCG GTGCGGGCGC GCGGAGGACG CGCGGTGCGC GAGACGCGCG GCGCGCGGGC GCGGGACGCG CGACGAGGGA CGCCGCGGGC GGTGGTGGAG CCGGAGGGGG CGTCGGCGGG CGACGCGTCG GCGTCGGCGT CGGCGGGCGA CGCGTTGAAG TGGCAGGCCG AGGACGCGGC GAGCCTGAGG GCGCATAACG AGGAAGCGAT CGCGAAATAT GCAAACTTTC CGGAGTTGGA TAAGCCGGAC GCGCACGTGG CGCGAGATGC GGATGGGTAC TACGTCGTGA AGGAGGAGTG GCGAAAACCG ACGAATCCCT TTGAAAAGTT GAAGCTCGCG AAGGATCCGA TGCGAGAGTT GATCGGGATG AACGGGATCG AGGAGATGGC CAAGGCGAGC GCGGCGGATT TCAAGGCTTG GGACGAGGCG TTGAATGACC CGGACGAGAC CGATCAACGA CCGAAGTGGG CGGGTTTGTT TCATCGACGC AAAGGACACT ACGGGCGATA CATGATGCGA CTCAAGCTTC CGAACGGACT CATTAACTCG ACGCAGATGC GATATTTGGC GAGCGTGATT AAAAAGTACG GCGAAGATGG GTGCTGCGAC ATCACGACGA GACAAAACAT TCAGCTTCGT GGGGTTGAGT TGAAGGATGC GCCCGAAATC TTGCGCAAAC TCGAAGAGCT CGGTATGTGC TCGTTGCAAA GCGGGTTGGA CAACGTGCGC AACGCGACGG GGAACCCGCT CGCGGGTTTT GATCCGCAAG AAATCGTCGA CACGCGACCT TACACGCTCG CCATTCAAGA TTACGTCACC GGTGGTGGGC GCGGGAATCC AGCCATCGCT AACTTGGGGC GTAAGTGGAA CGTGTGCGTC GTCGGCAGCA GCGACTTTTT CGAGCATCCG GACATCAACG ACTTGGCCTT CATTCCAGCG ATGAAGGATG GCAAGTTCGG ATTCAACATG ATCGTCGGTG GCTTCATATC TTCTCAGCGC GCAGCGGAGT CGGTCTCTCT CGATGCGTGG ATCCCTGAGA ACGAGCTTGT CGCCGCGACG CACGCCGTGT TGACGACGTT TCGCGATTAC GGCCATCGCG GCAACCGCCA AAAGTGCCGC ATGATGTGGC TCATCGACGA GATGGGTTTG GAAACGTTTC GCACCGAAGT TGCGTCGCGC ATGCCAACGG GAGACTTGGC GCGCGGCGCC GAAGTGGATC TCATCGACCG AGAGTCTCCG CGACGCAGTT ACATCGGCGT GCACGCGCAA AAGCAAGAGG GCTTGAGCTG GGTCGCCGCT GCCGTGCCGG GCGGACGTAT GCAACCGGAA GATCTCGCGG AGATGGCTGA TCTCGCGGAC AAGTACGGCG AAGGTGAGAT TCGACTCACC GTCGAGCAAA ACTTTATCAT TCCCCACGTG CCGAACGATA AGATTGACGC CATTCTCCAA GAGCGTTTGT TTCAAGAGTA CACGCCGTTC CCGGGCAAAC TTGTGTCCAA CATGGTGGCG TGCACCGGAA ATCAGTTCTG CGGATTCGCG CAAATCGAGA CGAAGCGACA AGCGCTCGAA ATGGCGGAAC ACTTGGAAAG TTGCTTAGAA CTTTCGAAGG ACGTGCGCAT GATTTGGACA GGTTGCCCGA ACTCTTGCGC TCCGGTGCAA GTGGCGGACA TTGGCCTCAT GGGCGCGCAG GTGAAGAATC CGACGGGCGA GAAGGGCATG GTACCAGGGG TGAACATCTT CATCGGTGGT ACCGTCGGGC CGAACGGCCA CTTGAAGGAA GCGCCAGAAA TCGCAAAGGT CCCGTGCTCA GAGTTGAAGC CGGTTCTGGA ACAGATTATG ATTGAGAGGT TTGGCGCG
|
Protein sequence | MDAARGTATT ATTVRRAVRA RGGRAVRETR GARARDARRG TPRAVVEPEG ASAGDASASA SAGDALKWQA EDAASLRAHN EEAIAKYANF PELDKPDAHV ARDADGYYVV KEEWRKPTNP FEKLKLAKDP MRELIGMNGI EEMAKASAAD FKAWDEALND PDETDQRPKW AGLFHRRKGH YGRYMMRLKL PNGLINSTQM RYLASVIKKY GEDGCCDITT RQNIQLRGVE LKDAPEILRK LEELGMCSLQ SGLDNVRNAT GNPLAGFDPQ EIVDTRPYTL AIQDYVTGGG RGNPAIANLG RKWNVCVVGS SDFFEHPDIN DLAFIPAMKD GKFGFNMIVG GFISSQRAAE SVSLDAWIPE NELVAATHAV LTTFRDYGHR GNRQKCRMMW LIDEMGLETF RTEVASRMPT GDLARGAEVD LIDRESPRRS YIGVHAQKQE GLSWVAAAVP GGRMQPEDLA EMADLADKYG EGEIRLTVEQ NFIIPHVPND KIDAILQERL FQEYTPFPGK LVSNMVACTG NQFCGFAQIE TKRQALEMAE HLESCLELSK DVRMIWTGCP NSCAPVQVAD IGLMGAQVKN PTGEKGMVPG VNIFIGGTVG PNGHLKEAPE IAKVPCSELK PVLEQIMIER FGA
|
| |