Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_44507 |
Symbol | |
ID | 4999510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 806318 |
End bp | 808168 |
Gene Length | 1851 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 59% |
IMG OID | 640414931 |
Product | predicted protein |
Protein accession | XP_001415936 |
Protein GI | 145341686 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGAGACGT TCGACGACGC GGGGAAAGTG CGTCTGAGCG TGTTTTACGG CACGCAGACG GGGACGAGCG AGCGGTACGC GCGGGAGGTG GTGGCGCACG CGCGCGCGCG GTACGGCGCC GCCGTGGCGG CGCGCGCGGT GGATCTCGAA ACCGTGGATT CCATGCACGC CGAGGACGCG CTCGTGGCGG AGACCGGGTG CTGTGTGTTT CTGCAATCGA CGTACGGGGA CGGGGAACCG ACGGATACGA GCTCGGATTT CGTGTATTGG GCGCGGGACT GCGCGAGCGA CGGGCGGATG CCGGATTTGT TGGAAAACGT AACGTTCAGC GTGTTCGGGC TCGGTAATCG TTGCTACGAA CAGTTCAACG CGGCGGCGAA GATGGTTCAC AAGGCTTTGG TGGATTTAGG CGCTCAACCG CTGCTGAAGC TTCACTTGGG AGACGACGAC CAGTGCTTGG AGCAAGACTT TGAAAATTGG ATCGAGGCGT TTTGGCCCGC GTTCGAGGCC AAGTTTGGTT TACATTCCGA CGGCGGCGAC GAGGAGTTGC CTCGTTACGA CGTCATCATT ATGCGTGGTA GCGACGGCGA GCGCGCAGCG GCGAGTCAGG CGGCCAAATA CGAAAAGGAG CACTCTAACA CGCGCGTCGC GCCGACGGCG GCAAAGCCGT ACCTCTCCTC CGTCAAAGTT GTGCGCGAAC TCTTCAGTAA AGATGCCGAT CGAAGTTGCG TTCACGTAGA GTTTGATATT TCAGGCTCGA GCGTGCAGTA CAAGACGGGC GACCACTTGG GCGTGTTTGC GGAGAACGGT AAGGACGTCA CGAAGCGCGT AGCCAAAGCG CTGAAGCTCG ACGTCGACGA GGTCTTCCGT TTGGTGAAGC CGAGTGGTGC TCCTGCGTCG CTCGCCGAGC CGTTCGCGAC GCCCATGACG GTCGGAGATG CGATTGCGCG GTACGCGGAC GTTTTAACGC CCCCTCGAAA GCAAGCGCTC GCTGCGCTCG CCTCCGTCGC TTCGGGCAAG GACGCAGAAA AGCTCGCCTT TTTAGCCTCG CCGGCGGGCA AGGATGAATT CGCAAAGTAC ATAACGAAGC CCCACAGATC GCTTTTGGAG GTGATGGAGG ACTACTCCAG CGCTGTTCCA GATATTGGAT TATTCTTTGG CGCCGTTGCG CCTCGTTTAG CGGCTCGATT TTACAGCATC AGCTCAAGTC CGGCGGCTAA CAAAAACGTC GTCACCGCCA CCGTCGCCGT CGTCAAAGAG AAAGTCTTCA CCGGTCGCAT GCACGAAGGT GTCGCGAGTA CATTCTTACA GCGCGCGGCT GAAGGCCAAA AGATTCCGAT TTTCGTGCGA ACGAGCACTT TCCGACTGCC GACAAATCCA GAAGCGCCTG TTATCATGAT TGGACCAGGC ACCGGCTACG CCCCTTTCCG AGGTTTCTTG CAAGAGCGCA CGGCGCTTCA AGCTTCTGGT GCCAAGTTAG GCCCGGCGAT GCTGTTCTTT GGATGCCGTA ACAAAGATCG CGATTTCATA TACGAAGCAG AGATGCAGAC CGCGTTGCGA GAGGGCGTGA TTACCGACCT GGACGTGGCG TTTAGCCGCG ACGGGCCGAA GAAGGTGTAC GTGCAGGACA AGATTATCGA AAAGGCGTCG ACGGTGTACC CAATCGTCAA AGGTACCGTG GGCAAAAACG AGGGCGCGGT GTTCATTTGC GGTGACGCGA AGAACATGGC AAAGGATGTG AACAAGGCGC TCCTGAGTGT GTTGATGCGC GAGGGCGACT ACGCGGCGCA CGAAGCGGAG GAGATTTTAC GTCGCCTGAA GGCAGAGTTT AGATACCATC AGGACGTGTG G
|
Protein sequence | MHAEDALVAE TGCCVFLQST YGDGEPTDTS SDFVYWARDC ASDGRMPDLL ENVTFSVFGL GNRCYEQFNA AAKMVHKALV DLGAQPLLKL HLGDDDQCLE QDFENWIEAF WPAFEAKFGL HSDGGDEELP RYDVIIMRGS DGERAAASQA AKYEKEHSNT RVAPTAAKPY LSSVKVVREL FSKDADRSCV HVEFDISGSS VQYKTGDHLG VFAENGKDVT KRVAKALKLD VDEVFRLVKP SGAPASLAEP FATPMTVGDA IARYADVLTP PRKQALAALA SVASGKDAEK LAFLASPAGK DEFAKYITKP HRSLLEVMED YSSAVPDIGL FFGAVAPRLA ARFYSISSSP AANKNVVTAT VAVVKEKVFT GRMHEGVAST FLQRAAEGQK IPIFVRTSTF RLPTNPEAPV IMIGPGTGYA PFRGFLQERT ALQASGAKLG PAMLFFGCRN KDRDFIYEAE MQTALREGVI TDLDVAFSRD GPKKVYVQDK IIEKASTVYP IVKGTVGKNE GAVFICGDAK NMAKDVNKAL LSVLMREGDY AAHEAEEILR RLKAEFRYHQ DVW
|
| |