Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_46903 |
Symbol | |
ID | 5004387 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 404163 |
End bp | 406161 |
Gene Length | 1999 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419808 |
Product | predicted protein |
Protein accession | XP_001420328 |
Protein GI | 145351962 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.160153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTTTTCC CAAAATTGTC TGAAGTTCAC AAAGGGATTG AGGCGAAAAA AAACGCAAAG CTATGCTTCT TGTACGGCTC GCAGACCGGT AACGCGACGG AAATTTGCAA AAATCTCGCC GCCGAGGCGA GCGAGAAGGG GTACCCCGTG GAGGTGTGCG CGATGAATGA GGTCGAGCCA GAAGATGTCA TCAAACCTGG CGCTGTGATT ACGTTCGTCG TATCCAGCAC CGGCGACGGT GACGCTCCGG ATAACTGCGA CACATTCTTC ACTCGCCTCA AGCGCAAGGC AAAGAAGGAA AAAGGCGAGG GCGCCATCGG CGTCCAGTAC GCCGTCCTAG GACTGGGTGA TCAAAACTAC AGCGCGTTCA TGGCTGTGCC GCGTCAGTTC TCGCAAACGA TGGAGAACTT GGGCGCGAAG TGTTTCGCTA AGCGCGGTGA ATGCGACGAC ACACTTGGTT TGTACGAACA AGTTGACGCA TGGACGAGCA CGTTTTGGTC TCATCTCGAA GTCGCTCGAG GAAACTCACA CAAGTTGCGC GAGGGTGAGA CAATCGTGGA AGACGCTAAT GCCGCCACTG AAGCTCCAAA AGGCGATTCG AAGCCACCGC AAGCGGCTGC GCCGGCGAAG AAAGTCGAAG GCGTGCCACC ACTACCAATC TGCCGATCTG AAGTGCAGTG GCTGCCGAAA ACGACAGAAG TGGTTGCTAA TCGCGTCGCG CCCGGTCCAG ATTCCGAAGG TGCATACACC GTCTCCTCGC CGTACATGGC GACTATTCAC AAGCGTGAGG TACTCACAAA CTTGAAATCC GATCGGCGAG TGTTGCACAT GGAGTTTGAT CTCGGATCAA GTGGGATCTC TTACAAGCCG GGAGATTCCA TTGGCATCGT ACCGCAGAAC GACGCCGAGC TGGTTCGAGC CATCGTCGAT CGGCTGGGTC TCGATCAAGC CGCCATTTTC ACGCTGAATT GGAAGAAAGG TGACACGAAT GAACACGCGA CTCATCCGTT GCCGCATATT CACACCCCGT GCACGGTAAA GAGTGTGTTC ACGAATTATA TCGACATCAC GGGATGTCCG CGTAAGTCGC TCCTTCGCGT TCTCGCCGAA CACTGCGGCA ACGCGGAGGA AAAAGACGCC CTCTTGCACC TTTCATCGCG TGGTGGTCGA GCAGAATACG AAACACAGAT TCGCGCGCAG TCACCGACGC TATTGACGCT CTTGAACAAT TATCCGAGTT GTTGTCCTCC GTTAGCCGAG CTTTTAGATG CCCTCTCTCC GCTCGCGCCG AGACTGTATT CGATCACATG CGCGCCCGAG GTGGCACCAA CAACGCCGTC GGTTGCCTTC AGTGTCGTGC GGTTCCAGGT ACCCAGCGGT GAACATCGCC TCGGCGTCGC CACGAACTGG CTCGACGAGA TATCGGTCGA CGACAAGTGC GAGCACAAAG TCCCGGTGTA CATCAAGCCA AGCTTGAAGT TCGGCCTGCC CGAAGACTCG AGCGCGCCGC TAGTGATGAT TGGTCCGGGT ACCGGCGTCG CACCATTCCG CGGTTTCCTC CAGTCCAGAC GCGCTAAAGC ACAAAAAGGC GGTCGATTAT CTGAAGCTAT GTTGTTCTTC GGCTGTCGCA AGGCAGATGA GGACTTCCTG TACGAAGCCG ACTGGAAGAG TTTCACCGCC GATGGCTCGC TGACAAAGCT CGTCTGCGCC TTTAGCCGCG AGACCGCAGA AAAGGTATAC GTGCAGCACA AAATTGAAGA GCACGCCACC GAAGTCGCGC GCTTAATCTC TGAGGGCGCT TACGTCATGG TATGCGGCGA CGGCGCGCAC ATGGCCAAGG ATGTCCACGC CGCCCTCGTC CGCGTCGTCG CTCAAGCCGG CGTCTGCGGC GTCTCTGACG TCAAAGCCGC CGAGGCTCTC CTCGCCGACT TCACTAAATC CGGTCGTTAC GTCCGTGATA TTTGGAGCTA ACGCACGCGC GTTCGTATTG TTTAGATTT
|
Protein sequence | MNEVEPEDVI KPGAVITFVV SSTGDGDAPD NCDTFFTRLK RKAKKEKGEG AIGVQYAVLG LGDQNYSAFM AVPRQFSQTM ENLGAKCFAK RGECDDTLGL YEQVDAWTST FWSHLEVARG NSHKLREGET IWLPKTTEVV ANRVAPGPDS EGAYTVSSPY MATIHKREVL TNLKSDRRVL HMEFDLGSSG ISYKPGDSIG IVPQNDAELV RAIVDRLGLD QAAIFTLNWK KGDTNEHATH PLPHIHTPCT VKSVFTNYID ITGCPRKSLL RVLAEHCGNA EEKDALLHLS SRGGRAEYET QIRAQSPTLL TLLNNYPSCC PPLAELLDAL SPLAPRLYSI TCAPEVAPTT PSVAFSVVRF QVPSGEHRLG VATNWLDEIS VDDKCEHKVP VYIKPSLKFG LPEDSSAPLV MIGPGTGVAP FRGFLQSRRA KAQKGGRLSE AMLFFGCRKA DEDFLYEADW KSFTADGSLT KLVCAFSRET AEKVYVQHKI EEHATEVARL ISEGAYVMVC GDGAHMAKDV HAALVRVVAQ AGVCGVSDVK AAEALLADFT KSGRYVRDIW S
|
| |