Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_119390 |
Symbol | Dfa1 |
ID | 5000314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 190681 |
End bp | 192597 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | |
GC content | 60% |
IMG OID | 640415735 |
Product | Diflavin flavoprotein A 1-like protein |
Protein accession | XP_001416098 |
Protein GI | 145342024 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.254321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTCG ACGCGGCGCG CGCGCGCGCG GCGACGCGCG CGGACGCGAT GGCGTCGAGG TGCCGACTCG GACGACGCCC GGAGGGTTCG CGCGGTGCGA CGACGCGCGC GCGGCGCGCG CGGCGCGCGA GATGGGTCGA GTGCGCGGTG ACGGACCCGC CGGTGACGCG CGAGGCGGTC AGGACGTCGG ACGTGGAGCT GTTCGAGGGG AAGAAGCGAT TGCAGACGCA GGCGACGGCG CTCGGCGAGG GCACCACGCT GATACGGTCG CTCGATTGGG ATCGCGATCG GTTCGACATC GAGTTTGGGT TGGAAAAAGG GACGACGTAT AACTCGTACA TCATTCGCGG CGCGCCCGGG ACGGCGGCGC TGATCGACGC GAGTCACGAA AAGTTTCGGG AGTTGTACAT GAAGACGCTG ACTGGAGAAA TCGACCCGCT GGAGATCAAG TACGTGGTGT GCTCGCACAC CGAACCCGAT CACTCGGGAT TGATCGGAGA CGTGCTGAAG ATTGCGCCGA ATGCGACGGT GCTCGGGAGT AAGGTGTGCT TGGCGTTCTT GGAGAATTTG ATTCACGAGC CGTTCGAGTC GAGGGTGGTC AAGGGGGGCG ACGTCGTGGA TTTGGGCGAC GGACACGAGC TGAAGTTTAT CATGGCGCCC AACTTGCATT GGCCGGATAC GATGTTCACA TACGATCCGA AGTCACGATT GATGTACACG TGCGACGCGT TCGGGTCGCA TTATTGCAGT GAGGACCCGT TCGACGGGGA CTTGAGCGCG TTGATGCCGC ACTATCGTTT CTACTACGAG TGTTTGATGA AACCGAACGC TCGATCGGTG CTCACGGCGC TTCGCAAGTG CGAGTCCGAA GGCGCAGATT TTGTGGGCAT TTGTAACGGC CACGGACCGT TGCTGCGGTA CAACGTGGAC GAACTCGTGG GGGATTACAA GAAATGGAGC GAGAGCGCGT TGGCCAAGGC CAAGGCAAAC GTGGCGGTAT TTTACACCGC CGAGTACGGT TTCAGCGATC GACTTTCGCA GTCCATCGCG CGCGGGTTGA CGAAGACCGA AACAGAGGTT GTCATGATGG ATTTATTGTC CGCAGATTCT CAAGAACTCG TCGAAACGAC GAAGCACGCG GCTGGCATCG TCTTGCTTTC GCCGCCGCGC GCCGGTCCAG CCAACGAGCA GCTCGCCAAC ATCATCGGCG CCGTGGACGC CAAGCAAAAG TTCTTCATCG CGGAATCGTA TGGTGGTGAA GACGAGCCCG TGGACTTGTT GGCGAAGAAA CTCGCCGAGC TCGGCGTTAC CGAGGCTTTT TCGCCCCTCA AGGTGACGAG CGACCCGACC GAGGGCACGT ATCAATTGTT CGAAGAGGCT GGCACCGATT TGGGTCAACT TTTGACGAAG AAGAAGACAC TGGCGGACAT GAAGAGCGCC ATGTCACCTG ACGTCGCCAA GGCGCTCGGT CGTGTCAGTG GAGGCTTGTA CGTCGTCACC GCCGCGCAGG GCACCGCGCG TTCGGCGATG ATCGCGTCTT GGGTCGCCCA AGCCTCGTTC GAACCTCTCG GTTTTACCGT CGCGGTGGCG AAAGATCGCG CCATTGAATC ACTGATGCAA GTCAACGACA CGTTTGTCCT GAATTGCCTT CCCGAGAACG GCTTCGAACC GTTAATGAAG CACTTTCTGA CGCGATTCCC ACCCGGCGCC GACCGCTTCG AGGGCGTGGA GTGGGCGCCT GCCAACTGCG GCGCGCCCAT CCTCGGCGAC GCCGTGGCGT TCATGGAGTG CCGAGTGGTG TCTCGAATGG AAGCCAACGA TCACTGGATC GTGTACAGCG AAGTTTTCAA CGGTAAGGTG TTCAACCAAG ACGTACGAAC GGCGTCCCAT CATCGCAAAG TCGGCTCTTA CTACTGA
|
Protein sequence | MRVDAARARA ATRADAMASR CRLGRRPEGS RGATTRARRA RRARWVECAV TDPPVTREAV RTSDVELFEG KKRLQTQATA LGEGTTLIRS LDWDRDRFDI EFGLEKGTTY NSYIIRGAPG TAALIDASHE KFRELYMKTL TGEIDPLEIK YVVCSHTEPD HSGLIGDVLK IAPNATVLGS KVCLAFLENL IHEPFESRVV KGGDVVDLGD GHELKFIMAP NLHWPDTMFT YDPKSRLMYT CDAFGSHYCS EDPFDGDLSA LMPHYRFYYE CLMKPNARSV LTALRKCESE GADFVGICNG HGPLLRYNVD ELVGDYKKWS ESALAKAKAN VAVFYTAEYG FSDRLSQSIA RGLTKTETEV VMMDLLSADS QELVETTKHA AGIVLLSPPR AGPANEQLAN IIGAVDAKQK FFIAESYGGE DEPVDLLAKK LAELGVTEAF SPLKVTSDPT EGTYQLFEEA GTDLGQLLTK KKTLADMKSA MSPDVAKALG RVSGGLYVVT AAQGTARSAM IASWVAQASF EPLGFTVAVA KDRAIESLMQ VNDTFVLNCL PENGFEPLMK HFLTRFPPGA DRFEGVEWAP ANCGAPILGD AVAFMECRVV SRMEANDHWI VYSEVFNGKV FNQDVRTASH HRKVGSYY
|
| |