Gene Information |
Locus tag | CND06110 |
Symbol | |
ID | 3256840 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1684257 |
End bp | 1686246 |
Gene Length | 1990 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 48% |
IMG OID | 638256551 |
Product | squalene monooxygenase, putative |
Protein accession | XP_570092 |
Protein GI | 58265872 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.426189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTCCACATT TATTCTCTAA AAACCACGAA AAAATACCAC AAAGATGCTT GCGATGACCC CTCCAATATC ATCCCCCGAA ATAATCATAA TAGGTGCAGG AGTTATCGGC TCGGCGCTCG CTTACTCCCT CTCTCACACC GGTCGCAAGA TCCTCCTCCT CGAGCGTGAC CTCTCAGAAC CAGATAGAAT CGTCGGTGAA CTCCTTCAAC CCGGTGGCGT AGCTGCACTT GCTCAACTCG GGATGGTAGA TGTGCTGGAA GGGATTGATG CTGCGCCTGT GGAAGGTTAT TGTGTAGTGA ATGGCGAGGA GAAAGTCGGT GTGAACTATC CTCAGGTAGA TGGGAATGGT CACGGCAAGC TTATCGATGA GAAGGTCAAC GGGAAGAATT GGCATGTAGC TACCACTTCC GGGCTCAAAG AAGGTCGTTC ATTCCACCAT GGACGACTGA TATCTGCTCT CCGACGCAAG TGTATCGATC AGGCGCCCAA TGTTACAGTC GTTGAAGCGA CCGTCAAAGA CTTGCTGTTC TGCGAACACA CCAACCAAGT GATCGGAGTT TCCGCTTCTT TCAAATCCGC ATCTGGAAGA GAACCCAACG TCCGCAAATT CTATGCCCCC CTTACTGTCA TTGCCGATGG CTGCTTTTCT AAATTCCGTC ACCATCCCGC GCTTCGAACC AGGGTACCTG ATACCAGATC ACATTTCGTT GGACTTATCC TCCAGAACTG CGAACTGCCA ATGAAACACT ATGGCACTGT CTGTCTCACA CCGAATGGAC CTGTCTTGCT TTATCAAATT GGGAATGAAA AGGGCGAGGT GAGGATGCTC GTGGATGTCA AGGGCAAGCT GCCTAGTGTT GGAGACGGTT CTCTCAAGGT GAGTTTGAAA TTCGCATATA TAGCAGAATT CGTACTGATA CTTTTTACAG CAACACTTAA TTGACAACTA TCTCCCATAC ATTCCCGCCT CACTTCGATC TCCTCTCCTC GACGCTCTTT CCACTCAACG TCTCCGCTCC ATGCCCAACT CCTACCTCCC TCCCTCCATC CAGGGTCTCC GCTCCAATCT TCAAGGCGCC ATCCTCGTTG GTGACGCCTA CAACATGCGT CATCCCCTTA CAGGTGGTGG TATGACAGTT GCTTTCAATG ATGCTGTCCT CTTGACGGAA TATTTGAAAC CTGGCGGCAA GTTAAGGCGA AAACCTTGGG AAGACGGATT AGCCCCAGGT AGAGAGGGGT TGGAGGATTG GGACAAGATT GCGGAAAGGC TGAGGGAGTG GTTCTGGGAG AGAAAACAAT TGAGCGGTGT CGTAAATGTG CTTTCTATGG CTCTTTACAG CTTGTTCGGT GGTTCGGACA GTACGTCGCG TCGATGCGTC TTATATTCCT TGGAAACTAA CCTATTGTCT CAGAACCAGA TCTTGCGATC CTAAGAGAAG GATGTTTCAA GTACTTTGAA TTAGGGGGAG AGTGTGTGGC TGGCCCTGTC GGATTGCTTT CTGCGTGAGT TATCCACTTC CTATTTTCCG ATCATTACTC ACTCATTCTC TCTTGTTAGT CTCACACCGC GCCCTGTCCA ACTTTTCTAC CACTTTTTCA ACGTCGCCTT TTACTCCATT TACCTCCTAC TCATCCACGG TCCTCCCCAG CGTAGAACAA ACGGATCTGT AGGAGCCATT GCCATGTTAC CATTGAATTT GCTTTTAAGT TTCAAGGTGG TACGTGTTAT ATATTTTCTT CACCTAGTTG AGGATTCAAC GGGCTGATCA TTGATTTGGA TGATAGTTCT ACACGGCTTG 
TGTGGTTTTG CTGCCGTTCA TGCTTATTGA ATTTAGAAGC TAAAGAGCGT AGATTTACAT AGAAATTTTG AAGCTTGGCT CATGGAAGGC AACGAGCTTG AAGAGTTAAT TGATGGATTT TGGAACAGCA TCTTCAATTA TGTCTTAGGT CTAGGGTTTT CTTTTCAGAT TTGCATGTAC TATGTCTTGT
|
Protein sequence | MLAMTPPISS PEIIIIGAGV IGSALAYSLS HTGRKILLLE RDLSEPDRIV GELLQPGGVA ALAQLGMVDV LEGIDAAPVE GYCVVNGEEK VGVNYPQVDG NGHGKLIDEK VNGKNWHVAT TSGLKEGRSF HHGRLISALR RKCIDQAPNV TVVEATVKDL LFCEHTNQVI GVSASFKSAS GREPNVRKFY APLTVIADGC FSKFRHHPAL RTRVPDTRSH FVGLILQNCE LPMKHYGTVC LTPNGPVLLY QIGNEKGEVR MLVDVKGKLP SVGDGSLKQH LIDNYLPYIP ASLRSPLLDA LSTQRLRSMP NSYLPPSIQG LRSNLQGAIL VGDAYNMRHP LTGGGMTVAF NDAVLLTEYL KPGGKLRRKP WEDGLAPGRE GLEDWDKIAE RLREWFWERK QLSGVVNVLS MALYSLFGGS DKPDLAILRE GCFKYFELGG ECVAGPVGLL SALTPRPVQL FYHFFNVAFY SIYLLLIHGP PQRRTNGSVG AIAMLPLNLL LSFKVFYTAC VVLLPFMLIE FRS
|