Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00720 |
Symbol | |
ID | 3259200 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 980215 |
End bp | 985878 |
Gene Length | 5664 bp |
Protein Length | 1691 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258411 |
Product | histone-lysine n-methyltransferase, h3 lysine-9 specific, putative |
Protein accession | XP_572264 |
Protein GI | 58270216 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATAG CAAATTCGTC TTCTTCACCA GAAGATCAGA ATATCACCGG TCTTTTGGAA AATCATTCTC GAGAGACAGG AGAGCTGCCT AGTCCGCCTT TAGTTGAGGA TCGTGCGGCA CCGAATGGAA TGAAACTCGA TGGGGAGAAC ATTGTTAGTC AGAGTAATGG GATTAGTGCT GCTCAAGCGG CGGTAACCAC ATCTAGAACG TCTTCATTGA CATCAAATTT AAACAGACCC CAATCGGTCT CTGGGGCTCA ACCTGCTGTC GCTTCCTCTA GCAGATCTGC CATCAATTCT CCTCCCAGAC GATACCTTTT GATATCCCAA CAGTCACCTT CGTCTCGTAC GACACCTTGC AGATCTGCCG GTACTGCGTC TCCCACTTCA AGACCCCCAA CTCCTCCATT ACCGCCAACA CAGTCAACAC CTGTAACTGT ATCAGGGCTT CCAGCGCCCT TGATACTCTC CCCTGTACCG GCTGCTTCAT CAGCCATCTC GCCCACCATA CCTTCAATAC CTTTACACGC TTCACTTCCC TATCCTACAC TATCTTTACC GCCTTCAGCT ACACCAGCAC TTCCAAATTC CTTTACACTT TCTGCATCCA CTATCCTTCC TGCAGATTCT TCAATTTCTG CACTTCCATC AACTTCTGCA GCAGGTTTAC CTGTGATGCC TGTATTCATT TCACCCGCTA CATCTTCTCC AGCTGGTCCT GAACCATCAA CTCTTCAATC CGCTTCAGAA GCCACAAAAT CATCTAACCA AAAAGAATCG CCAAGGAAAA CGGACTCCAA ACAAGCGCCT TCCCCTTCAA GTACCCATGG TTTGTGGACG AATACAAGGC CATTTCACGC TTCTTCTAAT TTATCAGCTA CGTCTGGGAA ATCTTCAGCA GCTTCGTCGA GATCGACATC TCGGACACTG TTACCTAGTC GTGTGGCTAC GAGTGACAAT GTTGCCGGTC CCTCAAAAAC GACTTCTGTT TCATCAACTC ACCCACCAGC TAGAGCCCCA TCGTTATCTC TACCCTCAGA GCCAGAAAGG CAAGTTCACC CAAGGTCGCC ATCTCATGAA GGCGTAGGAA AGAGGGAGAA GAAATCCAAG GCGCTTTCTT CTGGTACTGG CCAAAGTACA CCCTCAAGAT CTTCATTATC AACTTACGAT GTGGCCTCAA ATCGGAAAAC TAAATCGACT GGCCTTAATG CGTTACCACA AAAGCCCTCC AGTGCAGGTA AGACCTCAAG TATCCCGCAA AGAACCAGTA CATTGGCCAC GGCCATGGCC ATACGTCCGT TTGTTCGATC GCCAACTTCT CCTTGTCGCC CATCGTCGTT TGAAGCTTCC GGACCTATCC AGTCTTCCGG AGGAGCAGTA AAAGCCGCTC AAAATCAAGA TAAAATACAT CCATTGTCGA CAACATCGTC CATATCTTCC CATTCTAGGG AAACAACGAC AAAGGCCGCC ACACCCTCCC ACGTCACAAT TTCATCACCA TCGCCTTCCC AATCCACTTC AGCATCGACA ACAGTGCACG TGACACAAGA TGCTCAAAAG TCAGCAACAA CCTCACCTAC CAGAGAAAAG ATACATCCTT CTAGAAGGAA GAGCCAGACA AGCGATTGTG TGGTTGCTCC GACAACTTCC CAGACTGCGG CCTCCTTGTC ATATCCGGCA GGGACATCAG ATGGGACTGG AGAAGCTGAT AAGAAGGCCG CGGGAACCAC TGTTCACGCA GGACCATCTT CCGCTGATTT TGGAACATTT GTAGTGCCTC CTACAACCGC TATTGCATCC ACGACTGTGT CCCCCTCGGT GACTCCACCT GAAGATGCTC CTTTTTTAAA GTCGGGTCCA ACAGTTCATC CACTTTTTCG CGCTGCGTCT GCAATTGGGG GTCAGGATAC TTTAACCGAG ACGGGAGCAG TTGGAGACGT GTTGTCGCAA GGATCAAAAG GCCGGGCATC ATCACCCCGG ATAACAACTA CTGGAGCAGT TTCGCATCTT GTGCTACCTA TTGATGTAGG CTCAAAAATT AAGCCAACAG CAGCGCCTTC ACTCTTTGCA TCAGGTGTGA CTATCGGAAA TGTCGATGAT CAGGCTGCTG CGCAGCCACA GAACCCAGCA CCTCTTCCGT CAGCTTTATC TGTTTTGTCG AAGAAGAACC TAGGGTCATC GTCCAAAGAG TCAAGCGTTA AATCTGCTGC ATCGTCTGTC TCCAGCTCAA GCACCCTGGG CGATTCGGTT TCCGGACTCG GTAGAAAATC GCATCACAAG TCGACGCAGT CGATACCCCA AAGTCCGCTC CCGACGACTA ATACCAGCAA TAGCAGCGGC ATAGAAGTCA ATTCTTGGCT TAATCCGCAA CCTTTCGGTC CTTCTTTCGC CGCTTCGCAT TTGAATACAC ATGCCAAGAA GAAGTGGAAA GGAAAGGAAA GAGAGAAGGA GAGGGAAAAG GAAAAGGCTC GAGAGAAGGA AAAGGCGAAA GAAGAGAACG AGCAATTGAG GCGAAGACGA ATGGCAGATT TGGAGAAGAT TTCGGCAAAC CTGGAAAAGT ATAAAGGACG TATGGGCGTT GAATCTCGAA CTCGCTCGAT CCCTCGTGCG GATGGCGAAA ACAGAAAGCG GCCGCCAAGT CCAGGAATCA GTGCAAGTAC TGATGGGGAG GTGAGAGTTG TGAAGAGGAG CAAACCAGCG TGGGCTATCG CGCCTGAAAA TGGACACGTC GCTGCCGCAA TGGAGAAAGA AACAAATGAC AACAAGAAGA CGACGGATGC CATCAAAATC AATCATCAAC CAGCAGCGTT TTATACTCCC CAGTTTGCTA CTCCCACCTC AGCTGCTAAG ACAACTACAA CTCCTCTATC TGTTACTGCC GGCTTGGGAC CACCAGTCAA TTCTTCCAGC TCATCCAGTA CGCCTTCATT GCTTAGTCGT TCGATTAATT TAGATATACC GAGGGAGACG CCGGAAGAGC GTATGGATGA TGACGACGGA AATGATTCGA CAGGTGGGTT GCTAGTCAGA AAAACCGGGA AAAATGATGC TGTCGAAGAG CAAGTGGAGC AAGTCCGTCT GGATTTGGAT GATGTCTCTA TACACGGTAG CTCCCCCGTT GGGCCAATTT CTACTTTCAC CTCACGGTTT AGGAATGTGT CTATCACGCC CTCTCAGACT CAAAAGATAT TAGAGCAGCC AGGTGAAGAT GTACCTATCC GACGGTCTGC CATAAAGATA AATGGGAAGC AAAAGGCGCA ACGCAGCAAG GATAAGAGTG ATGGAAGCGA GAGTGAGGAT GATGTTCCTC TCAACTGGCC TTCAAGGAAA GGAAAGGGGA AAGCCGCTCC CGAACCCCAA GATAATTCAG AAACACAAAG GGATGATGAT GGTGTGAATT TGGGATCAAG CAATGGAGAG GAGGGGTCCA GCGACGACCA CTCTAATCCT GTTCTACCGC CTTTCTTATT CAAAGATCGG CCTGCGCATG TGCGACAGCC GTTGCAGACA CTTTTCAAAG GCGGATTTGT TGCAAAACAT AGCTACCAGC GCCCTCTTGT CAAGTCTTCT GCTGTTTCTC CCGTTCCGAA TGCCAATCGC ACCTCCTCTG AGCCATCTGC CAAAGCTGAA CTGCTGCCCC AAAAGCGCAA ACGTAGGCCC AAGAAACTTA CCAGAGAGCA ATGGCACCAC ATCGCACAGA ACCATTTGTC AGATGTTGAT GACCTGCTTG ACGAATCTTC AAAGAAGAGG TTGAGTCCGA AGAGTGCGGA GAAGCTCGGT GCCGATCTTT CCAAGCTCAC AAGTCCTCGC GTTTTCTCTA TCGTTTCTCG CTCAGAGCCC CGGTCAAAGG GAGCGCCCAT TGAAGAAAAT GATAACGAGT ACTTTACGGA CTCTGACTCA CATACATCTG ACATTGCCCT GTTCCTCCAA CATCCTGATC CTCCTCCTCC TCCCGAGCGC ATTCGAGAAG CCAAGCGCAA TTTCGGTACG CGTACCATCG ATCCATGGAA CAGGCAGAAG CATACTTTTC GATCCAACCC GGCTCTGCAT CGTGCCATCT TTGAGGCATA TATTATGCAG TCTACGTCAA TGGAAGAGTC GGGTGGGGAT GATATCAAGG TGACGAATGA GGTCGACGCA GATGGTGGTC CGCCGGACTT TGAGTTTGTG TATTCAGATA CTATGTTATA TCCGGACGGT ATACCGCCAC CAGAGCTAGG TTTGGGGTGC GATTGTGATG GACCTTGCGA TCCGGATTCT GAGACTTGTA CTTGTGTAAA GAGGCAAGAG CTTTACTTTT ATGATCTGGG CTTGAAAGGT TTCGCATATG ATGAGTGAGT CGTTTTGACT ATCGGGTGCA CCGTATTCTG AGTTGATATT GTGATAGAAA TGGCAAAATC AGAGAAAACT CTGCTTCCAT TTGGGAGTGT AACGAGCTAT GTGGATGTCC TCCTGAGTGT ATGAATCGCG TCAGTTGGCC TCTATACTGT CCATTATATC ATTTCTTAAT AATGATTATT TTCATGTTAG GTTATCCAAC GAGGCCGTGC CAAGGATACC GGAATAGAGA TTTTCAAGAC CAAAGAAAAA GGTTGGGGTA TCCGCGCTCG GTCATTCATA CCAAGTGGAA CATACATCGG CAGTTACACA GGAGAACTGA TTAGGGAGGC GGAAAGTGAA CGACGAGGAG TTACCTACAC CGCCATTGGT CGAACGTACG TCCCTTCCCT TGTACATCTC TTCACACTGT ATGACATTTG ACGCTGACGC TTATCCACAG ATACGTTTTC GACCTCGACG GATGGCAAAT TCGGCACCCA CCCAAGGGGT TGGAGAAGAT TGATAAACGC GCCGCAGAGC TAGCTGAAGC GGTGAAAATG AGAGCCAAGG CTGCAATGCG AGAGAGCCAA GAAGATGCTT ACAATGCATA TAGTGGTAAG TTTCTTTCTC GTAAGCAAGG GACCAACAGG AACAAGGCTG ACGATGCACA AATTTAGTGG ATGCTTTCCA TTATGGGGTA CGTGTATATT GGTCTTGGGC AGAAAGCTGT TGAATTACTG ACGACTTGTA TTATATCTTG TGTTTACTGA TCATAGAACG TAAGCCTATT GCCCACTTGA AAAAGAGACC AGAAGCTGAT CCACTGTCTT AGTTCACTCG CTATTTTGTG AGAGGTTCGG AATTCTTGAT TCGAATTTGG ATACTGATCA AATGAACTTT AAGAATCACT CATGTGATCC CAATTTGGCG ATTACCCAAG CATATGTCAA GGACTTCCAT CCAGAACGAC CATTGTATGT CGCCATTTTT AAGAGTCAGA AGTGTCACAG AACTTCACTA AATACCGTTT CATAGGCTTG TGATCTTTAC TCGTCGAGAC ATCAAAAAGC ACGAAGAACT TTGTATCAGT TACAAGGGTA TACCCGTACG TCTCATATAT CCTTTTTTAG CTTCTGGTTT TCCTTGAGAG ATGTGGATGC TAACGATGTC TATGCCGACA TACAGGATGA CGACGTTCCA TCCCCAGAGC CTGTCAAAAA GAAGAAGGGA AATAAGGGCA AGAAACAGAT GTCTAAAACA TCTGCTTCGG CACATCCGCC AGAGATGATA GCTTTGAATT CAGACAAAGG CCCGGTGGAA GTGAAGGATA TCTGTCGATG GTAA
|
Protein sequence | MSIANSSSSP EDQNITGLLE NHSRETGELP SPPLVEDRAA PNGMKLDGEN IVSQSNGISA AQAAVTTSRT SSLTSNLNRP QSVSGAQPAV ASSSRSAINS PPRRYLLISQ QSPSSRTTPC RSAGTASPTS RPPTPPLPPT QSTPVTVSGL PAPLILSPVP AASSAISPTI PSIPLHASLP YPTLSLPPSA TPALPNSFTL SASTILPADS SISALPSTSA AGLPVMPVFI SPATSSPAGP EPSTLQSASE ATKSSNQKES PRKTDSKQAP SPSSTHGLWT NTRPFHASSN LSATSGKSSA ASSRSTSRTL LPSRVATSDN VAGPSKTTSV SSTHPPARAP SLSLPSEPER QVHPRSPSHE GVGKREKKSK ALSSGTGQST PSRSSLSTYD VASNRKTKST GLNALPQKPS SAGKTSSIPQ RTSTLATAMA IRPFVRSPTS PCRPSSFEAS GPIQSSGGAV KAAQNQDKIH PLSTTSSISS HSRETTTKAA TPSHVTISSP SPSQSTSAST TVHVTQDAQK SATTSPTREK IHPSRRKSQT SDCVVAPTTS QTAASLSYPA GTSDGTGEAD KKAAGTTVHA GPSSADFGTF VVPPTTAIAS TTVSPSVTPP EDAPFLKSGP TVHPLFRAAS AIGGQDTLTE TGAVGDVLSQ GSKGRASSPR ITTTGAVSHL VLPIDVGSKI KPTAAPSLFA SGVTIGNVDD QAAAQPQNPA PLPSALSVLS KKNLGSSSKE SSVKSAASSV SSSSTLGDSV SGLGRKSHHK STQSIPQSPL PTTNTSNSSG IEVNSWLNPQ PFGPSFAASH LNTHAKKKWK GKEREKEREK EKAREKEKAK EENEQLRRRR MADLEKISAN LEKYKGRMGV ESRTRSIPRA DGENRKRPPS PGISASTDGE VRVVKRSKPA WAIAPENGHV AAAMEKETND NKKTTDAIKI NHQPAAFYTP QFATPTSAAK TTTTPLSVTA GLGPPVNSSS SSSTPSLLSR SINLDIPRET PEERMDDDDG NDSTGGLLVR KTGKNDAVEE QVEQVRLDLD DVSIHGSSPV GPISTFTSRF RNVSITPSQT QKILEQPGED VPIRRSAIKI NGKQKAQRSK DKSDGSESED DVPLNWPSRK GKGKAAPEPQ DNSETQRDDD GVNLGSSNGE EGSSDDHSNP VLPPFLFKDR PAHVRQPLQT LFKGGFVAKH SYQRPLVKSS AVSPVPNANR TSSEPSAKAE LLPQKRKRRP KKLTREQWHH IAQNHLSDVD DLLDESSKKR LSPKSAEKLG ADLSKLTSPR VFSIVSRSEP RSKGAPIEEN DNEYFTDSDS HTSDIALFLQ HPDPPPPPER IREAKRNFGT RTIDPWNRQK HTFRSNPALH RAIFEAYIMQ STSMEESGGD DIKVTNEVDA DGGPPDFEFV YSDTMLYPDG IPPPELGLGC DCDGPCDPDS ETCTCVKRQE LYFYDLGLKG FAYDENGKIR ENSASIWECN ELCGCPPECM NRVIQRGRAK DTGIEIFKTK EKGWGIRARS FIPSGTYIGS YTGELIREAE SERRGVTYTA IGRTYVFDLD GWQIRHPPKG LEKIDKRAAE LAEAVKMRAK AAMRESQEDA YNAYSVDAFH YGNHSCDPNL AITQAYVKDF HPERPLLVIF TRRDIKKHEE LCISYKGIPD DDVPSPEPVK KKKGNKGKKQ MSKTSASAHP PEMIALNSDK GPVEVKDICR W
|
| |