Gene CNH00720 details

Gene Information

Locus tag: CNH00720
Symbol:
ID: 3259200
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006693
Strand:
Start bp: 980215
End bp: 985878
Gene Length: 5664 bp
Protein Length: 1691 aa
Translation table:
GC content: 48%
IMG OID: 638258411
Product: histone-lysine n-methyltransferase, h3 lysine-9 specific, putative
Protein accession: XP_572264
Protein GI: 58270216
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
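
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return (protein_aa + 1) * 3


span = gene_span(980215, 985878)  # Start bp / End bp from the record
cds = coding_length(1691)         # Protein Length from the record

print(span)        # 5664, matches the listed Gene Length
print(cds)         # 5076
print(span - cds)  # 588
```

Under these assumptions, the 588 bp difference would correspond to non-coding sequence inside the gene span, which is consistent with the record marking the gene as spliced.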


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTATAG CAAATTCGTC TTCTTCACCA GAAGATCAGA ATATCACCGG TCTTTTGGAA 
AATCATTCTC GAGAGACAGG AGAGCTGCCT AGTCCGCCTT TAGTTGAGGA TCGTGCGGCA
CCGAATGGAA TGAAACTCGA TGGGGAGAAC ATTGTTAGTC AGAGTAATGG GATTAGTGCT
GCTCAAGCGG CGGTAACCAC ATCTAGAACG TCTTCATTGA CATCAAATTT AAACAGACCC
CAATCGGTCT CTGGGGCTCA ACCTGCTGTC GCTTCCTCTA GCAGATCTGC CATCAATTCT
CCTCCCAGAC GATACCTTTT GATATCCCAA CAGTCACCTT CGTCTCGTAC GACACCTTGC
AGATCTGCCG GTACTGCGTC TCCCACTTCA AGACCCCCAA CTCCTCCATT ACCGCCAACA
CAGTCAACAC CTGTAACTGT ATCAGGGCTT CCAGCGCCCT TGATACTCTC CCCTGTACCG
GCTGCTTCAT CAGCCATCTC GCCCACCATA CCTTCAATAC CTTTACACGC TTCACTTCCC
TATCCTACAC TATCTTTACC GCCTTCAGCT ACACCAGCAC TTCCAAATTC CTTTACACTT
TCTGCATCCA CTATCCTTCC TGCAGATTCT TCAATTTCTG CACTTCCATC AACTTCTGCA
GCAGGTTTAC CTGTGATGCC TGTATTCATT TCACCCGCTA CATCTTCTCC AGCTGGTCCT
GAACCATCAA CTCTTCAATC CGCTTCAGAA GCCACAAAAT CATCTAACCA AAAAGAATCG
CCAAGGAAAA CGGACTCCAA ACAAGCGCCT TCCCCTTCAA GTACCCATGG TTTGTGGACG
AATACAAGGC CATTTCACGC TTCTTCTAAT TTATCAGCTA CGTCTGGGAA ATCTTCAGCA
GCTTCGTCGA GATCGACATC TCGGACACTG TTACCTAGTC GTGTGGCTAC GAGTGACAAT
GTTGCCGGTC CCTCAAAAAC GACTTCTGTT TCATCAACTC ACCCACCAGC TAGAGCCCCA
TCGTTATCTC TACCCTCAGA GCCAGAAAGG CAAGTTCACC CAAGGTCGCC ATCTCATGAA
GGCGTAGGAA AGAGGGAGAA GAAATCCAAG GCGCTTTCTT CTGGTACTGG CCAAAGTACA
CCCTCAAGAT CTTCATTATC AACTTACGAT GTGGCCTCAA ATCGGAAAAC TAAATCGACT
GGCCTTAATG CGTTACCACA AAAGCCCTCC AGTGCAGGTA AGACCTCAAG TATCCCGCAA
AGAACCAGTA CATTGGCCAC GGCCATGGCC ATACGTCCGT TTGTTCGATC GCCAACTTCT
CCTTGTCGCC CATCGTCGTT TGAAGCTTCC GGACCTATCC AGTCTTCCGG AGGAGCAGTA
AAAGCCGCTC AAAATCAAGA TAAAATACAT CCATTGTCGA CAACATCGTC CATATCTTCC
CATTCTAGGG AAACAACGAC AAAGGCCGCC ACACCCTCCC ACGTCACAAT TTCATCACCA
TCGCCTTCCC AATCCACTTC AGCATCGACA ACAGTGCACG TGACACAAGA TGCTCAAAAG
TCAGCAACAA CCTCACCTAC CAGAGAAAAG ATACATCCTT CTAGAAGGAA GAGCCAGACA
AGCGATTGTG TGGTTGCTCC GACAACTTCC CAGACTGCGG CCTCCTTGTC ATATCCGGCA
GGGACATCAG ATGGGACTGG AGAAGCTGAT AAGAAGGCCG CGGGAACCAC TGTTCACGCA
GGACCATCTT CCGCTGATTT TGGAACATTT GTAGTGCCTC CTACAACCGC TATTGCATCC
ACGACTGTGT CCCCCTCGGT GACTCCACCT GAAGATGCTC CTTTTTTAAA GTCGGGTCCA
ACAGTTCATC CACTTTTTCG CGCTGCGTCT GCAATTGGGG GTCAGGATAC TTTAACCGAG
ACGGGAGCAG TTGGAGACGT GTTGTCGCAA GGATCAAAAG GCCGGGCATC ATCACCCCGG
ATAACAACTA CTGGAGCAGT TTCGCATCTT GTGCTACCTA TTGATGTAGG CTCAAAAATT
AAGCCAACAG CAGCGCCTTC ACTCTTTGCA TCAGGTGTGA CTATCGGAAA TGTCGATGAT
CAGGCTGCTG CGCAGCCACA GAACCCAGCA CCTCTTCCGT CAGCTTTATC TGTTTTGTCG
AAGAAGAACC TAGGGTCATC GTCCAAAGAG TCAAGCGTTA AATCTGCTGC ATCGTCTGTC
TCCAGCTCAA GCACCCTGGG CGATTCGGTT TCCGGACTCG GTAGAAAATC GCATCACAAG
TCGACGCAGT CGATACCCCA AAGTCCGCTC CCGACGACTA ATACCAGCAA TAGCAGCGGC
ATAGAAGTCA ATTCTTGGCT TAATCCGCAA CCTTTCGGTC CTTCTTTCGC CGCTTCGCAT
TTGAATACAC ATGCCAAGAA GAAGTGGAAA GGAAAGGAAA GAGAGAAGGA GAGGGAAAAG
GAAAAGGCTC GAGAGAAGGA AAAGGCGAAA GAAGAGAACG AGCAATTGAG GCGAAGACGA
ATGGCAGATT TGGAGAAGAT TTCGGCAAAC CTGGAAAAGT ATAAAGGACG TATGGGCGTT
GAATCTCGAA CTCGCTCGAT CCCTCGTGCG GATGGCGAAA ACAGAAAGCG GCCGCCAAGT
CCAGGAATCA GTGCAAGTAC TGATGGGGAG GTGAGAGTTG TGAAGAGGAG CAAACCAGCG
TGGGCTATCG CGCCTGAAAA TGGACACGTC GCTGCCGCAA TGGAGAAAGA AACAAATGAC
AACAAGAAGA CGACGGATGC CATCAAAATC AATCATCAAC CAGCAGCGTT TTATACTCCC
CAGTTTGCTA CTCCCACCTC AGCTGCTAAG ACAACTACAA CTCCTCTATC TGTTACTGCC
GGCTTGGGAC CACCAGTCAA TTCTTCCAGC TCATCCAGTA CGCCTTCATT GCTTAGTCGT
TCGATTAATT TAGATATACC GAGGGAGACG CCGGAAGAGC GTATGGATGA TGACGACGGA
AATGATTCGA CAGGTGGGTT GCTAGTCAGA AAAACCGGGA AAAATGATGC TGTCGAAGAG
CAAGTGGAGC AAGTCCGTCT GGATTTGGAT GATGTCTCTA TACACGGTAG CTCCCCCGTT
GGGCCAATTT CTACTTTCAC CTCACGGTTT AGGAATGTGT CTATCACGCC CTCTCAGACT
CAAAAGATAT TAGAGCAGCC AGGTGAAGAT GTACCTATCC GACGGTCTGC CATAAAGATA
AATGGGAAGC AAAAGGCGCA ACGCAGCAAG GATAAGAGTG ATGGAAGCGA GAGTGAGGAT
GATGTTCCTC TCAACTGGCC TTCAAGGAAA GGAAAGGGGA AAGCCGCTCC CGAACCCCAA
GATAATTCAG AAACACAAAG GGATGATGAT GGTGTGAATT TGGGATCAAG CAATGGAGAG
GAGGGGTCCA GCGACGACCA CTCTAATCCT GTTCTACCGC CTTTCTTATT CAAAGATCGG
CCTGCGCATG TGCGACAGCC GTTGCAGACA CTTTTCAAAG GCGGATTTGT TGCAAAACAT
AGCTACCAGC GCCCTCTTGT CAAGTCTTCT GCTGTTTCTC CCGTTCCGAA TGCCAATCGC
ACCTCCTCTG AGCCATCTGC CAAAGCTGAA CTGCTGCCCC AAAAGCGCAA ACGTAGGCCC
AAGAAACTTA CCAGAGAGCA ATGGCACCAC ATCGCACAGA ACCATTTGTC AGATGTTGAT
GACCTGCTTG ACGAATCTTC AAAGAAGAGG TTGAGTCCGA AGAGTGCGGA GAAGCTCGGT
GCCGATCTTT CCAAGCTCAC AAGTCCTCGC GTTTTCTCTA TCGTTTCTCG CTCAGAGCCC
CGGTCAAAGG GAGCGCCCAT TGAAGAAAAT GATAACGAGT ACTTTACGGA CTCTGACTCA
CATACATCTG ACATTGCCCT GTTCCTCCAA CATCCTGATC CTCCTCCTCC TCCCGAGCGC
ATTCGAGAAG CCAAGCGCAA TTTCGGTACG CGTACCATCG ATCCATGGAA CAGGCAGAAG
CATACTTTTC GATCCAACCC GGCTCTGCAT CGTGCCATCT TTGAGGCATA TATTATGCAG
TCTACGTCAA TGGAAGAGTC GGGTGGGGAT GATATCAAGG TGACGAATGA GGTCGACGCA
GATGGTGGTC CGCCGGACTT TGAGTTTGTG TATTCAGATA CTATGTTATA TCCGGACGGT
ATACCGCCAC CAGAGCTAGG TTTGGGGTGC GATTGTGATG GACCTTGCGA TCCGGATTCT
GAGACTTGTA CTTGTGTAAA GAGGCAAGAG CTTTACTTTT ATGATCTGGG CTTGAAAGGT
TTCGCATATG ATGAGTGAGT CGTTTTGACT ATCGGGTGCA CCGTATTCTG AGTTGATATT
GTGATAGAAA TGGCAAAATC AGAGAAAACT CTGCTTCCAT TTGGGAGTGT AACGAGCTAT
GTGGATGTCC TCCTGAGTGT ATGAATCGCG TCAGTTGGCC TCTATACTGT CCATTATATC
ATTTCTTAAT AATGATTATT TTCATGTTAG GTTATCCAAC GAGGCCGTGC CAAGGATACC
GGAATAGAGA TTTTCAAGAC CAAAGAAAAA GGTTGGGGTA TCCGCGCTCG GTCATTCATA
CCAAGTGGAA CATACATCGG CAGTTACACA GGAGAACTGA TTAGGGAGGC GGAAAGTGAA
CGACGAGGAG TTACCTACAC CGCCATTGGT CGAACGTACG TCCCTTCCCT TGTACATCTC
TTCACACTGT ATGACATTTG ACGCTGACGC TTATCCACAG ATACGTTTTC GACCTCGACG
GATGGCAAAT TCGGCACCCA CCCAAGGGGT TGGAGAAGAT TGATAAACGC GCCGCAGAGC
TAGCTGAAGC GGTGAAAATG AGAGCCAAGG CTGCAATGCG AGAGAGCCAA GAAGATGCTT
ACAATGCATA TAGTGGTAAG TTTCTTTCTC GTAAGCAAGG GACCAACAGG AACAAGGCTG
ACGATGCACA AATTTAGTGG ATGCTTTCCA TTATGGGGTA CGTGTATATT GGTCTTGGGC
AGAAAGCTGT TGAATTACTG ACGACTTGTA TTATATCTTG TGTTTACTGA TCATAGAACG
TAAGCCTATT GCCCACTTGA AAAAGAGACC AGAAGCTGAT CCACTGTCTT AGTTCACTCG
CTATTTTGTG AGAGGTTCGG AATTCTTGAT TCGAATTTGG ATACTGATCA AATGAACTTT
AAGAATCACT CATGTGATCC CAATTTGGCG ATTACCCAAG CATATGTCAA GGACTTCCAT
CCAGAACGAC CATTGTATGT CGCCATTTTT AAGAGTCAGA AGTGTCACAG AACTTCACTA
AATACCGTTT CATAGGCTTG TGATCTTTAC TCGTCGAGAC ATCAAAAAGC ACGAAGAACT
TTGTATCAGT TACAAGGGTA TACCCGTACG TCTCATATAT CCTTTTTTAG CTTCTGGTTT
TCCTTGAGAG ATGTGGATGC TAACGATGTC TATGCCGACA TACAGGATGA CGACGTTCCA
TCCCCAGAGC CTGTCAAAAA GAAGAAGGGA AATAAGGGCA AGAAACAGAT GTCTAAAACA
TCTGCTTCGG CACATCCGCC AGAGATGATA GCTTTGAATT CAGACAAAGG CCCGGTGGAA
GTGAAGGATA TCTGTCGATG GTAA
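
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as GC content. The following is a minimal sketch of that step. The helper names are hypothetical, and the GC value the database reports may be computed or rounded differently.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace so the 10-base display blocks join into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a small example.
block = "ATGTCTATAG CAAATTCGTC TTCTTCACCA GAAGATCAGA ATATCACCGG TCTTTTGGAA"
seq = clean_sequence(block)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the full gene sequence text through `clean_sequence` and `gc_fraction` gives a value that can be compared with the listed GC content.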
 
Protein sequence
MSIANSSSSP EDQNITGLLE NHSRETGELP SPPLVEDRAA PNGMKLDGEN IVSQSNGISA 
AQAAVTTSRT SSLTSNLNRP QSVSGAQPAV ASSSRSAINS PPRRYLLISQ QSPSSRTTPC
RSAGTASPTS RPPTPPLPPT QSTPVTVSGL PAPLILSPVP AASSAISPTI PSIPLHASLP
YPTLSLPPSA TPALPNSFTL SASTILPADS SISALPSTSA AGLPVMPVFI SPATSSPAGP
EPSTLQSASE ATKSSNQKES PRKTDSKQAP SPSSTHGLWT NTRPFHASSN LSATSGKSSA
ASSRSTSRTL LPSRVATSDN VAGPSKTTSV SSTHPPARAP SLSLPSEPER QVHPRSPSHE
GVGKREKKSK ALSSGTGQST PSRSSLSTYD VASNRKTKST GLNALPQKPS SAGKTSSIPQ
RTSTLATAMA IRPFVRSPTS PCRPSSFEAS GPIQSSGGAV KAAQNQDKIH PLSTTSSISS
HSRETTTKAA TPSHVTISSP SPSQSTSAST TVHVTQDAQK SATTSPTREK IHPSRRKSQT
SDCVVAPTTS QTAASLSYPA GTSDGTGEAD KKAAGTTVHA GPSSADFGTF VVPPTTAIAS
TTVSPSVTPP EDAPFLKSGP TVHPLFRAAS AIGGQDTLTE TGAVGDVLSQ GSKGRASSPR
ITTTGAVSHL VLPIDVGSKI KPTAAPSLFA SGVTIGNVDD QAAAQPQNPA PLPSALSVLS
KKNLGSSSKE SSVKSAASSV SSSSTLGDSV SGLGRKSHHK STQSIPQSPL PTTNTSNSSG
IEVNSWLNPQ PFGPSFAASH LNTHAKKKWK GKEREKEREK EKAREKEKAK EENEQLRRRR
MADLEKISAN LEKYKGRMGV ESRTRSIPRA DGENRKRPPS PGISASTDGE VRVVKRSKPA
WAIAPENGHV AAAMEKETND NKKTTDAIKI NHQPAAFYTP QFATPTSAAK TTTTPLSVTA
GLGPPVNSSS SSSTPSLLSR SINLDIPRET PEERMDDDDG NDSTGGLLVR KTGKNDAVEE
QVEQVRLDLD DVSIHGSSPV GPISTFTSRF RNVSITPSQT QKILEQPGED VPIRRSAIKI
NGKQKAQRSK DKSDGSESED DVPLNWPSRK GKGKAAPEPQ DNSETQRDDD GVNLGSSNGE
EGSSDDHSNP VLPPFLFKDR PAHVRQPLQT LFKGGFVAKH SYQRPLVKSS AVSPVPNANR
TSSEPSAKAE LLPQKRKRRP KKLTREQWHH IAQNHLSDVD DLLDESSKKR LSPKSAEKLG
ADLSKLTSPR VFSIVSRSEP RSKGAPIEEN DNEYFTDSDS HTSDIALFLQ HPDPPPPPER
IREAKRNFGT RTIDPWNRQK HTFRSNPALH RAIFEAYIMQ STSMEESGGD DIKVTNEVDA
DGGPPDFEFV YSDTMLYPDG IPPPELGLGC DCDGPCDPDS ETCTCVKRQE LYFYDLGLKG
FAYDENGKIR ENSASIWECN ELCGCPPECM NRVIQRGRAK DTGIEIFKTK EKGWGIRARS
FIPSGTYIGS YTGELIREAE SERRGVTYTA IGRTYVFDLD GWQIRHPPKG LEKIDKRAAE
LAEAVKMRAK AAMRESQEDA YNAYSVDAFH YGNHSCDPNL AITQAYVKDF HPERPLLVIF
TRRDIKKHEE LCISYKGIPD DDVPSPEPVK KKKGNKGKKQ MSKTSASAHP PEMIALNSDK
GPVEVKDICR W