Gene CNG03810 details


Gene Information

Locus tag: CNG03810
Symbol:
ID: 3258667
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 1070224
End bp: 1073558
Gene Length: 3335 bp
Protein Length: 834 aa
Translation table:
GC content: 48%
IMG OID: 638258004
Product: histone-lysine N-methyltransferase, putative
Protein accession: XP_572085
Protein GI: 58269858
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
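
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption here, though it agrees with the listed 3335 bp. A minimal sketch of the arithmetic (the function name `gene_length` is hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(1070224, 1073558))  # 3335
```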


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGACA TCATCGAAGG TACAAAACCA ACTCTAGATG ATCTCTGGGC AGGGGATGAG 
GATCAGAAGC CCCAGACTCC ACCTGTATCG CCACCAAGGT CTCCATCCTT CAAGCACGAG
CACAAATCTT CATTTCCTGG ATCGACTTCC CCACCTCCGG CATCCCCCAT AGGTGAAGAA
GATCTCAAAC CTCGTACTGC CAGGGCATCA TCTTCAACCT CCAAGGTGAA GAGCAGAAAG
GCTTCCCCAG AGGAGTTTAA GCCTGTGCTT ATAGACGACC TTCCGACCGC GTGGGATGAG
GCTCATGAGA CGTTCGAAGC GCTGGAGAAG TGCGTGTATG AGAGAAAAGA TATTGGGCTG
AGTAAGGAGA ACGATGAGAT GATGGTGTGC GAGTGTGTCT ATAATAGACG TGAGTCGTGG
ATACATTATA GCTCTGGCTG GACGTAGCTC TGGTGGCGAT GGGGGATGGA GCTGCGAGTG
ACGCGGGTGC TGACGGAGGC GTCGCAGATG ACCCAGACGC TGACCCTTGT GGACCTGATT
CGGATTGCAT CAACCGTGCA TTGTATATCG AGTGTATCGC GGGGGAATGT AGAGCGGGCA
AGCACTGTCA TAACCAGCAG TACGTGAAGT ACATAGGCAG AATGGCGAAA GGGCCCGACT
GATGGGCGGC TTAGATTCTC GAAGAGGCAG TATGCCAACG TGGATGTGGT TCTTACGGAG
AAAAAGGGCT ACGGATTGAG AGCGAGCTCG ACCATTCCAG CGTCAGTTGA TTACATTTCC
CCACCCATGA TTTTCATGTG ATGAGGTAAA GCTGATGTAT GATTGCGCAG GAATACGCTC
ATCTACGAGT ATATTGGTGA GGTGGTGGCG GAGAAAACAT TCAGAAAGAG GATGCAGCAG
TATGCAGATG AAGGCATACG GCATTTCTAC TTTATGATGC TTCAGAAAGA AGAGGTTGGT
GCTCTAACGG TTATACTATG CCCTTTAAGC GTGCGCTAAT GTACTGGCCG TTCAATAGTA
CATTGATGCG ACCAAGAAGG GTGGGATAGG TAGATTCGCT AATCACTCTT GTAACCCCAA
TTGTGAAGTG CAAAAATGGG TGGTGGGACG TAGATTGCGA ATGGGTATCT TCACGAAGAG
AGATGTGATC AAAGGGGAGG AGATTACTTT CAACTATAAT GTCGACCGAT ATGGGTGAGT
CATAGAAATT CCATTAAAAC CCACTTCTGA CACTCTGATT AGCCATGATG CCCAAACTTG
TTACTGTGGA GAACCGAATT GTGTCGGAAC CATTGGTGGA AAGACTCAGA CCGATATTGG
TACTATGAAC GATCTGTTTT TGGATGGTAG GTTGCTTGAC ATTAAACAGA GTGTCCATAA
CTGACTTGTT GTAGCTTTGG GCATTACGGA TGAAGTCGAG GCCATGGGTA TGAAGGGCAG
CAAGAAGAAG AAGTCAAGGC AACTTGACGA AGACTTTGTC GTGAGTTATC TCATTACAAT
GCGAGAAGCG CATTCCTAAT AAACGATCAT AGCCTATTCT CCGACCCATT AGTGCGCACG
AGGTCCAGAA AGTTGCTGCC GCCATTCGAC AGTCTATGGA GAACAAGAAG ATGATGTCAA
GGTTATTGCA ACGTATTCAG GTAAGTTTAT TTCTCACTGT TGAGACCGTT ACTAACTTGT
TTACAGATGA CCGATGACGG GGCAATGCAC CGCCAGCTCA TGAGGATGCA CGGTTTCAGT
CTCATGTACA TGGTGTTGAC TGAGTTGGCG GACGATAACG AGATTGTGCT TCTTGTGAGT
GACGAAATAT CAACCTGAAT TTCAACAGCT AAGCACTTCA TAGGCTCTTG AGAGTATGAA
CAAGTGGAAG CTTCAGATCC GTAACAAGAT TGAAGATTCC AAAATTGAAG AGCCTGTTAA
AGCGCTCAGC CAAAGCGGAG ATGAGAAGAT CTGCGGGTTG GCTAAGCAGC TTATCGAGTA
TTGGTCTACT CTCGAACTTT CTTATAAGAT TCCTCGTGTT TCCAAAATTG CATCTGTGAG
TAGTCTAAAA GGCAGCGAAA TGGAAATAAC TGACGCGGAC TTTAGCTTGA TGCCGATGAT
GAAGCGGGTA CACAGACCAT TGCTGAAGCC AATGTCGTAT CTGCAGCCCG TCGTCCTGAC
GCTTGGGAGA ACACTCAAGA AATCCAAATC GATATTGCTC CCGTCCGTCC CCGCACTCTG
CCCGTCTCAC GTCCTCGCCC TCCTCCTCCA CCACCACCTC TCCCTGTCAA GAAACCTGCG
CTCAATTCCA TGAGCTCCAC TGATCGTCTA AAGCTCGATG CAATTATCGC CATGGCGGAG
CAGACTGTCC AAGCTCAAGC AGCTGCCGCG GCAGTGGAAG CGACAGCTTC TCCCCAAGCG
GGCTCCAGTA GGTCCGGAAG CAGACCTGCG GAAGACGAGG AAAGAAGAAA GAGGCAGAAA
AGAACGCATA TGACAGAGGA GGAGTTAGCG GAACAGAAGG AGAGGAGGTT GAGGAAGCTT
ATCGGTGCGG TTGTCGTCAA GTCAATGAAC AAGTACAAAG ACATGATGGA GCACGATACT
TTCAAAAAGT ATGCCAGAGA AGTGAGTCAA ACCAAATCAT CGCTATTTCA ACATTCGCTG
ATAGGCTACT TAGTGCACCG ATACTCTGGT CAAGAAGGAG AAGAAAAGAA ATCCTTCTTA
TCAAGACGTC AAGCACCCTT CACTATCTGA CGACAAGAAA GCCAAGATCA AGTCCTTTAC
CAAAGATTAC ACTCACAAAA TCTTGAAGCA TCTCAAGGAG AAGGGCAAGC TTCGCAATCC
GAAGAGCTCC TCGTCCCTAA GAACTAATAG CAACGACCCG CGTCAAGCGG CCTCATCATC
TACCAACGGT GACACCCCCA GCATATCTAC TACCCCTTCA CAGGGAGCCA TCCAAAGGTT
ACGAGATGGA GAATTGGTGG ATGACATCTT TGGCGCCGAT GAAGATATGG TGATGGACCT
CGATGAAGAC ACACCTGAAA TGCAAAACGA TCACCCTGCA CCTCCTTCAG TTCCCCCGGC
TACACCTCCT TTACCACCTG TTCATGTTGA AGTCGTGGAT AACGTGTCTA CACCTACATC
GCAATGGGAA ACATGATGTG GTAAAGTCGG ATGAGACAGG TCTGGAGACT GTGACGGATG
CATGGAAGGA AAGAAGAGTT TGAGACATGC CTTTATGTAC AGGATCATTC GGAGTAATTG
TATGGTTGCT ATGGGTGTTA ATCGGTGTAT CAGAGTTCAG CACTGGCGAG TACAGTCATA
GTAAAATGCA TGCCGCCATT ATAGGCCATC CGTCT
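
The sequence above is printed in space-separated blocks of 10 bases, so the spacing and line breaks have to be removed before the length or GC content can be checked against the record. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = "ATGGGCGACA TCATCGAAGG"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```

Applying the same two helpers to the whole listing gives a length and GC percentage to compare with the Gene Length and GC content fields.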
 
Protein sequence
MGDIIEGTKP TLDDLWAGDE DQKPQTPPVS PPRSPSFKHE HKSSFPGSTS PPPASPIGEE 
DLKPRTARAS SSTSKVKSRK ASPEEFKPVL IDDLPTAWDE AHETFEALEK CVYERKDIGL
SKENDEMMVC ECVYNRHDPD ADPCGPDSDC INRALYIECI AGECRAGKHC HNQQFSKRQY
ANVDVVLTEK KGYGLRASST IPANTLIYEY IGEVVAEKTF RKRMQQYADE GIRHFYFMML
QKEEYIDATK KGGIGRFANH SCNPNCEVQK WVVGRRLRMG IFTKRDVIKG EEITFNYNVD
RYGHDAQTCY CGEPNCVGTI GGKTQTDIGT MNDLFLDALG ITDEVEAMGM KGSKKKKSRQ
LDEDFVPILR PISAHEVQKV AAAIRQSMEN KKMMSRLLQR IQMTDDGAMH RQLMRMHGFS
LMYMVLTELA DDNEIVLLAL ESMNKWKLQI RNKIEDSKIE EPVKALSQSG DEKICGLAKQ
LIEYWSTLEL SYKIPRVSKI ASLDADDEAG TQTIAEANVV SAARRPDAWE NTQEIQIDIA
PVRPRTLPVS RPRPPPPPPP LPVKKPALNS MSSTDRLKLD AIIAMAEQTV QAQAAAAAVE
ATASPQAGSS RSGSRPAEDE ERRKRQKRTH MTEEELAEQK ERRLRKLIGA VVVKSMNKYK
DMMEHDTFKK YARECTDTLV KKEKKRNPSY QDVKHPSLSD DKKAKIKSFT KDYTHKILKH
LKEKGKLRNP KSSSSLRTNS NDPRQAASSS TNGDTPSIST TPSQGAIQRL RDGELVDDIF
GADEDMVMDL DEDTPEMQND HPAPPSVPPA TPPLPPVHVE VVDNVSTPTS QWET