Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG03810 |
Symbol | |
ID | 3258667 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 1070224 |
End bp | 1073558 |
Gene Length | 3335 bp |
Protein Length | 834 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258004 |
Product | histone-lysine N-methyltransferase, putative |
Protein accession | XP_572085 |
Protein GI | 58269858 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGACA TCATCGAAGG TACAAAACCA ACTCTAGATG ATCTCTGGGC AGGGGATGAG GATCAGAAGC CCCAGACTCC ACCTGTATCG CCACCAAGGT CTCCATCCTT CAAGCACGAG CACAAATCTT CATTTCCTGG ATCGACTTCC CCACCTCCGG CATCCCCCAT AGGTGAAGAA GATCTCAAAC CTCGTACTGC CAGGGCATCA TCTTCAACCT CCAAGGTGAA GAGCAGAAAG GCTTCCCCAG AGGAGTTTAA GCCTGTGCTT ATAGACGACC TTCCGACCGC GTGGGATGAG GCTCATGAGA CGTTCGAAGC GCTGGAGAAG TGCGTGTATG AGAGAAAAGA TATTGGGCTG AGTAAGGAGA ACGATGAGAT GATGGTGTGC GAGTGTGTCT ATAATAGACG TGAGTCGTGG ATACATTATA GCTCTGGCTG GACGTAGCTC TGGTGGCGAT GGGGGATGGA GCTGCGAGTG ACGCGGGTGC TGACGGAGGC GTCGCAGATG ACCCAGACGC TGACCCTTGT GGACCTGATT CGGATTGCAT CAACCGTGCA TTGTATATCG AGTGTATCGC GGGGGAATGT AGAGCGGGCA AGCACTGTCA TAACCAGCAG TACGTGAAGT ACATAGGCAG AATGGCGAAA GGGCCCGACT GATGGGCGGC TTAGATTCTC GAAGAGGCAG TATGCCAACG TGGATGTGGT TCTTACGGAG AAAAAGGGCT ACGGATTGAG AGCGAGCTCG ACCATTCCAG CGTCAGTTGA TTACATTTCC CCACCCATGA TTTTCATGTG ATGAGGTAAA GCTGATGTAT GATTGCGCAG GAATACGCTC ATCTACGAGT ATATTGGTGA GGTGGTGGCG GAGAAAACAT TCAGAAAGAG GATGCAGCAG TATGCAGATG AAGGCATACG GCATTTCTAC TTTATGATGC TTCAGAAAGA AGAGGTTGGT GCTCTAACGG TTATACTATG CCCTTTAAGC GTGCGCTAAT GTACTGGCCG TTCAATAGTA CATTGATGCG ACCAAGAAGG GTGGGATAGG TAGATTCGCT AATCACTCTT GTAACCCCAA TTGTGAAGTG CAAAAATGGG TGGTGGGACG TAGATTGCGA ATGGGTATCT TCACGAAGAG AGATGTGATC AAAGGGGAGG AGATTACTTT CAACTATAAT GTCGACCGAT ATGGGTGAGT CATAGAAATT CCATTAAAAC CCACTTCTGA CACTCTGATT AGCCATGATG CCCAAACTTG TTACTGTGGA GAACCGAATT GTGTCGGAAC CATTGGTGGA AAGACTCAGA CCGATATTGG TACTATGAAC GATCTGTTTT TGGATGGTAG GTTGCTTGAC ATTAAACAGA GTGTCCATAA CTGACTTGTT GTAGCTTTGG GCATTACGGA TGAAGTCGAG GCCATGGGTA TGAAGGGCAG CAAGAAGAAG AAGTCAAGGC AACTTGACGA AGACTTTGTC GTGAGTTATC TCATTACAAT GCGAGAAGCG CATTCCTAAT AAACGATCAT AGCCTATTCT CCGACCCATT AGTGCGCACG AGGTCCAGAA AGTTGCTGCC GCCATTCGAC AGTCTATGGA GAACAAGAAG ATGATGTCAA GGTTATTGCA ACGTATTCAG GTAAGTTTAT TTCTCACTGT TGAGACCGTT ACTAACTTGT TTACAGATGA CCGATGACGG GGCAATGCAC CGCCAGCTCA TGAGGATGCA CGGTTTCAGT CTCATGTACA TGGTGTTGAC TGAGTTGGCG GACGATAACG AGATTGTGCT TCTTGTGAGT GACGAAATAT CAACCTGAAT TTCAACAGCT AAGCACTTCA TAGGCTCTTG AGAGTATGAA CAAGTGGAAG CTTCAGATCC GTAACAAGAT TGAAGATTCC AAAATTGAAG AGCCTGTTAA AGCGCTCAGC CAAAGCGGAG ATGAGAAGAT CTGCGGGTTG GCTAAGCAGC TTATCGAGTA TTGGTCTACT CTCGAACTTT CTTATAAGAT TCCTCGTGTT TCCAAAATTG CATCTGTGAG TAGTCTAAAA GGCAGCGAAA TGGAAATAAC TGACGCGGAC TTTAGCTTGA TGCCGATGAT GAAGCGGGTA CACAGACCAT TGCTGAAGCC AATGTCGTAT CTGCAGCCCG TCGTCCTGAC GCTTGGGAGA ACACTCAAGA AATCCAAATC GATATTGCTC CCGTCCGTCC CCGCACTCTG CCCGTCTCAC GTCCTCGCCC TCCTCCTCCA CCACCACCTC TCCCTGTCAA GAAACCTGCG CTCAATTCCA TGAGCTCCAC TGATCGTCTA AAGCTCGATG CAATTATCGC CATGGCGGAG CAGACTGTCC AAGCTCAAGC AGCTGCCGCG GCAGTGGAAG CGACAGCTTC TCCCCAAGCG GGCTCCAGTA GGTCCGGAAG CAGACCTGCG GAAGACGAGG AAAGAAGAAA GAGGCAGAAA AGAACGCATA TGACAGAGGA GGAGTTAGCG GAACAGAAGG AGAGGAGGTT GAGGAAGCTT ATCGGTGCGG TTGTCGTCAA GTCAATGAAC AAGTACAAAG ACATGATGGA GCACGATACT TTCAAAAAGT ATGCCAGAGA AGTGAGTCAA ACCAAATCAT CGCTATTTCA ACATTCGCTG ATAGGCTACT TAGTGCACCG ATACTCTGGT CAAGAAGGAG AAGAAAAGAA ATCCTTCTTA TCAAGACGTC AAGCACCCTT CACTATCTGA CGACAAGAAA GCCAAGATCA AGTCCTTTAC CAAAGATTAC ACTCACAAAA TCTTGAAGCA TCTCAAGGAG AAGGGCAAGC TTCGCAATCC GAAGAGCTCC TCGTCCCTAA GAACTAATAG CAACGACCCG CGTCAAGCGG CCTCATCATC TACCAACGGT GACACCCCCA GCATATCTAC TACCCCTTCA CAGGGAGCCA TCCAAAGGTT ACGAGATGGA GAATTGGTGG ATGACATCTT TGGCGCCGAT GAAGATATGG TGATGGACCT CGATGAAGAC ACACCTGAAA TGCAAAACGA TCACCCTGCA CCTCCTTCAG TTCCCCCGGC TACACCTCCT TTACCACCTG TTCATGTTGA AGTCGTGGAT AACGTGTCTA CACCTACATC GCAATGGGAA ACATGATGTG GTAAAGTCGG ATGAGACAGG TCTGGAGACT GTGACGGATG CATGGAAGGA AAGAAGAGTT TGAGACATGC CTTTATGTAC AGGATCATTC GGAGTAATTG TATGGTTGCT ATGGGTGTTA ATCGGTGTAT CAGAGTTCAG CACTGGCGAG TACAGTCATA GTAAAATGCA TGCCGCCATT ATAGGCCATC CGTCT
|
Protein sequence | MGDIIEGTKP TLDDLWAGDE DQKPQTPPVS PPRSPSFKHE HKSSFPGSTS PPPASPIGEE DLKPRTARAS SSTSKVKSRK ASPEEFKPVL IDDLPTAWDE AHETFEALEK CVYERKDIGL SKENDEMMVC ECVYNRHDPD ADPCGPDSDC INRALYIECI AGECRAGKHC HNQQFSKRQY ANVDVVLTEK KGYGLRASST IPANTLIYEY IGEVVAEKTF RKRMQQYADE GIRHFYFMML QKEEYIDATK KGGIGRFANH SCNPNCEVQK WVVGRRLRMG IFTKRDVIKG EEITFNYNVD RYGHDAQTCY CGEPNCVGTI GGKTQTDIGT MNDLFLDALG ITDEVEAMGM KGSKKKKSRQ LDEDFVPILR PISAHEVQKV AAAIRQSMEN KKMMSRLLQR IQMTDDGAMH RQLMRMHGFS LMYMVLTELA DDNEIVLLAL ESMNKWKLQI RNKIEDSKIE EPVKALSQSG DEKICGLAKQ LIEYWSTLEL SYKIPRVSKI ASLDADDEAG TQTIAEANVV SAARRPDAWE NTQEIQIDIA PVRPRTLPVS RPRPPPPPPP LPVKKPALNS MSSTDRLKLD AIIAMAEQTV QAQAAAAAVE ATASPQAGSS RSGSRPAEDE ERRKRQKRTH MTEEELAEQK ERRLRKLIGA VVVKSMNKYK DMMEHDTFKK YARECTDTLV KKEKKRNPSY QDVKHPSLSD DKKAKIKSFT KDYTHKILKH LKEKGKLRNP KSSSSLRTNS NDPRQAASSS TNGDTPSIST TPSQGAIQRL RDGELVDDIF GADEDMVMDL DEDTPEMQND HPAPPSVPPA TPPLPPVHVE VVDNVSTPTS QWET
|
| |