Gene Information |
Locus tag | CNA07480 |
Symbol | |
ID | 3253654 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 2047596 |
End bp | 2051756 |
Gene Length | 4161 bp |
Protein Length | 965 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253071 |
Product | DNA mismatch repair protein MSH2, putative |
Protein accession | XP_567098 |
Protein GI | 58259371 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.271772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTGA CGGATAAAGA AGGTCGCAGG GGACTTGACC TTCAAGCCTC CACCCACGCG CGTCGCCTTG AAAGATTACT CGACCTCGCT CATAATGAGG TTTGTCTTTG TCGCGTGGTC CATTCAATCT TTCCCTTTTC CTCCTCTCTT CTCTATAGAT CCAGACACTT TCCAAGATGG TAAGTGGCCT CCGGCCAATA TGTATCCCGG CGAAAGCAAA AACAGCTTTG CTGACGCCGT CCGTTGCAGC CCATGTATGG GAACGAATCC ACCTCGGCGC CCAAGCCGCT CTTCGATATG GGCAAGTTCC TGTATTTCCT GTTATATTTT GAAAAGTACT GACTGATGAA ATATTTAGAC AAGGACAGCG AGGAAAAATT TGTGCGATTT GTGGAGCGTA TGCCGACTGT GAGTTATACC TCAGCGACGC GTCGACATGA CGGTACAGGT GGGTGGACGT GTAACTAATA TAAAAATCGC CTTCAGAAAT TGGACGGCAT GATCAGGCTG TTCGATCGTG GGGTATGTTA ACTTTATTGA ACCAAAAAAA CAGGATTACT AGACTGATAC GCTGCAATAG GACTACTACT CGGCTCACGG CGCGGACGCC ATCTTCATTG CCAACGAAGT CTACAGAACC ACAAATGTCC TCAAATACCT TGGTTCAGGT TCTAAACCCT CTTCCTCTTC TGGACAATAT GCTCGAGGAT TACCTTCTGT CACCATATCC ATGGCTTTGA CCAAAGCTTT CCTCCGGGAG GCCCTTACAA CCAAGCAGAT GCGTGTCGAA ATCTACGCCC CCACGGGAGG AGTCGCTCCT GGAAGCCGAA AGGATCATTC CAAATGGGAG ATCTCAAAGA CAGCTTCGCC CGGCAATCTG AGCCAAGTAG AAGACCTGCT ATTCAGTGAC AGAGATCTGA CAGCGAATGC GGTCTCAATG GCCATCAGGG TGGTGGTCAA AGATGGGATA AACACTGTCG GTGTGGGTTT CGTAGATGTA CAAGAAAAGG TGGTAGGAGT GTCTGAATTC GTTGATGATG AGAACTTTTC GAACACCGAG GTATGTAGCT AGCTTTTTTT GAAACTATTT TGAGGCTGAC CAAGAACCAG TCGCTATTGA TCCAACTTGG TGTAAAGGAA TGTATATTGC AAGCAGATGA GAAGCGTCCA GAGCTGGCCA AATTAAGGAT GTTGGTGGAG TGGTGTGGTG TCATCGTCAC CGATCGCAAA TCGAGTAAGT ACCTTGATAC GTTGCCAGTC AGTGACTTAC GGGTCTTAGG CGAGTTCCAA ACCAAAAATG TTGAACAAGA CCTTAATCGG TTGTTGCACG AGTCTCATGC TGGTGCCGCT TTACGTATGT CCTAAAGAAT GAGGTTCTGT TATCTCAGCT AACCATGTGG CACAGCGGAG TTTGACCTCA AAATCGCCAT GTCAGCTCTA TCAGCACTTA TCAATTATCT CTCACTTCTA TCCGACCTCT CCCTCCATGG TCAACTCCGA TTATATCGTC ATGATCTTTC TCAGTACATG AAGCTTGACG CGTCCGCCCT CAAGGCTTTG AACCTGATGC CAAATCCTCA AGAGCTGGGT GGTAACAAGA ATATGAGCAT ATATGGGTTG TTAAACAGAT GCAAGACTAG TCAAGGGACA AGGTTGTTGG GAAGGTGGTT GAAACAGCCA CTGGTGAATC GCCATGAGAT TAGTGAGTAA TAGATATACT GTATGTTTAC TTGAACATCC ACTAACGAGA CTCTAGTTCA GAGACAGACT ATGGTTGAGG TTTTCGTTGA GGATTCTGTC 
AATCGCCAAT CTATTCAAAC AAAGTACCTC AAGCAGATGC CTGACTTTCA CAGAATCTCG AAAAAGTTCC ACAAACGAGT GGCTGGATTG GAAGACGTTG TCAGGGTGTA CCAAGCTGTG CAGCTGGTGA GATCCAAACG GCCACAAAAG GGGATAGAAC ACTGATTTAA GATTATTAGC TGCCTGGTTT GCAGGAAATT CTGGAAAATG CCGACACCCC AGAACCAGGA GCCAGGGATC TTATTGAGGA AATTTGGCTC AAGCCTTTAC GCGTATGTAC TTGCTCATTC TACCATCAAG TTCTTTTTTT GACCTCGATC CGTTAGGAAC ATATTGAAAA GCTTGGAAAT TATTCTTCCA TGGTAGAAGA CACCATCGAT CTTGACGAAC TTGCTAATCA CAACTATGTG ATACTTCCTA CTATCGATGA AGATCTTCAG AGATACAGAG AAGAGTTGTT AAACGTGCGA GATCAGCTTG ATGATGAACA CAGGCGAGTT GGAAGTGATC TGGGTCTGGA TATCGACAAG AAGCTTCATT TGGAGAATCA TCAAGTTTAC AAGTACTCCT TCAGGATTAC TAAGGCGGTA TGTCGACCTG CGTTATATCA AAAAGCGCCC ATCGCTAACG AAACTGCTTT TACAGGAGGC TAGCCTCATT CGTAACAAGA AGGAATATAT TGACCTCGCT ACCCAAAAAT CTGGTACCAT ATTCACCACT AAAACCCTCA AGGCGCTGAG CGAGGAGTAC TTCAGACTGC AGGAGTTGTA CGAGAAGCAG CAAAGGCACC TTGTCAAGGA GGTCGTCTCG ATCGCTTGTG AGTATTCGAA ATAACACTAA GTGGCAGCCG CTGATTTAAA GAAGCCTCGT ACACACCGGT TTTGGAAATG CTGGATAACT TGATTGCGGC TGTCGATGTC ATTGTCAGGT ATGTCTGTGC TTTGGCTAAA AGACGACAAG GGAGCTCATA CAGTATGATT AGTATGGCTC ACGTCTCTTC TGAGGCTCCC ATTCCTTATG TTAAACCCAT CTTGACTGAA AAAGGTACAT TCCGTTCCTT CAACCAAAAT CATGTCTGAC AAGAGTATAG GTACCGGTGA CGTCGTTGTT CTAGGCGCCC GTCATCCTTG TCTTGAAGTC CAAGACGATA TTGTCTTTAT CCCTAATGAC CATGAAATGC GCAAGGGTGA TTCCGAGTTT ATCATCCTTA CCGGACCGAA CATGGGTGGT AAATCGACGT ACATCCGACA GATTGGTGTC ATCGCCCTTA TGGCTCAGGT TGGATGCTTT GTGCCCGCCA CAGAAGCTCG GCTACCCATC TTTGACTGTA TCCTTGCGAG GGTTGGTGCT GGGGACAACC AACTGAAAGG AGTCAGTACA TTCATGGCCG AGATGTTGGA GACGGCGACC ATCTTGAGAG TAGGCACATA ACTTTCTAGT GTAAATATTG ATTTTTGTGT GCTGATAATT CCGTAGTCTG CTACCAAAGA CTCTCTGATC ATCATCGATG AGCTTGGTCG AGGTACATCT ACATACGATG GTTTTGGTCT TGCTTGGGCA ATATCAGAGT GCGTTTTTCA GCTTCCCGCA AGAGCGTTTT GCTGACTGCG GATTCCAGAT ACATTGCCGA AACGATTCAC TGCTTCTGTC TCTTCGCCAC CCATTTCCAC GAGCTTACCA GCCTTTCTGA AAAGAATTCT CACGTGAAGA ACTTGCACGT TGAAGCCCTT GTCAAGGACA AAGATGGAGA AGGGGGTGCG AAGGAAAGGG ACATTACGTT GCTGTACCAA GTCAAAGAAG GTATCTGTGA 
TCAAAGTTTC GGTATCCATG TGGCCGAGTT GGCAAACTTC CCTGAGAGTG TCGTCAAGGT GAATTCTCAT ATGTAGCCTT CCTTCAAGGG GTATATCGTT AACGTGTGAA TGTAGCTCGC CAAGCGTAAA GCGGAAGAGT TGGAAGATTT TGGAGGTGAG ACTTTATACT CGATAACCTA GGTATTCCGG ATGTTGACAT ATATCAACTG CTGCAGACGA CCAAACCCGA GCCCCATCAT CCAAGTTTTC AAAGACGGAA ATCGATGCTG GTACAGACAT CGTCAAAGAG TTCCTCGACA CTTGGAAATC TCGCGTCTCT GCCGCTGGAA GGGGAGGAGG GGCGGATGCT GAGATGGCCA TGAGTGAAGA TGAGATGGTT CAGCTGTTGA AGGATACCGC GGAAGAATTC AAGGACCGAT TGGAAGGGAA TGAGTGGGTG AAGAGCTTGA TGAGCACATT CTAGACGGTG GTTGCCTCTC GCTGGGGGTG GAGGAGGGCA GAGAAAAATA ACGAAAATGC CCTGTCGCTG CAGATGCAGG CTCTTTTCTT C
|
Protein sequence | MPMYGNESTS APKPLFDMDK DSEEKFVRFV ERMPTKLDGM IRLFDRGDYY SAHGADAIFI ANEVYRTTNV LKYLGSGSKP SSSSGQYARG LPSVTISMAL TKAFLREALT TKQMRVEIYA PTGGVAPGSR KDHSKWEISK TASPGNLSQV EDLLFSDRDL TANAVSMAIR VVVKDGINTV GVGFVDVQEK VVGVSEFVDD ENFSNTESLL IQLGVKECIL QADEKRPELA KLRMLVEWCG VIVTDRKSSE FQTKNVEQDL NRLLHESHAG AALPEFDLKI AMSALSALIN YLSLLSDLSL HGQLRLYRHD LSQYMKLDAS ALKALNLMPN PQELGGNKNM SIYGLLNRCK TSQGTRLLGR WLKQPLVNRH EIIQRQTMVE VFVEDSVNRQ SIQTKYLKQM PDFHRISKKF HKRVAGLEDV VRVYQAVQLL PGLQEILENA DTPEPGARDL IEEIWLKPLR EHIEKLGNYS SMVEDTIDLD ELANHNYVIL PTIDEDLQRY REELLNVRDQ LDDEHRRVGS DLGLDIDKKL HLENHQVYKY SFRITKAEAS LIRNKKEYID LATQKSGTIF TTKTLKALSE EYFRLQELYE KQQRHLVKEV VSIASSYTPV LEMLDNLIAA VDVIVSMAHV SSEAPIPYVK PILTEKGTGD VVVLGARHPC LEVQDDIVFI PNDHEMRKGD SEFIILTGPN MGGKSTYIRQ IGVIALMAQV GCFVPATEAR LPIFDCILAR VGAGDNQLKG VSTFMAEMLE TATILRSATK DSLIIIDELG RGTSTYDGFG LAWAISEYIA ETIHCFCLFA THFHELTSLS EKNSHVKNLH VEALVKDKDG EGGAKERDIT LLYQVKEGIC DQSFGIHVAE LANFPESVVK LAKRKAEELE DFGDDQTRAP SSKFSKTEID AGTDIVKEFL DTWKSRVSAA GRGGGADAEM AMSEDEMVQL LKDTAEEFKD RLEGNEWVKS LMSTF
|
| |
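The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with one stop codon. The helper names (`span_length`, `coding_length`, `gc_percent`) are hypothetical. Under those assumptions the span matches the listed Gene Length of 4161 bp, and the difference between the span and the coding length estimates how much of the span does not code for protein (introns, since the gene is marked as spliced, plus any untranslated sequence).

```python
# Values copied from the record above.
START_BP = 2047596
END_BP = 2051756
PROTEIN_LENGTH_AA = 965


def span_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein (3 per residue, optional stop codon)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases; spaces are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
non_coding_len = gene_len - cds_len

print(gene_len)        # 4161, matches the Gene Length field
print(cds_len)         # 2898
print(non_coding_len)  # 1263

# GC content of the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGACTGTGA CGGATAAAGA"))
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare with the listed GC content of 47%. That comparison is not worked through here.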