Gene CNA07480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA07480 
Symbol 
ID3253654 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp2047596 
End bp2051756 
Gene Length4161 bp 
Protein Length965 aa 
Translation table 
GC content47% 
IMG OID638253071 
ProductDNA mismatch repair protein MSH2, putative 
Protein accessionXP_567098 
Protein GI58259371 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.271772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTGA CGGATAAAGA AGGTCGCAGG GGACTTGACC TTCAAGCCTC CACCCACGCG 
CGTCGCCTTG AAAGATTACT CGACCTCGCT CATAATGAGG TTTGTCTTTG TCGCGTGGTC
CATTCAATCT TTCCCTTTTC CTCCTCTCTT CTCTATAGAT CCAGACACTT TCCAAGATGG
TAAGTGGCCT CCGGCCAATA TGTATCCCGG CGAAAGCAAA AACAGCTTTG CTGACGCCGT
CCGTTGCAGC CCATGTATGG GAACGAATCC ACCTCGGCGC CCAAGCCGCT CTTCGATATG
GGCAAGTTCC TGTATTTCCT GTTATATTTT GAAAAGTACT GACTGATGAA ATATTTAGAC
AAGGACAGCG AGGAAAAATT TGTGCGATTT GTGGAGCGTA TGCCGACTGT GAGTTATACC
TCAGCGACGC GTCGACATGA CGGTACAGGT GGGTGGACGT GTAACTAATA TAAAAATCGC
CTTCAGAAAT TGGACGGCAT GATCAGGCTG TTCGATCGTG GGGTATGTTA ACTTTATTGA
ACCAAAAAAA CAGGATTACT AGACTGATAC GCTGCAATAG GACTACTACT CGGCTCACGG
CGCGGACGCC ATCTTCATTG CCAACGAAGT CTACAGAACC ACAAATGTCC TCAAATACCT
TGGTTCAGGT TCTAAACCCT CTTCCTCTTC TGGACAATAT GCTCGAGGAT TACCTTCTGT
CACCATATCC ATGGCTTTGA CCAAAGCTTT CCTCCGGGAG GCCCTTACAA CCAAGCAGAT
GCGTGTCGAA ATCTACGCCC CCACGGGAGG AGTCGCTCCT GGAAGCCGAA AGGATCATTC
CAAATGGGAG ATCTCAAAGA CAGCTTCGCC CGGCAATCTG AGCCAAGTAG AAGACCTGCT
ATTCAGTGAC AGAGATCTGA CAGCGAATGC GGTCTCAATG GCCATCAGGG TGGTGGTCAA
AGATGGGATA AACACTGTCG GTGTGGGTTT CGTAGATGTA CAAGAAAAGG TGGTAGGAGT
GTCTGAATTC GTTGATGATG AGAACTTTTC GAACACCGAG GTATGTAGCT AGCTTTTTTT
GAAACTATTT TGAGGCTGAC CAAGAACCAG TCGCTATTGA TCCAACTTGG TGTAAAGGAA
TGTATATTGC AAGCAGATGA GAAGCGTCCA GAGCTGGCCA AATTAAGGAT GTTGGTGGAG
TGGTGTGGTG TCATCGTCAC CGATCGCAAA TCGAGTAAGT ACCTTGATAC GTTGCCAGTC
AGTGACTTAC GGGTCTTAGG CGAGTTCCAA ACCAAAAATG TTGAACAAGA CCTTAATCGG
TTGTTGCACG AGTCTCATGC TGGTGCCGCT TTACGTATGT CCTAAAGAAT GAGGTTCTGT
TATCTCAGCT AACCATGTGG CACAGCGGAG TTTGACCTCA AAATCGCCAT GTCAGCTCTA
TCAGCACTTA TCAATTATCT CTCACTTCTA TCCGACCTCT CCCTCCATGG TCAACTCCGA
TTATATCGTC ATGATCTTTC TCAGTACATG AAGCTTGACG CGTCCGCCCT CAAGGCTTTG
AACCTGATGC CAAATCCTCA AGAGCTGGGT GGTAACAAGA ATATGAGCAT ATATGGGTTG
TTAAACAGAT GCAAGACTAG TCAAGGGACA AGGTTGTTGG GAAGGTGGTT GAAACAGCCA
CTGGTGAATC GCCATGAGAT TAGTGAGTAA TAGATATACT GTATGTTTAC TTGAACATCC
ACTAACGAGA CTCTAGTTCA GAGACAGACT ATGGTTGAGG TTTTCGTTGA GGATTCTGTC
AATCGCCAAT CTATTCAAAC AAAGTACCTC AAGCAGATGC CTGACTTTCA CAGAATCTCG
AAAAAGTTCC ACAAACGAGT GGCTGGATTG GAAGACGTTG TCAGGGTGTA CCAAGCTGTG
CAGCTGGTGA GATCCAAACG GCCACAAAAG GGGATAGAAC ACTGATTTAA GATTATTAGC
TGCCTGGTTT GCAGGAAATT CTGGAAAATG CCGACACCCC AGAACCAGGA GCCAGGGATC
TTATTGAGGA AATTTGGCTC AAGCCTTTAC GCGTATGTAC TTGCTCATTC TACCATCAAG
TTCTTTTTTT GACCTCGATC CGTTAGGAAC ATATTGAAAA GCTTGGAAAT TATTCTTCCA
TGGTAGAAGA CACCATCGAT CTTGACGAAC TTGCTAATCA CAACTATGTG ATACTTCCTA
CTATCGATGA AGATCTTCAG AGATACAGAG AAGAGTTGTT AAACGTGCGA GATCAGCTTG
ATGATGAACA CAGGCGAGTT GGAAGTGATC TGGGTCTGGA TATCGACAAG AAGCTTCATT
TGGAGAATCA TCAAGTTTAC AAGTACTCCT TCAGGATTAC TAAGGCGGTA TGTCGACCTG
CGTTATATCA AAAAGCGCCC ATCGCTAACG AAACTGCTTT TACAGGAGGC TAGCCTCATT
CGTAACAAGA AGGAATATAT TGACCTCGCT ACCCAAAAAT CTGGTACCAT ATTCACCACT
AAAACCCTCA AGGCGCTGAG CGAGGAGTAC TTCAGACTGC AGGAGTTGTA CGAGAAGCAG
CAAAGGCACC TTGTCAAGGA GGTCGTCTCG ATCGCTTGTG AGTATTCGAA ATAACACTAA
GTGGCAGCCG CTGATTTAAA GAAGCCTCGT ACACACCGGT TTTGGAAATG CTGGATAACT
TGATTGCGGC TGTCGATGTC ATTGTCAGGT ATGTCTGTGC TTTGGCTAAA AGACGACAAG
GGAGCTCATA CAGTATGATT AGTATGGCTC ACGTCTCTTC TGAGGCTCCC ATTCCTTATG
TTAAACCCAT CTTGACTGAA AAAGGTACAT TCCGTTCCTT CAACCAAAAT CATGTCTGAC
AAGAGTATAG GTACCGGTGA CGTCGTTGTT CTAGGCGCCC GTCATCCTTG TCTTGAAGTC
CAAGACGATA TTGTCTTTAT CCCTAATGAC CATGAAATGC GCAAGGGTGA TTCCGAGTTT
ATCATCCTTA CCGGACCGAA CATGGGTGGT AAATCGACGT ACATCCGACA GATTGGTGTC
ATCGCCCTTA TGGCTCAGGT TGGATGCTTT GTGCCCGCCA CAGAAGCTCG GCTACCCATC
TTTGACTGTA TCCTTGCGAG GGTTGGTGCT GGGGACAACC AACTGAAAGG AGTCAGTACA
TTCATGGCCG AGATGTTGGA GACGGCGACC ATCTTGAGAG TAGGCACATA ACTTTCTAGT
GTAAATATTG ATTTTTGTGT GCTGATAATT CCGTAGTCTG CTACCAAAGA CTCTCTGATC
ATCATCGATG AGCTTGGTCG AGGTACATCT ACATACGATG GTTTTGGTCT TGCTTGGGCA
ATATCAGAGT GCGTTTTTCA GCTTCCCGCA AGAGCGTTTT GCTGACTGCG GATTCCAGAT
ACATTGCCGA AACGATTCAC TGCTTCTGTC TCTTCGCCAC CCATTTCCAC GAGCTTACCA
GCCTTTCTGA AAAGAATTCT CACGTGAAGA ACTTGCACGT TGAAGCCCTT GTCAAGGACA
AAGATGGAGA AGGGGGTGCG AAGGAAAGGG ACATTACGTT GCTGTACCAA GTCAAAGAAG
GTATCTGTGA TCAAAGTTTC GGTATCCATG TGGCCGAGTT GGCAAACTTC CCTGAGAGTG
TCGTCAAGGT GAATTCTCAT ATGTAGCCTT CCTTCAAGGG GTATATCGTT AACGTGTGAA
TGTAGCTCGC CAAGCGTAAA GCGGAAGAGT TGGAAGATTT TGGAGGTGAG ACTTTATACT
CGATAACCTA GGTATTCCGG ATGTTGACAT ATATCAACTG CTGCAGACGA CCAAACCCGA
GCCCCATCAT CCAAGTTTTC AAAGACGGAA ATCGATGCTG GTACAGACAT CGTCAAAGAG
TTCCTCGACA CTTGGAAATC TCGCGTCTCT GCCGCTGGAA GGGGAGGAGG GGCGGATGCT
GAGATGGCCA TGAGTGAAGA TGAGATGGTT CAGCTGTTGA AGGATACCGC GGAAGAATTC
AAGGACCGAT TGGAAGGGAA TGAGTGGGTG AAGAGCTTGA TGAGCACATT CTAGACGGTG
GTTGCCTCTC GCTGGGGGTG GAGGAGGGCA GAGAAAAATA ACGAAAATGC CCTGTCGCTG
CAGATGCAGG CTCTTTTCTT C
 
Protein sequence
MPMYGNESTS APKPLFDMDK DSEEKFVRFV ERMPTKLDGM IRLFDRGDYY SAHGADAIFI 
ANEVYRTTNV LKYLGSGSKP SSSSGQYARG LPSVTISMAL TKAFLREALT TKQMRVEIYA
PTGGVAPGSR KDHSKWEISK TASPGNLSQV EDLLFSDRDL TANAVSMAIR VVVKDGINTV
GVGFVDVQEK VVGVSEFVDD ENFSNTESLL IQLGVKECIL QADEKRPELA KLRMLVEWCG
VIVTDRKSSE FQTKNVEQDL NRLLHESHAG AALPEFDLKI AMSALSALIN YLSLLSDLSL
HGQLRLYRHD LSQYMKLDAS ALKALNLMPN PQELGGNKNM SIYGLLNRCK TSQGTRLLGR
WLKQPLVNRH EIIQRQTMVE VFVEDSVNRQ SIQTKYLKQM PDFHRISKKF HKRVAGLEDV
VRVYQAVQLL PGLQEILENA DTPEPGARDL IEEIWLKPLR EHIEKLGNYS SMVEDTIDLD
ELANHNYVIL PTIDEDLQRY REELLNVRDQ LDDEHRRVGS DLGLDIDKKL HLENHQVYKY
SFRITKAEAS LIRNKKEYID LATQKSGTIF TTKTLKALSE EYFRLQELYE KQQRHLVKEV
VSIASSYTPV LEMLDNLIAA VDVIVSMAHV SSEAPIPYVK PILTEKGTGD VVVLGARHPC
LEVQDDIVFI PNDHEMRKGD SEFIILTGPN MGGKSTYIRQ IGVIALMAQV GCFVPATEAR
LPIFDCILAR VGAGDNQLKG VSTFMAEMLE TATILRSATK DSLIIIDELG RGTSTYDGFG
LAWAISEYIA ETIHCFCLFA THFHELTSLS EKNSHVKNLH VEALVKDKDG EGGAKERDIT
LLYQVKEGIC DQSFGIHVAE LANFPESVVK LAKRKAEELE DFGDDQTRAP SSKFSKTEID
AGTDIVKEFL DTWKSRVSAA GRGGGADAEM AMSEDEMVQL LKDTAEEFKD RLEGNEWVKS
LMSTF