Gene Information |
Locus tag | CNL04240 |
Symbol | |
ID | 3254837 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 178595 |
End bp | 183221 |
Gene Length | 4627 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 42% |
IMG OID | 638253895 |
Product | mismatch repair-related protein, putative |
Protein accession | XP_567976 |
Protein GI | 58261132 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
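The coordinates and lengths listed above can be cross-checked with a short calculation. This is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention) and that the coding sequence ends in a single stop codon. The function names `gene_length` and `coding_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon (an assumption)."""
    return (protein_aa + 1) * 3


# Values taken from the Gene Information rows above.
start_bp, end_bp, protein_aa = 178595, 183221, 519

span = gene_length(start_bp, end_bp)
cds = coding_length(protein_aa)

print(span)        # 4627, matches "Gene Length"
print(cds)         # 1560 coding bases implied by "Protein Length"
print(span - cds)  # 3067 bp of the span that is not coding
```

Under these assumptions, about 3067 bp of the 4627 bp span is non-coding, which is in line with the "Is gene spliced | Yes" entry.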
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCCTCGTT CTTCCAACCT ACGTTTACGT CTTTCCATCC TGACTATATC GGACAAGAAC CATAAGATTG TCACTGACCT ATTCAATCAT ATAACCAGAA TGCCTGTCCC CGCTCAACTG CGCACAATAG CTCCAGAGAA AATCAAGGCA GCCGTCTCTC CTAGCGCGAT CTCAGCGTTT ACACAGACAC GTCATCGTCC GGGGGCTGGC TCTGCTAATA ATAAACTTCA AGCCACAACA AGGCCTTCTA CAAGAGCTTC TGCTCGCATT GAAGATGTCT CTTCCTATGT TGTTGCTTTG CTTCAAGGCA AAGGTAGACT TTCTTGACCC ATTTTACTCA CTTTATATCG CTAAGCCACG CTGTTCAGGT CATGGAGTTG AAATTGGTAT AGCTGCAATA AGTTTATTGA CCGGTCAGGT GAGCTGTGCA GGTATTTTGT GATACAAGTC TAATACCAGT AAAACAGACT GTCATCACTC AAGTGAGATG TTCCGCTACT GCAATCTTCA TATGAGTTTG CGAACTTGAT TGCTGACTGA ATTGATAGGT TGCTGACAAT GCCAGTCAGT CTACCACTAT GGAAGGTCTG AAATAATACC ATGCCGTAGC TGATCTTTTC AGGCTTTGAA AAAACTGTAC AGCAGCTTTA TTCCCATCCA CCTAACACAA TCATTGTTCC AGATACCATG TTAAAGAGTG GAGCTGCGGA AGCCAGTTAC GGTGACCAGA GGGATATCAA CGTAAAGGAG CACCTCATGA GTAGGCTAGA AGACGAATAT GAGATTGAAT GTATTGGTGT GGACCGCAGT CACTGGAATC GAGAAACTGG TATGTCACTG TAATGTCGTT CTTTTCACCA GAGGCTGGTC CTCCTTCTTA TCCACATTTT TCAGGCCGTG ATTTTGTCGA GGACTTGGCA GTTGATGACG AGCTCAAAGC ATCTATCCTT ATGGCTATTG AAAACAAGTG ATGTATCAAA AGGCACTCAT GCTCGCTGAA GACTGATGAC CCAATAGATT CTATGCCCTA TGTGCTCTTT CTGGATTGTT CAGATATTTA CAGGTATGGA GGAACATTGA ATTTCCGGAG CGCAGTCTGA GGATGAAATA TGTGGTCTCT GAAGGTACGG TGCTATCTTC TGCTCACAAC AAGGCGACTT CTTGTGCTCA TCATGTTTAC CACAATGCTA TGTAGGGACG ATGTTCATTG ATATTGAAAC TGCCAAAAAT CTGGAGCTCG TGCGCAATAA TCTCACAAAT AAGGCCACCC ATACGTTGTA CTGTGAGTAA AAGTTTACGT CTATGGGTAT CCAGTCCAAG ATTAATGTAC TCTGACGTTA CCGGTAGCTG TTTTGAACCA TTGTCATACT CCTATGGGTA TGCGACTTCT TAGGACATGC ATACTCCAAC CAAGCAATGG TACAGTTGTG ATGTTTCACT AAAATCTTCC GCTCATGTCT AGCATCTGAA AGTTCTTAGT GATATAGAAG GAAGACTTGA TGCTGTTCAG GGTGTGTGGG CCAGCAACTG TGTCAGAAAT ATGCAATATG CTAATGAGCA ACAGAACTTG TGACAGCGCA AGAGAAGCTC ACAGTTTTGA GGTCAAAGTT GTCTGTTATG TCTAAGGTGT GTACTAGCAT ACAATGGTCA TCATCATACC TCATTATTTG CGAGCTGATG AATTGATTAG ATGGATTTGG AGTCCATTGT TGCTCAGGCA AGTCGATGGG CTTGGTGTTT AATATCACTT GGCTTTCATG GTTAATCTGA GTGTTAGATT TCACATCAAA ATCTGTCACA 
AACAGATGTG GCTATGACTG ATCGCAGGAT ATCTCTGCTG CTCAACCTTG TGGACTACTT ACAGAGCGTC AAATCAATCA ATGCTGAGCT TGCAGGAGGA CACTGTAAAT TGCTTGGCAT GATTGCTAGG GTATGTGACA ATTATCTATT GCCTTGGGTT AAACTAAGAA GTCTTTATTC TTTGATAAGC GCCTTTCCGA TAAACAACTT GGCAAGTTAT CCAGCATTAT CAACAGTGAG TGAAATAAGA CATCTCTGCC TGCAATACTT AGAGTGAATC AGACTGTCTT TCCAGAGATT CGACAACGTT GAGGAAAAAT GGAAAACACC AAAATGCCAG AATCGCTCGG TTGTTTGCTG TGAAAGCAGG TTTTGCACCT CTGCTGGATG TTGCTAGACA GACCTACCAG GAGAATCTTC AGGATATCTA TGACTGTAAG TCTAAAACAA ACAAGCCCAT ACATACATGG TACCTCTCTC ACCCCTCTAT ATAGTGGAAT TAGAAGTCAA TGGTGAGTGT CGTGACTGTT ACGACCACTA AAATAGTCAT AAAGGAAAAT ATGGCTTGAG CTGCCAAGTT GAGAGTGTGG GAGGTACATT TCAATTTAGC ATACCGTCAG GACAGCCTGA AAACTTGCTA CCATCTGAAT TTATTGGTAC TGAAAAGGCT AAGCACAAGT GCATCACTAG TAGGAATGGA TACACATGAG TTGTAACTGA TTTGATCTTG CCAGAATTCG GTTCACAAGT CAAGAGCTTG TATGGTTACA TTCATGGATG GACCCATTTT TGACCATGCG CCTGTAGCTC AAACGGTGTG CGAAACTGTC CCAATCTCAT CAAGAGGTTC TGTTGATCAG TGGACAAACC ATCAATGATC TTATCGCGCA AGTGAAGGGA AACTTGGGTG GGTGCTGTGT TGCATCAACC TGCTGCTGAC AAACATATAA GGTGGACTAT ATCATTGTGC TGAAGCTGTA TATTGCTGCC TATAACATCT CAGGTCTCAA ATGTTTTTTT TATCTAATAT AATACATAGA TTGCAAGTCT TGACATGATC GCAAGCTTTG CCTTTAGTGC CTTCAGTAAG CTGACATGTT GACAATGAGT ATGATGCTGA TCCTCTCTCT AGACAGCAAC TACAGTGAGT GACTAGTGAT TGTAAATCAT CTACTAGATG AATGACGAGC TAAGTGGCAT CAGTTCGTCC AGATTTCAAG GATACTCTAG CGGTATATAA GACTCTGTAT CATCCCTGAC CAAAACTGAT CAGAAGGACA TTTGTAGATT CATGGTGGCC GCCATCCAAT TTTGGACAAT TTACTTGGTG CTGGTGATTG TATACCAAAC AACGTGTGAG TATTTAAGCA GTTGGTTTCT TTTTTTTACT GATCTCACAT AACAACTTAG TTATGCTGCA AGAGGTTCTG CCACTTTTCA AATCATTCAA GGTCCTAAGT TTGTCACTTT CTTTTTTTCT TAATCATTAC TTTCTGATGG TGCTTTAGTA TGTCGGGCAA AAGCACTTAT CTAAAGCAGG TAGGGCTTTT GACTGTACAG GCTATGATTG GGTGCTTGTG AGTTGGAGAC ACTAAAACCA TATCAAGCAG TTGAGATGAC TAGTCCTCTC TTCTCTCAGT GTACCTGCGG AATATGCATG CTTTACAATT CATGATGCTC TTTTGAGTCG TTTATCAAAT GATGGTAAGG AACAGGCGAA TGTACACGAT GACTTCATCA GCTCAAAGTT GTAGATTCAA TGGAGAAATG TCTATCAACC TTTGCCTCGG AAATGGCTGC ATCAGCCATG 
ATACTGGGTG GGTCACAGTT CATTAACAGA CTGCTCATAG TTAGGACTCG CAAGTCCTAG GTCACTTGTT CTCATAGATG AAGTGAGTGG TGTACTGGCC ATGGACTATG TCATTGACAA CATTCCCAAG TTGGGGAGGG GCACTTCAAG TCTGGAAGGC ATGGGATTAT CCTATGCCAT AGCTGAATCG CTCATTAGGA GGCAAGTAAG TCAGTCTATT TGACAGACTT TTTCGCTTGT TGAAAATGCA GCAGTCTTTC GTCTTTTTCG CAACTCATTT CCAAGACCTT GCAGTAATTC TTGGGAATCT GTCTGGAGTT GTAAAGTCAG TCAAGACTGA AATATTCATC TTTTAGATAC TGATGAACAT GTCACAAGAC TACACCTAAA GGTCCAGGTA ACAAGCAGTA TTCATTCTAC ATGGGGTCCA CAATTACTTA TCTGAGCTAG CATTGCAATA CAGAAACCCT TTCACCAACA GAATTTGGCA GTACTTTTGC CTACAAAGTT GCTGAGGGTG CAGCTCCAAT GGTGCACTAT GGTTGGTGCT TTGTTTGGAA CATATAGGGA ATTTTGTCAT CTAGCTAATA AACTGCAGGC CTCGAGATAG CTAAACTAGC CGCCCTTCCT TCAACGGTTT TGCAAAGAGC ATCAGAGATT GCAACTCAAC TGAGTGAATT GGAAGAACAG GGTAAGAATG TGATCTTTAT ATGAATTTGA TACCAATCTG CATATATAGG TAGGCGGTCC AACTTAGCCA ACACTAGCAT GCGACGCCGA AAGATACTGT GGGAGGTAAG GTGTAAACAA TTCTTCCATG GGCAGGATCG CTCAACGATA TCTAGTTGAG GGCCAAACTG AAGCAGGTGC AAAGTAATTC AAGGCTTGAT AATGAGACAT TGGGAGAATT TCTTCTGAAT CTTCAAGCTC AATGTTATCT GACTCTTCGT CAATCCCTAG AGATGGTTCA AACAACAACC AGTGGTAGTG AAACAGGGCA TTAACAATCA GCTCATC
|
Protein sequence | MPVPAQLRTI APEKIKAAVS PSAISAFTQT RHRPGAGSAN NKLQATTRPS TRASARIEDV SSYVVALLQG KGHGVEIGIA AISLLTGQTV ITQVADNASF EKTVQQLYSH PPNTIIVPDT MLKSGAAEAS YGDQRDINVK EHLMSRLEDE YEIECIGVDR SHWNRETGRD FVEDLAVDDE LKASILMAIE NKFYALCALS GLFRYLQVWR NIEFPERSLR MKYVVSEGTM FIDIETAKNL ELVRNNLTNK ATHTMSGKST YLKQVGLLTV QAMIGCFVPA EYACFTIHDA LLSRLSNDDS MEKCLSTFAS EMAASAMILG LASPRSLVLI DELGRGTSSL EGMGLSYAIA ESLIRRQSFV FFATHFQDLA VILGNLSGVV KLHLKVQHCN TETLSPTEFG STFAYKVAEG AAPMVHYGLE IAKLAALPST VLQRASEIAT QLSELEEQGR RSNLANTSMR RRKILWELRA KLKQVQSNSR LDNETLGEFL LNLQAQCYLT LRQSLEMVQT TTSGSETGH