Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK01700 |
Symbol | |
ID | 3254532 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 499164 |
End bp | 503082 |
Gene Length | 3919 bp |
Protein Length | 1057 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253660 |
Product | conserved hypothetical protein |
Protein accession | XP_567845 |
Protein GI | 58260870 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTAACCACTG GCACGAACTC TTGACTCAAT GGTCCCAAAG ATACGCTGGG GGCTGTATCG CCATCTCCGA TGCTTGCCGG TCACTCAGCC TCGGGCAATT AGATCACATG CTTTGGTGAC TACAGCTCGG TCATTTGACA AGGTGACAAC GGCGACCTCT GTCAAACCCA AACCTAAGCT CATCAAGACA AAGGCTAGAC TATCAGATTT GCCAGCGACG AAGCTGGGAT CGAACGGCTT GCCACAGAAG CCATTGGAAG CATGGGATGA AAGAGACGTG CCTCCCAACC GTTCGAAATC CCCCAAAACG GTCAGGGCAG CCAAAGCGCG TAGCGATGGC GCCAGGAGCC CAGTAATTCC CTGTTCAAAG GCAGAATTGG AACTTCTTCA GGATGCCATA CGACAGGGTG TTATCCCAGA TACTCCTTTG GCTCATGAGG TATATCTCAA TTGGAAGAGG TTTCCAGATT GCATTTTGCT CACGAGAGTG GGAAAGTTCT ATGAGGTAAG CTCGACATCT GATATGGCCT TGCTCCGGAT CGATTGACTC CCAACGGTAT GTAATAGTCA TATTTTGAAC CCGCTCGGTA TTTGGCTTCC ATCCTTTCGC TGTCCCTTGC AGAAAAGAAA TACGGCGCGA ACGAGGCTAA GAGGTCATAC CCATTCGCTG GTTTTCCGGT ACATGCACTG GACAAATATC TCAAGATGCT TGTTCAAGAC TTGGGACACA CCGTTGTGCT CGTGGAGGAG TATGATACAG AAGGAGCCGT TGCCCATACT GGCAAAAAGC TCACCGCAGG CTCCGGACCA AAGGAACGTA GGGTGTACAG AGTCGTTACG CCGGGGACTA TGGTCGATGA GTCGTGGGTG GATGCAGATC AGAGTCGATA CTTGCTCGCC ATCGCTGTAG GCAATGAGGG CCAGAATGGT CAGGAACTAT CTCTCGCTTA CACAGATGCT TCCACTGGAG AATTTTTCAC TAAGGATACG ACTGTCTCAC AGATGGAAGA TGAGCTCGCT CGAATTACTC CCCGTGAAGT TGTTCTAGAT AATTCGCTCT ACGAACTGTG GCGAGAACAT TATAACTATT CGGAAATGAA GCGGAAAGAT GCCTCTCAGG TTGAAGAGCT GCTCGCGCTA CTTCGAGTAC TAGGCGTCAG AGTGTCCTTT GCCGACCCCT GTCGTCCGCC TCCACTCTAC ACATCTGCCT CATCACCAAC TTTACACCCT ACCACACCGG AAGAGAATGC TGCTGCGCTC CTCCAACACC ATCTCCAGTA TGCTTTGCGA GAGTCGATGC CAGCACTTCG TCGACCTCAC AAGCAATCCA ATTCGGCCTT CATGCAAATC GATGCTGCAA CCCTCCAAGC TCTCGAGATA CGCCATGCTT TTAGGCCAGG GGGATTAATT GCTACGGGCG AGACACAAAC TAATTCATCT CCCTTGTCTG CCAAGGGTAC GCTTCTCTCA GTGGTATCCA AGACTATAAC ATCTTCCGGC CATCGACTAC TTATACGCAC TCTCACAGCT CCTTCGACAT CTCCCCATAT AATCAACTCC CGCTTGGCGC TGGTACAAGC CTTTATGGAT AGAGAGGATC TTAAAACTGA ACTTCGGCAC GAGCTGAAAG AGCTGGGGGA TATCATGCGG ATCATTCAGC GGTTTAGAGG CCAAAGAGGC ACTGGACGTG ATATATGGGA CGTTGGAAGG TGGATAAGGG GTGCTCAGAG AATATTGGAG ACTATCAAGG AAGAGATCAA AATCGAAGTT GGCCGAAATA ACGAGAAAGC GATACGGAAA TCGGAAGGCA TCACAAGGCT GCAGGAATTC GTGGACTCGT TTCGTGATCT TGACAGAATC GCCTCCAAAA TCGAATCCTC TGTGGAAGAA TCTGCCATCA TGTTCAGATC CGGCGATGAT AAGAGTATCA TTGATGAGCA AGAAGCTGGT GATGCGCTGC TTACAAGTCA GGCATCATCT AAAGAAAGTG AAGCAGACGA AAAGCAGAGG ATCAAGCGGG AGAGGGATGA GAGAGAGATG TCAGAGTGGT GGATTCGTCC TCAGTAAGGC CAAAAAAGTT GTTTCAAATG CAGACAAATA TCGTAATAAC GATTGCTGTT AGGTTCTCGC CTGCGCTCCA ACTTCGACAT GACGAACTGA GTGCTTTGAA AGCCGAAGCC CAAAAGTTAC AAGCAAGCCT AATCAAGAAG TATGGTGAGT ATTTCCAAGA ATTGCTCCAG GGTTATCTAG ATATATTACA TGTTAACGTC TTTCAGATAC GCCTACTCTG ACTATTGAGA AGAACCACAG ATTCAGCTAT CACATTCAAA TGTCAGCAAA AGATGCTGAA AAGGTGGCGA AAGCGAGATC TCTGGAGCGT ATAGGAAGCA TGACTGGTAA AACAGCATAC TTTGCCTACG CGGTGAGTAA TACCTTAATC CGAATGATGT TGGACTGATA TTCCTCTTAG CCGCTTGCGG AACTGGGTAC GAGAATTGAG ATCATGATGG AATATCTAGG CGCTGCTCAA AGGCGAGCTG CTCGGGAACT TCAAAATATG GTATCTCTAT CCCACTCTCA TTACTTTTGT TTTGAACTAA CAGAGCCCAA TAGGTGGTAG AGCAATCGGA CGCGATCCAG CAAAATTCCG AATTAGTTGA TGAGCTGGAT TTGAGCCTGA GCTTTGCTCA GAATGCAGTT GAGATGAACT GGGTCAGGCC AATACTGGAC AATTCGTAAG TGAATTTCAT AGGCCATCGA CAACAATACT CAAATCGGAT TTTAGTACGG AGCTACAGAT CATTAATGGT AGACATCCTT CCGTTGAATC TTCTTTGCTG TCTGCTTCTC GCAATTTCAC CCCAAACAGT ACTCACATGG CTTCTGATAC CCATTTGCAT GTCATCACCG GCCCCAACCA GGGAGGAAAG TCGACCCTTC TTCGACAAAC TGCAGTCATA GCAATACTTG CCCAGAGCGG AAGCTTCGTT CCTGCTGAAT ACGTGAAGAT GGGAATCGTG GATCGGGTTT TCAGCAGAGT GGGAGCTAGA GATGATCTAT GGAGGGATCG GAGCACGTTT ATGCTTGAAA TGGTTGAGTA AGTGTTTATT TAAATTTGAG CCTTGGCGCT GGAGCTTGTT GATCATGTGT CGAAATAGAA CTGCAGGGAT CTTGCGCCAT GCCACTGAGA GATCCCTCGT TATCATGGAT GAGTACGCGA TTCTTAGGAT GGTATTCGTA CTTCTACTGA AAATGATGGT GCAGGATTGG ACGCGGTACT ACGTTACAGG CAGGTGTCTC AATAGCGTAT GCCACACTTG ACTATATCCT CGAGAACATC AAGTGCCGGA CATTGTTCGC CACTCATTAT CACGAACTGG GACAAATGCT AGGATACGAT CCAAAAAGGG CTGAAGGAGA GGTGATAAAG GGAAGAAGTG GGATTGCTTT TTGGTGCACG GATGTAAATG AGGCGGTAAG TAGTATTTCA ATCTTCAAAA CGGGCTTGCC TCTGATGTTG AAACAGGATG GCGCTTTTTC TTATTCTTAC AAGCTACGAC CGGGTATAAA TTACGATTCT CATGCTATTG TAAGGCTAGC GGTCAGGCTG GAAGAAACTT CTGCTGATAA ATCTTTTACA GAAAGCTGCC AGTATCGCCG GCATGCCAGA ATCTTTTCTA CGTGTAGCCG AGTCGACGCT CGTAACCCTC CAATCAAAAT CCAATCTTAT TACATTGCCA TCATCTCATT AGGATATTTA TCTACCTTCC CATGTTTTGC CACGAATTGC AACGACTTCA AATTGTATAT ATAGATAAAT CATCAAATTC ATAGCATAGC ATAGCATACT ATCATAATCA TTACGAGACC GTCATCGGCT ACTTAATCAT ACATTATGCA TTACTTGCAG TCTGTTATT
|
Protein sequence | MVPKIRWGLY RHLRCLPVTQ PRAIRSHALV TTARSFDKVT TATSVKPKPK LIKTKARLSD LPATKLGSNG LPQKPLEAWD ERDVPPNRSK SPKTVRAAKA RSDGARSPVI PCSKAELELL QDAIRQGVIP DTPLAHEVYL NWKRFPDCIL LTRVGKFYES YFEPARYLAS ILSLSLAEKK YGANEAKRSY PFAGFPVHAL DKYLKMLVQD LGHTVVLVEE YDTEGAVAHT GKKLTAGSGP KERRVYRVVT PGTMVDESWV DADQSRYLLA IAVGNEGQNG QELSLAYTDA STGEFFTKDT TVSQMEDELA RITPREVVLD NSLYELWREH YNYSEMKRKD ASQVEELLAL LRVLGVRVSF ADPCRPPPLY TSASSPTLHP TTPEENAAAL LQHHLQYALR ESMPALRRPH KQSNSAFMQI DAATLQALEI RHAFRPGGLI ATGETQTNSS PLSAKGTLLS VVSKTITSSG HRLLIRTLTA PSTSPHIINS RLALVQAFMD REDLKTELRH ELKELGDIMR IIQRFRGQRG TGRDIWDVGR WIRGAQRILE TIKEEIKIEV GRNNEKAIRK SEGITRLQEF VDSFRDLDRI ASKIESSVEE SAIMFRSGDD KSIIDEQEAG DALLTSQASS KESEADEKQR IKRERDEREM SEWWIRPQFS PALQLRHDEL SALKAEAQKL QASLIKKYDT PTLTIEKNHR FSYHIQMSAK DAEKVAKARS LERIGSMTGK TAYFAYAPLA ELGTRIEIMM EYLGAAQRRA ARELQNMVVE QSDAIQQNSE LVDELDLSLS FAQNAVEMNW VRPILDNSTE LQIINGRHPS VESSLLSASR NFTPNSTHMA SDTHLHVITG PNQGGKSTLL RQTAVIAILA QSGSFVPAEY VKMGIVDRVF SRVGARDDLW RDRSTFMLEM VETAGILRHA TERSLVIMDE IGRGTTLQAG VSIAYATLDY ILENIKCRTL FATHYHELGQ MLGYDPKRAE GEVIKGRSGI AFWCTDVNEA DGAFSYSYKL RPGINYDSHA IKAASIAGMP ESFLRVAEST LVTLQSKSNL ITLPSSH
|
| |