Gene CNK01700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNK01700 
Symbol 
ID3254532 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006680 
Strand
Start bp499164 
End bp503082 
Gene Length3919 bp 
Protein Length1057 aa 
Translation table 
GC content47% 
IMG OID638253660 
Productconserved hypothetical protein 
Protein accessionXP_567845 
Protein GI58260870 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTAACCACTG GCACGAACTC TTGACTCAAT GGTCCCAAAG ATACGCTGGG GGCTGTATCG 
CCATCTCCGA TGCTTGCCGG TCACTCAGCC TCGGGCAATT AGATCACATG CTTTGGTGAC
TACAGCTCGG TCATTTGACA AGGTGACAAC GGCGACCTCT GTCAAACCCA AACCTAAGCT
CATCAAGACA AAGGCTAGAC TATCAGATTT GCCAGCGACG AAGCTGGGAT CGAACGGCTT
GCCACAGAAG CCATTGGAAG CATGGGATGA AAGAGACGTG CCTCCCAACC GTTCGAAATC
CCCCAAAACG GTCAGGGCAG CCAAAGCGCG TAGCGATGGC GCCAGGAGCC CAGTAATTCC
CTGTTCAAAG GCAGAATTGG AACTTCTTCA GGATGCCATA CGACAGGGTG TTATCCCAGA
TACTCCTTTG GCTCATGAGG TATATCTCAA TTGGAAGAGG TTTCCAGATT GCATTTTGCT
CACGAGAGTG GGAAAGTTCT ATGAGGTAAG CTCGACATCT GATATGGCCT TGCTCCGGAT
CGATTGACTC CCAACGGTAT GTAATAGTCA TATTTTGAAC CCGCTCGGTA TTTGGCTTCC
ATCCTTTCGC TGTCCCTTGC AGAAAAGAAA TACGGCGCGA ACGAGGCTAA GAGGTCATAC
CCATTCGCTG GTTTTCCGGT ACATGCACTG GACAAATATC TCAAGATGCT TGTTCAAGAC
TTGGGACACA CCGTTGTGCT CGTGGAGGAG TATGATACAG AAGGAGCCGT TGCCCATACT
GGCAAAAAGC TCACCGCAGG CTCCGGACCA AAGGAACGTA GGGTGTACAG AGTCGTTACG
CCGGGGACTA TGGTCGATGA GTCGTGGGTG GATGCAGATC AGAGTCGATA CTTGCTCGCC
ATCGCTGTAG GCAATGAGGG CCAGAATGGT CAGGAACTAT CTCTCGCTTA CACAGATGCT
TCCACTGGAG AATTTTTCAC TAAGGATACG ACTGTCTCAC AGATGGAAGA TGAGCTCGCT
CGAATTACTC CCCGTGAAGT TGTTCTAGAT AATTCGCTCT ACGAACTGTG GCGAGAACAT
TATAACTATT CGGAAATGAA GCGGAAAGAT GCCTCTCAGG TTGAAGAGCT GCTCGCGCTA
CTTCGAGTAC TAGGCGTCAG AGTGTCCTTT GCCGACCCCT GTCGTCCGCC TCCACTCTAC
ACATCTGCCT CATCACCAAC TTTACACCCT ACCACACCGG AAGAGAATGC TGCTGCGCTC
CTCCAACACC ATCTCCAGTA TGCTTTGCGA GAGTCGATGC CAGCACTTCG TCGACCTCAC
AAGCAATCCA ATTCGGCCTT CATGCAAATC GATGCTGCAA CCCTCCAAGC TCTCGAGATA
CGCCATGCTT TTAGGCCAGG GGGATTAATT GCTACGGGCG AGACACAAAC TAATTCATCT
CCCTTGTCTG CCAAGGGTAC GCTTCTCTCA GTGGTATCCA AGACTATAAC ATCTTCCGGC
CATCGACTAC TTATACGCAC TCTCACAGCT CCTTCGACAT CTCCCCATAT AATCAACTCC
CGCTTGGCGC TGGTACAAGC CTTTATGGAT AGAGAGGATC TTAAAACTGA ACTTCGGCAC
GAGCTGAAAG AGCTGGGGGA TATCATGCGG ATCATTCAGC GGTTTAGAGG CCAAAGAGGC
ACTGGACGTG ATATATGGGA CGTTGGAAGG TGGATAAGGG GTGCTCAGAG AATATTGGAG
ACTATCAAGG AAGAGATCAA AATCGAAGTT GGCCGAAATA ACGAGAAAGC GATACGGAAA
TCGGAAGGCA TCACAAGGCT GCAGGAATTC GTGGACTCGT TTCGTGATCT TGACAGAATC
GCCTCCAAAA TCGAATCCTC TGTGGAAGAA TCTGCCATCA TGTTCAGATC CGGCGATGAT
AAGAGTATCA TTGATGAGCA AGAAGCTGGT GATGCGCTGC TTACAAGTCA GGCATCATCT
AAAGAAAGTG AAGCAGACGA AAAGCAGAGG ATCAAGCGGG AGAGGGATGA GAGAGAGATG
TCAGAGTGGT GGATTCGTCC TCAGTAAGGC CAAAAAAGTT GTTTCAAATG CAGACAAATA
TCGTAATAAC GATTGCTGTT AGGTTCTCGC CTGCGCTCCA ACTTCGACAT GACGAACTGA
GTGCTTTGAA AGCCGAAGCC CAAAAGTTAC AAGCAAGCCT AATCAAGAAG TATGGTGAGT
ATTTCCAAGA ATTGCTCCAG GGTTATCTAG ATATATTACA TGTTAACGTC TTTCAGATAC
GCCTACTCTG ACTATTGAGA AGAACCACAG ATTCAGCTAT CACATTCAAA TGTCAGCAAA
AGATGCTGAA AAGGTGGCGA AAGCGAGATC TCTGGAGCGT ATAGGAAGCA TGACTGGTAA
AACAGCATAC TTTGCCTACG CGGTGAGTAA TACCTTAATC CGAATGATGT TGGACTGATA
TTCCTCTTAG CCGCTTGCGG AACTGGGTAC GAGAATTGAG ATCATGATGG AATATCTAGG
CGCTGCTCAA AGGCGAGCTG CTCGGGAACT TCAAAATATG GTATCTCTAT CCCACTCTCA
TTACTTTTGT TTTGAACTAA CAGAGCCCAA TAGGTGGTAG AGCAATCGGA CGCGATCCAG
CAAAATTCCG AATTAGTTGA TGAGCTGGAT TTGAGCCTGA GCTTTGCTCA GAATGCAGTT
GAGATGAACT GGGTCAGGCC AATACTGGAC AATTCGTAAG TGAATTTCAT AGGCCATCGA
CAACAATACT CAAATCGGAT TTTAGTACGG AGCTACAGAT CATTAATGGT AGACATCCTT
CCGTTGAATC TTCTTTGCTG TCTGCTTCTC GCAATTTCAC CCCAAACAGT ACTCACATGG
CTTCTGATAC CCATTTGCAT GTCATCACCG GCCCCAACCA GGGAGGAAAG TCGACCCTTC
TTCGACAAAC TGCAGTCATA GCAATACTTG CCCAGAGCGG AAGCTTCGTT CCTGCTGAAT
ACGTGAAGAT GGGAATCGTG GATCGGGTTT TCAGCAGAGT GGGAGCTAGA GATGATCTAT
GGAGGGATCG GAGCACGTTT ATGCTTGAAA TGGTTGAGTA AGTGTTTATT TAAATTTGAG
CCTTGGCGCT GGAGCTTGTT GATCATGTGT CGAAATAGAA CTGCAGGGAT CTTGCGCCAT
GCCACTGAGA GATCCCTCGT TATCATGGAT GAGTACGCGA TTCTTAGGAT GGTATTCGTA
CTTCTACTGA AAATGATGGT GCAGGATTGG ACGCGGTACT ACGTTACAGG CAGGTGTCTC
AATAGCGTAT GCCACACTTG ACTATATCCT CGAGAACATC AAGTGCCGGA CATTGTTCGC
CACTCATTAT CACGAACTGG GACAAATGCT AGGATACGAT CCAAAAAGGG CTGAAGGAGA
GGTGATAAAG GGAAGAAGTG GGATTGCTTT TTGGTGCACG GATGTAAATG AGGCGGTAAG
TAGTATTTCA ATCTTCAAAA CGGGCTTGCC TCTGATGTTG AAACAGGATG GCGCTTTTTC
TTATTCTTAC AAGCTACGAC CGGGTATAAA TTACGATTCT CATGCTATTG TAAGGCTAGC
GGTCAGGCTG GAAGAAACTT CTGCTGATAA ATCTTTTACA GAAAGCTGCC AGTATCGCCG
GCATGCCAGA ATCTTTTCTA CGTGTAGCCG AGTCGACGCT CGTAACCCTC CAATCAAAAT
CCAATCTTAT TACATTGCCA TCATCTCATT AGGATATTTA TCTACCTTCC CATGTTTTGC
CACGAATTGC AACGACTTCA AATTGTATAT ATAGATAAAT CATCAAATTC ATAGCATAGC
ATAGCATACT ATCATAATCA TTACGAGACC GTCATCGGCT ACTTAATCAT ACATTATGCA
TTACTTGCAG TCTGTTATT
 
Protein sequence
MVPKIRWGLY RHLRCLPVTQ PRAIRSHALV TTARSFDKVT TATSVKPKPK LIKTKARLSD 
LPATKLGSNG LPQKPLEAWD ERDVPPNRSK SPKTVRAAKA RSDGARSPVI PCSKAELELL
QDAIRQGVIP DTPLAHEVYL NWKRFPDCIL LTRVGKFYES YFEPARYLAS ILSLSLAEKK
YGANEAKRSY PFAGFPVHAL DKYLKMLVQD LGHTVVLVEE YDTEGAVAHT GKKLTAGSGP
KERRVYRVVT PGTMVDESWV DADQSRYLLA IAVGNEGQNG QELSLAYTDA STGEFFTKDT
TVSQMEDELA RITPREVVLD NSLYELWREH YNYSEMKRKD ASQVEELLAL LRVLGVRVSF
ADPCRPPPLY TSASSPTLHP TTPEENAAAL LQHHLQYALR ESMPALRRPH KQSNSAFMQI
DAATLQALEI RHAFRPGGLI ATGETQTNSS PLSAKGTLLS VVSKTITSSG HRLLIRTLTA
PSTSPHIINS RLALVQAFMD REDLKTELRH ELKELGDIMR IIQRFRGQRG TGRDIWDVGR
WIRGAQRILE TIKEEIKIEV GRNNEKAIRK SEGITRLQEF VDSFRDLDRI ASKIESSVEE
SAIMFRSGDD KSIIDEQEAG DALLTSQASS KESEADEKQR IKRERDEREM SEWWIRPQFS
PALQLRHDEL SALKAEAQKL QASLIKKYDT PTLTIEKNHR FSYHIQMSAK DAEKVAKARS
LERIGSMTGK TAYFAYAPLA ELGTRIEIMM EYLGAAQRRA ARELQNMVVE QSDAIQQNSE
LVDELDLSLS FAQNAVEMNW VRPILDNSTE LQIINGRHPS VESSLLSASR NFTPNSTHMA
SDTHLHVITG PNQGGKSTLL RQTAVIAILA QSGSFVPAEY VKMGIVDRVF SRVGARDDLW
RDRSTFMLEM VETAGILRHA TERSLVIMDE IGRGTTLQAG VSIAYATLDY ILENIKCRTL
FATHYHELGQ MLGYDPKRAE GEVIKGRSGI AFWCTDVNEA DGAFSYSYKL RPGINYDSHA
IKAASIAGMP ESFLRVAEST LVTLQSKSNL ITLPSSH