Gene CNE04630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE04630 
Symbol 
ID3257720 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp1293709 
End bp1296696 
Gene Length2988 bp 
Protein Length765 aa 
Translation table 
GC content48% 
IMG OID638257047 
ProductDNA binding protein, putative 
Protein accessionXP_571158 
Protein GI58268004 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACGCAACAT GCCATGTCAG AGGATTCTAT GGACATCGAT CTACGGCCAG AAGTAGAAGA 
AATTGAACCC GAGGGCCCGA AACCCATACA TAGGCTTACA AAAGATGTCA TCAACCAGAT
CGCTGCTGCC GAGGTGAGGC CAAGACGCGT CGGTAATGTT GCTCGAGCTG ACGCGTACAA
TACTACTGCA GATTATTCAT CGACCGTCAA ATGCTATCAA GGAGCTCCTT GAAAACTCTC
TAGATGCAGG CTCTACATCT ATCAAGATTT CAGTCAAGGA TGGAGGTCTG AAGCTCTTGC
AAATCACCGA TAACGGTCAT GGCATCAACA AAGATGACTT GCCCCTTCTT TGCGAGCGCT
ACGCGACTTC AAAGCTGCAA AAGTTTGAGG ATCTCCAGTC GTTAGGGACA TATGGCTTTA
GAGGCGAAGC TCTTGCAAGT ATAAGTTACT GCAGTCACGT CGAAGTTGTT ACGAAGACCA
AAAACGAGGG GTGTGGCTGG AAGTGAGTGG TAAATCTGGA CAGTACGCGT AGTAACCAAC
TGATGGTCCT GCCAGAGCTC ACTATCAAGA TGGCAGCTTG ATCCCAGCAA AGCCTGGAGG
TACCGCAGAC CCGAAGCCAG CGGCTGCCAA TGACGGAACG GTCATCACGG TAAGCTTCCC
ACCCTAAGGG CGCAAAAAGG TGAAGCTCAT ACACCGTGGA CGCTAGGCTG CAGACCTCTT
TTACAACATG CCACTTCGTA AGCGGGCATT CAAGTCAACC TCAGACGAAT ATAACCGTAT
TATCGACGTG GTCACCAAAT ATGCCATTCA TAATCCTCAC GTTGCGTGGG TATGCAAAAA
GGCCGGCACT GCTTTACCTG ATGTTGCCAC CCAGGTCGGT TCGAATACCA AGGCGAATAT
CGCGGCACTC TACACATCCG CACTGGCCAA TGAGTTGCTA GAAATACCAG AGTCTGAACT
GCAGCCTGCT AGGCTAGGTG CAAAGCTAAA AGGCTGGGTG AGTAATGCGA ATAGTAGCTG
GTCGAAGAAG GGGGGGTGGT TACTTTTCAT TAATAGTGAG TTATTTATCA TGTACGCCTT
GAAAGATGCT CATTCACTTC ACAATTCAGA TAGGCTAGTC GATTCGAACA AGTTGAAGAA
AGCTGTAGAA GGCCACTACA CCTCGTACCT CCCAAAAGGT GCTTCGCCCT GGGCATATCT
CAGGTACGTG TTTTTTGACG AATGTCATGG ACAGTACTAA TCGTATGACA TTAGTCTGCA
AATTGACCCC GCAAAAATTG ACGTGAATGT ACATCCCACA AAGTCAGAGG TCCGTTTTCT
CAATGAAGAT GAAATTGTCG ACGCTGTCGT GCAAGCCGTT CAAACCGCTC TAGAAGGTGC
CAACCTCTCG CGTTCTTTCA CCGTTCAAGT AATTCTTTTC CCCTCACTTC ATTCTCAACA
ATATGGCTTA CATATCTGCC ATAGACTCTG CTTCCTGGTG CCCCTACACC TTTAGGAAAA
CGTGAAAGTT CAAATTCCAC TATAGCATCT GCATCATTCT CTACCCGCAA AGCAGCTCCA
AACTATAAAG TCCGCATGGA CCCGTCCAAC CGTACCCTCG ACTCCATGTT CACTGTCATT
GACCCCTCCC AACTCTCCGG TTTTGTCGAA GACGGAGAAT TGCAGGAACA AGAACGACCT
TCCAAAAGGA GGAATGTTGA TCCAGAATTT CAAGGTGATG AGTCCATAGT ACTGGACGAT
GATAACGACG ACGAGGGACA AGCAGAAGAA GGGGAAAGAG AACAAGTTTT CGCGGATGAA
GGGGAAAGTG CGAAAGGGAA AGCGAAGGAG ATTGAGGAGA GCGTATGTCA TTTTACAAGT
ATCCAATCTT TGAGAAGGGC AGTCAAGAGG GATGGAAATG CTGGTGGGTT CCTGTTTCTT
ATGTCCTTGT TCTTGAGGAA ACTGATCATT TTGACAGAGC TTCACGAGAT CTTTCAACGG
CATGCTTTCG TCGGAGTTGT CGATCGATAT CAATGCCTTT CGCTTATCCA GCATAGCACG
AAGCTATTCC TTGTCAACCA TGGCTCATTG GGGTGAGCTC CCACTCCGAG GAGGAAACCT
AATTGTCCAA AATGCTAACC CTGATCCACA TTGTAAAGTG ATGAACATTT TTATCAACTT
GGTCTTCGGC AGTTCGGCGC ATTTAACCGT ATACGCCTTG ATCCTGCCCC ACAGTTGAAG
GAGCTTTTGA CGTTAGCGGC AGAGGACGAG CCTGGGCTGC TTGAAGCAGG GTTGGAGGTA
GAAAGTGTTG TGGATGTACG TCTCTTGGCC CATGTTTACG CTCCTTTTCA TGCTGCTCTT
GGTTCCTAAT CCTTTTCTTC TCAGTATATC GCAAGCTTGT TAAGAGACCG TCAGGAAATG
CTGGACGAAT ATTTTTCCCT TCTCATTACT GAAGACGGAA AAGTGGAGAC CCTCCCTATG
TTGTTGAAAG GATATACTCC GAATTTGGAT CGGTTGCCTC ACTTCTTACT ATGCCTTGGA
ACACAAGTGA GTTTGCCCCT CGTCTTGTGT GACGAATGCA CGGTGCTTTA TGTACTGTAA
TTTAAAGTGA ACTAATGGAC GGGACATGTA GGTGGACTGG GATAATGAAA AGGAATGTTT
CCAAACTTTC CTTCGCGAAC TCGCATTCTT CTATTCCCCT CGGCCTTTTG AAGACCAACC
CCCTCCACCG CACACTAAAG ATGAAAACAT GACCGGAGAC GAGTTAGAGG GTGTAGAGCC
CACCCCGGAA GAGATTCAGC ATCAGCTCTG GCAGCTCGAG CACGTCTTGT TCCCCAGCTT
TAGACGGCAC ACAGTATGGC CAAAGAGCTG TATGACGCAT GTCAATCAAC TGGCCGATTT
GCCGGACTTG TTTAGGATCT TTGAAAGATG TTAAAGGGGT TTGTCGCGCC CGGTACTTTA
TAAAAGGCTA TGTGGGAGCT TCTTGTGCAA TTTGACAAGT ATAATTGC
 
Protein sequence
MSEDSMDIDL RPEVEEIEPE GPKPIHRLTK DVINQIAAAE IIHRPSNAIK ELLENSLDAG 
STSIKISVKD GGLKLLQITD NGHGINKDDL PLLCERYATS KLQKFEDLQS LGTYGFRGEA
LASISYCSHV EVVTKTKNEG CGWKAHYQDG SLIPAKPGGT ADPKPAAAND GTVITAADLF
YNMPLRKRAF KSTSDEYNRI IDVVTKYAIH NPHVAWVCKK AGTALPDVAT QVGSNTKANI
AALYTSALAN ELLEIPESEL QPARLGAKLK GWVSNANSSW SKKGGWLLFI NNRLVDSNKL
KKAVEGHYTS YLPKGASPWA YLSLQIDPAK IDVNVHPTKS EVRFLNEDEI VDAVVQAVQT
ALEGANLSRS FTVQTLLPGA PTPLGKRESS NSTIASASFS TRKAAPNYKV RMDPSNRTLD
SMFTVIDPSQ LSGFVEDGEL QEQERPSKRR NVDPEFQGDE SIVLDDDNDD EGQAEEGERE
QVFADEGESA KGKAKEIEES VCHFTSIQSL RRAVKRDGNA ELHEIFQRHA FVGVVDRYQC
LSLIQHSTKL FLVNHGSLGD EHFYQLGLRQ FGAFNRIRLD PAPQLKELLT LAAEDEPGLL
EAGLEVESVV DYIASLLRDR QEMLDEYFSL LITEDGKVET LPMLLKGYTP NLDRLPHFLL
CLGTQVDWDN EKECFQTFLR ELAFFYSPRP FEDQPPPPHT KDENMTGDEL EGVEPTPEEI
QHQLWQLEHV LFPSFRRHTV WPKSCMTHVN QLADLPDLFR IFERC