Gene Information |
Locus tag | CNE04630 |
Symbol | |
ID | 3257720 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 1293709 |
End bp | 1296696 |
Gene Length | 2988 bp |
Protein Length | 765 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257047 |
Product | DNA binding protein, putative |
Protein accession | XP_571158 |
Protein GI | 58268004 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
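The coordinate fields above are consistent with each other if Start bp and End bp are read as a closed, 1-based interval. The sketch below checks that arithmetic and estimates how many bases of the gene span are not accounted for by the protein. It assumes one codon per residue plus one stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1293709
end_bp = 1296696
protein_length_aa = 765


def span_length(start: int, end: int) -> int:
    """Length of a closed (inclusive) interval of 1-based coordinates."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Assumption: the coding bases are one codon per residue plus a stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Whatever remains would be introns and any flanking untranslated bases.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2988, matching the Gene Length field
print(coding_bp)       # 2298
print(non_coding_bp)   # 690
```

A nonzero remainder fits the "Is gene spliced | Yes" field, although the figure alone does not say how the 690 bases divide between introns and untranslated regions.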
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGCAACAT GCCATGTCAG AGGATTCTAT GGACATCGAT CTACGGCCAG AAGTAGAAGA AATTGAACCC GAGGGCCCGA AACCCATACA TAGGCTTACA AAAGATGTCA TCAACCAGAT CGCTGCTGCC GAGGTGAGGC CAAGACGCGT CGGTAATGTT GCTCGAGCTG ACGCGTACAA TACTACTGCA GATTATTCAT CGACCGTCAA ATGCTATCAA GGAGCTCCTT GAAAACTCTC TAGATGCAGG CTCTACATCT ATCAAGATTT CAGTCAAGGA TGGAGGTCTG AAGCTCTTGC AAATCACCGA TAACGGTCAT GGCATCAACA AAGATGACTT GCCCCTTCTT TGCGAGCGCT ACGCGACTTC AAAGCTGCAA AAGTTTGAGG ATCTCCAGTC GTTAGGGACA TATGGCTTTA GAGGCGAAGC TCTTGCAAGT ATAAGTTACT GCAGTCACGT CGAAGTTGTT ACGAAGACCA AAAACGAGGG GTGTGGCTGG AAGTGAGTGG TAAATCTGGA CAGTACGCGT AGTAACCAAC TGATGGTCCT GCCAGAGCTC ACTATCAAGA TGGCAGCTTG ATCCCAGCAA AGCCTGGAGG TACCGCAGAC CCGAAGCCAG CGGCTGCCAA TGACGGAACG GTCATCACGG TAAGCTTCCC ACCCTAAGGG CGCAAAAAGG TGAAGCTCAT ACACCGTGGA CGCTAGGCTG CAGACCTCTT TTACAACATG CCACTTCGTA AGCGGGCATT CAAGTCAACC TCAGACGAAT ATAACCGTAT TATCGACGTG GTCACCAAAT ATGCCATTCA TAATCCTCAC GTTGCGTGGG TATGCAAAAA GGCCGGCACT GCTTTACCTG ATGTTGCCAC CCAGGTCGGT TCGAATACCA AGGCGAATAT CGCGGCACTC TACACATCCG CACTGGCCAA TGAGTTGCTA GAAATACCAG AGTCTGAACT GCAGCCTGCT AGGCTAGGTG CAAAGCTAAA AGGCTGGGTG AGTAATGCGA ATAGTAGCTG GTCGAAGAAG GGGGGGTGGT TACTTTTCAT TAATAGTGAG TTATTTATCA TGTACGCCTT GAAAGATGCT CATTCACTTC ACAATTCAGA TAGGCTAGTC GATTCGAACA AGTTGAAGAA AGCTGTAGAA GGCCACTACA CCTCGTACCT CCCAAAAGGT GCTTCGCCCT GGGCATATCT CAGGTACGTG TTTTTTGACG AATGTCATGG ACAGTACTAA TCGTATGACA TTAGTCTGCA AATTGACCCC GCAAAAATTG ACGTGAATGT ACATCCCACA AAGTCAGAGG TCCGTTTTCT CAATGAAGAT GAAATTGTCG ACGCTGTCGT GCAAGCCGTT CAAACCGCTC TAGAAGGTGC CAACCTCTCG CGTTCTTTCA CCGTTCAAGT AATTCTTTTC CCCTCACTTC ATTCTCAACA ATATGGCTTA CATATCTGCC ATAGACTCTG CTTCCTGGTG CCCCTACACC TTTAGGAAAA CGTGAAAGTT CAAATTCCAC TATAGCATCT GCATCATTCT CTACCCGCAA AGCAGCTCCA AACTATAAAG TCCGCATGGA CCCGTCCAAC CGTACCCTCG ACTCCATGTT CACTGTCATT GACCCCTCCC AACTCTCCGG TTTTGTCGAA GACGGAGAAT TGCAGGAACA AGAACGACCT TCCAAAAGGA GGAATGTTGA TCCAGAATTT CAAGGTGATG AGTCCATAGT ACTGGACGAT GATAACGACG ACGAGGGACA AGCAGAAGAA GGGGAAAGAG AACAAGTTTT CGCGGATGAA 
GGGGAAAGTG CGAAAGGGAA AGCGAAGGAG ATTGAGGAGA GCGTATGTCA TTTTACAAGT ATCCAATCTT TGAGAAGGGC AGTCAAGAGG GATGGAAATG CTGGTGGGTT CCTGTTTCTT ATGTCCTTGT TCTTGAGGAA ACTGATCATT TTGACAGAGC TTCACGAGAT CTTTCAACGG CATGCTTTCG TCGGAGTTGT CGATCGATAT CAATGCCTTT CGCTTATCCA GCATAGCACG AAGCTATTCC TTGTCAACCA TGGCTCATTG GGGTGAGCTC CCACTCCGAG GAGGAAACCT AATTGTCCAA AATGCTAACC CTGATCCACA TTGTAAAGTG ATGAACATTT TTATCAACTT GGTCTTCGGC AGTTCGGCGC ATTTAACCGT ATACGCCTTG ATCCTGCCCC ACAGTTGAAG GAGCTTTTGA CGTTAGCGGC AGAGGACGAG CCTGGGCTGC TTGAAGCAGG GTTGGAGGTA GAAAGTGTTG TGGATGTACG TCTCTTGGCC CATGTTTACG CTCCTTTTCA TGCTGCTCTT GGTTCCTAAT CCTTTTCTTC TCAGTATATC GCAAGCTTGT TAAGAGACCG TCAGGAAATG CTGGACGAAT ATTTTTCCCT TCTCATTACT GAAGACGGAA AAGTGGAGAC CCTCCCTATG TTGTTGAAAG GATATACTCC GAATTTGGAT CGGTTGCCTC ACTTCTTACT ATGCCTTGGA ACACAAGTGA GTTTGCCCCT CGTCTTGTGT GACGAATGCA CGGTGCTTTA TGTACTGTAA TTTAAAGTGA ACTAATGGAC GGGACATGTA GGTGGACTGG GATAATGAAA AGGAATGTTT CCAAACTTTC CTTCGCGAAC TCGCATTCTT CTATTCCCCT CGGCCTTTTG AAGACCAACC CCCTCCACCG CACACTAAAG ATGAAAACAT GACCGGAGAC GAGTTAGAGG GTGTAGAGCC CACCCCGGAA GAGATTCAGC ATCAGCTCTG GCAGCTCGAG CACGTCTTGT TCCCCAGCTT TAGACGGCAC ACAGTATGGC CAAAGAGCTG TATGACGCAT GTCAATCAAC TGGCCGATTT GCCGGACTTG TTTAGGATCT TTGAAAGATG TTAAAGGGGT TTGTCGCGCC CGGTACTTTA TAAAAGGCTA TGTGGGAGCT TCTTGTGCAA TTTGACAAGT ATAATTGC
|
Protein sequence | MSEDSMDIDL RPEVEEIEPE GPKPIHRLTK DVINQIAAAE IIHRPSNAIK ELLENSLDAG STSIKISVKD GGLKLLQITD NGHGINKDDL PLLCERYATS KLQKFEDLQS LGTYGFRGEA LASISYCSHV EVVTKTKNEG CGWKAHYQDG SLIPAKPGGT ADPKPAAAND GTVITAADLF YNMPLRKRAF KSTSDEYNRI IDVVTKYAIH NPHVAWVCKK AGTALPDVAT QVGSNTKANI AALYTSALAN ELLEIPESEL QPARLGAKLK GWVSNANSSW SKKGGWLLFI NNRLVDSNKL KKAVEGHYTS YLPKGASPWA YLSLQIDPAK IDVNVHPTKS EVRFLNEDEI VDAVVQAVQT ALEGANLSRS FTVQTLLPGA PTPLGKRESS NSTIASASFS TRKAAPNYKV RMDPSNRTLD SMFTVIDPSQ LSGFVEDGEL QEQERPSKRR NVDPEFQGDE SIVLDDDNDD EGQAEEGERE QVFADEGESA KGKAKEIEES VCHFTSIQSL RRAVKRDGNA ELHEIFQRHA FVGVVDRYQC LSLIQHSTKL FLVNHGSLGD EHFYQLGLRQ FGAFNRIRLD PAPQLKELLT LAAEDEPGLL EAGLEVESVV DYIASLLRDR QEMLDEYFSL LITEDGKVET LPMLLKGYTP NLDRLPHFLL CLGTQVDWDN EKECFQTFLR ELAFFYSPRP FEDQPPPPHT KDENMTGDEL EGVEPTPEEI QHQLWQLEHV LFPSFRRHTV WPKSCMTHVN QLADLPDLFR IFERC
|
| |
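The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate an open stretch of the gene sequence. It rests on two assumptions: the standard genetic code is used (the Translation table field is empty in this record), and the offset of 13 was found by matching the first blocks of the gene sequence against the start of the protein sequence by hand. All function names are illustrative.

```python
# Standard genetic code in the usual compact T, C, A, G ordering (assumed;
# the record leaves the Translation table field empty).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(field: str) -> str:
    """Remove the whitespace between ten-character display blocks."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, offset: int = 0) -> str:
    """Translate complete codons from offset, stopping at the first stop codon."""
    residues = []
    for i in range(offset, len(seq) - 2, 3):
        amino = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First five blocks of the Gene sequence field.
first_blocks = "CACGCAACAT GCCATGTCAG AGGATTCTAT GGACATCGAT CTACGGCCAG"
dna = join_blocks(first_blocks)

# Offset 13 is the ATG that lines up with the Protein sequence field.
print(translate(dna, offset=13))  # MSEDSMDIDLRP
```

Translating the whole gene sequence this way would not reproduce the full protein, because the record marks the gene as spliced and the introns would have to be removed first. Note also that the first ATG in the sequence (offset 8) is not the start codon, so a naive search for the first ATG gives the wrong reading frame.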