Gene Information |
Locus tag | CNL06020 |
Symbol | |
ID | 3254964 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 661222 |
End bp | 664131 |
Gene Length | 2910 bp |
Protein Length | 701 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254077 |
Product | CAT1 catalase, putative |
Protein accession | XP_568133 |
Protein GI | 58261446 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0753] Catalase |
TIGRFAM ID | |
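The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the coding sequence ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record: Start bp, End bp, Protein Length.
gene_len = span_length(661222, 664131)
cds_len = coding_length(701)
noncoding = gene_len - cds_len

print(gene_len, cds_len, noncoding)  # 2910 2106 804
```

Under these assumptions the span matches the listed Gene Length of 2910 bp, and the 804 bp not accounted for by coding sequence are consistent with the gene being marked as spliced (introns, possibly with untranslated regions).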
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACTACCTC TTGTTTCCGT CGAAAACACC GTAATCTTCT CTCGCGTTGA TGGCATGCTG ACCAATTGCA CGAGATATGA AAACCACCGC GGCCAGGCGC ATCGGGATAT GGGAGCGTCA CTGGCGGTTG GCGCTTGGGC GGCTGTGAAC AGCTTTCGAC CATTTATCTT CGCCTAAACT CATCCTTCTG ATACATGACT TTGACAAATA GAATGGTACA TATTTGGAAT ATGAGATGCA CCCCCGGCTC TCCTTTCCCA TTCCCGTGCT TACTGCGATC CACCATCTCT CTATCTCTGT CAGTCTTATC TGCCCGATAT GTCTGCTCGT AACCTAGCCT CTGGTACGCC GTACCGCAAG AACCTGTCGC CAGCCAACAG TTCGCACTTC GATGCTCCGT CAGTATCTTC TACTTCCGAC TCAGTCAAGT CGGCATATTC TGTTGGCTAC ACTCCTGTGC CAACTGCCGA CCGTTCCTAT TCCGGTTTAC TTCACGGCAC ACTATCCAAA GTGTTCCGCA AGGGCGGTCA AGAAATCATT GTATCCGAAG AGCCTGTGTC TTTTAGCTCT TCCAAGGAAG ACTCTGAATC CACTCGCTTC AATGGCTGTC CTTACATGAA CGGTAGTATC CCTCAGACTC CGACACCAAA GTCGACGTCC AAATCAGCTT CAGTTGCCGT TCCTGTTCCT TTGCCCCCTC CTTCTGCTGC GCGTCTGAAG GTCCAAAAAC AAAATGATTC AATCCCTCGA GAGAATGGCA ACGCTCTGGC GGACGATGTG GGAAAGTTGA CTGTTCGATC ATACGAGAAG GATAGGGACC TTAAGTCCCA GGAGGTGTAC GTATTAAGGT CCCGCCTTTT TTAACTTCCT AGACTGACAA GCACCTCAAA GTATCTATAC TACATCCAAC GGTGTTCCGG TGCCCCACCC GTATGCTGTT CAGCGGGCTG GTGTTAACGG TCCCCTTCTT CTGCAAGACT TCCATTTGAT TGACTTACTC TCTCACTTTG ATCGCGAAAG GTGAGTCTGA TCATGACTTG GTTCCTAAAC CTTGAGGACT GACTTTTGCA TAGGATCCCC GAAAGAGTGG TCCATGCCAA GGGCTCAGGT GCCCATGGTA CCTGGGAGTG TACAGACGGC CTAGAGGACC TCTGCCTCGC CAACATGTTC CAAAAGGGCA CTACCTGTCC TTTGACTATT CGATTTTCAA CCGTCGGGGG AGAATCGGGA TCTCCTGACC TTGCTCGGTA AGTAGTCTAA TATTTCTCCT TAGGATTCTT GAACTGACGT TCAAGCTTAG TGATCCTCGT GGTTTTGCTG TCAAATTTAG AACGGCTGAG GGTAACTGGG ACTTCGTTGC GAATAACACT CCCGTTTTCT TTCGTGAGTT ACGCCCTTTT ACCTAGCGTA ATGGACAAAA ATAATTTGCT GACATATTGT AGTGCGCGAC CCAGCCAAGT TCCCTCACTT CATTCACACT CAGAAACGGG ACCCAGCCAC CCATCTTAGC GGCGGAGATG ATTCCACCAT GTTCTGGGAC TACCTTTCTC AGAATCCTGA ATCCATTCAC CAGGTCATGG TACGTTATCA TATTGTACAC TAAACCAACT CGCCAAATGC TGACAGCCTT GGATAAACAG ATACTCATGT CTGATCGAGG CATTCCCGCG GGATGGCGTC ACATGCATGG TTATTACGGG CACACCCTCA AGATCGTTAA TGACAATGGC GACTGGGTTT ATGCCCAATT CCACCTCATC TCTGACCAGG GCAACAAGTT CTTTACGAGC GAGGAGGCAT 
CTACCAAATC ACCTGACTGG GGTCAGAAGG ACTTGTACGA AGCTATTGGG CGTGGAGAGT ACGTTTTGAC TTTATTCCTG ATAAATTCAT CTTCAAAGAC GTTTACTGAC TTACTTTCGC AGGTACCCAT CTTGGACGAT GAAGGTCCAA GTTATGACAC AAGAACAAGC AGAGGAGGCA TGGGAAAAGA AGCGGATCAA TGTCTTTGAC TTGACCCACG TCTGGCCTCA TGGAGATTAT CCACTCAGGA CTATAGGCAA GATCACCTTG AATGAGAATC CTAGTGTAAG TCAGCTTATA ATGCAATACT ATGGTATCAT ATCTGAGTTG ATCTTCTGTA GAATTACTTT GCTGAGGTTG AGCAAGCGAC GTTCAACCCT GCTCACATGA TCCCAGGCGT AGAACCCTCT GCCGACCCTG TACTCCAAGC CCGACTGTTC TCTTACCCTG ACGCGCACCG CCACCGTGTT GGTGCCAACT ACCAACAGCT TCCAGTCAAT CAATCGGCCA CCCCTTACGC AACGGGCAAC TTTCAACGCG ATGGTGCCAT GGCTTTCTAC AATCAGGGAG GAAGACCCGC TTATCTCTCC AGTATCGAGC CCATCAAGTT CCGAGAGAAG CGCGTCAACC TTAACAAAGT GCACGGTCAA TTTATCGGTG AAGCTGTCAG CTTCCTCAGC GAGATCCGTC CTGAGGACTT TAACGCCCCT CGTGCCCTTT GGCAGAAAGT GTTCAGCGAC GAGTCAAAGG AACGTTTCAT TCAGACTGTC GCCGGGCACA TGTCGACCTG CAAGCGCAAG GAAATTATTG CCCGTCAAAT TGCCATTTTC CGACAAGTAT CGCCTGATCT TGGTGCTCGT CTCGAGAAGG CCACCAATGT CAGGGGCTAT GGGAGTATTG AGGGGATGTC TTTCAACGGT ACTCATAATG GCTTTGGTGT TAAGCGTGGG GCGAACGGCC TTCGCCAAGA TGCGGACGTT GTGTTCAATA ATGGTGCTCC TCAGAAGACT CAGAGGGCTC GTTGACTCGT AATCTGCTTC TTAGTTGTGG CTATTTCTAG TGCGGTCGAA TTAGGTTTGT TTAGGCGGTT TTCTGATCAA CAAATAGTCG
Protein sequence | MSARNLASGT PYRKNLSPAN SSHFDAPSVS STSDSVKSAY SVGYTPVPTA DRSYSGLLHG TLSKVFRKGG QEIIVSEEPV SFSSSKEDSE STRFNGCPYM NGSIPQTPTP KSTSKSASVA VPVPLPPPSA ARLKVQKQND SIPRENGNAL ADDVGKLTVR SYEKDRDLKS QEVIYTTSNG VPVPHPYAVQ RAGVNGPLLL QDFHLIDLLS HFDRERIPER VVHAKGSGAH GTWECTDGLE DLCLANMFQK GTTCPLTIRF STVGGESGSP DLARDPRGFA VKFRTAEGNW DFVANNTPVF FLRDPAKFPH FIHTQKRDPA THLSGGDDST MFWDYLSQNP ESIHQVMILM SDRGIPAGWR HMHGYYGHTL KIVNDNGDWV YAQFHLISDQ GNKFFTSEEA STKSPDWGQK DLYEAIGRGE YPSWTMKVQV MTQEQAEEAW EKKRINVFDL THVWPHGDYP LRTIGKITLN ENPSNYFAEV EQATFNPAHM IPGVEPSADP VLQARLFSYP DAHRHRVGAN YQQLPVNQSA TPYATGNFQR DGAMAFYNQG GRPAYLSSIE PIKFREKRVN LNKVHGQFIG EAVSFLSEIR PEDFNAPRAL WQKVFSDESK ERFIQTVAGH MSTCKRKEII ARQIAIFRQV SPDLGARLEK ATNVRGYGSI EGMSFNGTHN GFGVKRGANG LRQDADVVFN NGAPQKTQRA R
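The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for turning such a display string back into a contiguous sequence and computing its GC fraction is given below; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown in the record.
first_blocks = "CCACTACCTC TTGTTTCCGT"
seq = clean_sequence(first_blocks)

print(len(seq), gc_fraction(seq))  # 20 0.5
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field in the record.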