Gene CNL06020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL06020 
Symbol 
ID3254964 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp661222 
End bp664131 
Gene Length2910 bp 
Protein Length701 aa 
Translation table 
GC content49% 
IMG OID638254077 
ProductCAT1 catalase, putative 
Protein accessionXP_568133 
Protein GI58261446 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCACTACCTC TTGTTTCCGT CGAAAACACC GTAATCTTCT CTCGCGTTGA TGGCATGCTG 
ACCAATTGCA CGAGATATGA AAACCACCGC GGCCAGGCGC ATCGGGATAT GGGAGCGTCA
CTGGCGGTTG GCGCTTGGGC GGCTGTGAAC AGCTTTCGAC CATTTATCTT CGCCTAAACT
CATCCTTCTG ATACATGACT TTGACAAATA GAATGGTACA TATTTGGAAT ATGAGATGCA
CCCCCGGCTC TCCTTTCCCA TTCCCGTGCT TACTGCGATC CACCATCTCT CTATCTCTGT
CAGTCTTATC TGCCCGATAT GTCTGCTCGT AACCTAGCCT CTGGTACGCC GTACCGCAAG
AACCTGTCGC CAGCCAACAG TTCGCACTTC GATGCTCCGT CAGTATCTTC TACTTCCGAC
TCAGTCAAGT CGGCATATTC TGTTGGCTAC ACTCCTGTGC CAACTGCCGA CCGTTCCTAT
TCCGGTTTAC TTCACGGCAC ACTATCCAAA GTGTTCCGCA AGGGCGGTCA AGAAATCATT
GTATCCGAAG AGCCTGTGTC TTTTAGCTCT TCCAAGGAAG ACTCTGAATC CACTCGCTTC
AATGGCTGTC CTTACATGAA CGGTAGTATC CCTCAGACTC CGACACCAAA GTCGACGTCC
AAATCAGCTT CAGTTGCCGT TCCTGTTCCT TTGCCCCCTC CTTCTGCTGC GCGTCTGAAG
GTCCAAAAAC AAAATGATTC AATCCCTCGA GAGAATGGCA ACGCTCTGGC GGACGATGTG
GGAAAGTTGA CTGTTCGATC ATACGAGAAG GATAGGGACC TTAAGTCCCA GGAGGTGTAC
GTATTAAGGT CCCGCCTTTT TTAACTTCCT AGACTGACAA GCACCTCAAA GTATCTATAC
TACATCCAAC GGTGTTCCGG TGCCCCACCC GTATGCTGTT CAGCGGGCTG GTGTTAACGG
TCCCCTTCTT CTGCAAGACT TCCATTTGAT TGACTTACTC TCTCACTTTG ATCGCGAAAG
GTGAGTCTGA TCATGACTTG GTTCCTAAAC CTTGAGGACT GACTTTTGCA TAGGATCCCC
GAAAGAGTGG TCCATGCCAA GGGCTCAGGT GCCCATGGTA CCTGGGAGTG TACAGACGGC
CTAGAGGACC TCTGCCTCGC CAACATGTTC CAAAAGGGCA CTACCTGTCC TTTGACTATT
CGATTTTCAA CCGTCGGGGG AGAATCGGGA TCTCCTGACC TTGCTCGGTA AGTAGTCTAA
TATTTCTCCT TAGGATTCTT GAACTGACGT TCAAGCTTAG TGATCCTCGT GGTTTTGCTG
TCAAATTTAG AACGGCTGAG GGTAACTGGG ACTTCGTTGC GAATAACACT CCCGTTTTCT
TTCGTGAGTT ACGCCCTTTT ACCTAGCGTA ATGGACAAAA ATAATTTGCT GACATATTGT
AGTGCGCGAC CCAGCCAAGT TCCCTCACTT CATTCACACT CAGAAACGGG ACCCAGCCAC
CCATCTTAGC GGCGGAGATG ATTCCACCAT GTTCTGGGAC TACCTTTCTC AGAATCCTGA
ATCCATTCAC CAGGTCATGG TACGTTATCA TATTGTACAC TAAACCAACT CGCCAAATGC
TGACAGCCTT GGATAAACAG ATACTCATGT CTGATCGAGG CATTCCCGCG GGATGGCGTC
ACATGCATGG TTATTACGGG CACACCCTCA AGATCGTTAA TGACAATGGC GACTGGGTTT
ATGCCCAATT CCACCTCATC TCTGACCAGG GCAACAAGTT CTTTACGAGC GAGGAGGCAT
CTACCAAATC ACCTGACTGG GGTCAGAAGG ACTTGTACGA AGCTATTGGG CGTGGAGAGT
ACGTTTTGAC TTTATTCCTG ATAAATTCAT CTTCAAAGAC GTTTACTGAC TTACTTTCGC
AGGTACCCAT CTTGGACGAT GAAGGTCCAA GTTATGACAC AAGAACAAGC AGAGGAGGCA
TGGGAAAAGA AGCGGATCAA TGTCTTTGAC TTGACCCACG TCTGGCCTCA TGGAGATTAT
CCACTCAGGA CTATAGGCAA GATCACCTTG AATGAGAATC CTAGTGTAAG TCAGCTTATA
ATGCAATACT ATGGTATCAT ATCTGAGTTG ATCTTCTGTA GAATTACTTT GCTGAGGTTG
AGCAAGCGAC GTTCAACCCT GCTCACATGA TCCCAGGCGT AGAACCCTCT GCCGACCCTG
TACTCCAAGC CCGACTGTTC TCTTACCCTG ACGCGCACCG CCACCGTGTT GGTGCCAACT
ACCAACAGCT TCCAGTCAAT CAATCGGCCA CCCCTTACGC AACGGGCAAC TTTCAACGCG
ATGGTGCCAT GGCTTTCTAC AATCAGGGAG GAAGACCCGC TTATCTCTCC AGTATCGAGC
CCATCAAGTT CCGAGAGAAG CGCGTCAACC TTAACAAAGT GCACGGTCAA TTTATCGGTG
AAGCTGTCAG CTTCCTCAGC GAGATCCGTC CTGAGGACTT TAACGCCCCT CGTGCCCTTT
GGCAGAAAGT GTTCAGCGAC GAGTCAAAGG AACGTTTCAT TCAGACTGTC GCCGGGCACA
TGTCGACCTG CAAGCGCAAG GAAATTATTG CCCGTCAAAT TGCCATTTTC CGACAAGTAT
CGCCTGATCT TGGTGCTCGT CTCGAGAAGG CCACCAATGT CAGGGGCTAT GGGAGTATTG
AGGGGATGTC TTTCAACGGT ACTCATAATG GCTTTGGTGT TAAGCGTGGG GCGAACGGCC
TTCGCCAAGA TGCGGACGTT GTGTTCAATA ATGGTGCTCC TCAGAAGACT CAGAGGGCTC
GTTGACTCGT AATCTGCTTC TTAGTTGTGG CTATTTCTAG TGCGGTCGAA TTAGGTTTGT
TTAGGCGGTT TTCTGATCAA CAAATAGTCG
 
Protein sequence
MSARNLASGT PYRKNLSPAN SSHFDAPSVS STSDSVKSAY SVGYTPVPTA DRSYSGLLHG 
TLSKVFRKGG QEIIVSEEPV SFSSSKEDSE STRFNGCPYM NGSIPQTPTP KSTSKSASVA
VPVPLPPPSA ARLKVQKQND SIPRENGNAL ADDVGKLTVR SYEKDRDLKS QEVIYTTSNG
VPVPHPYAVQ RAGVNGPLLL QDFHLIDLLS HFDRERIPER VVHAKGSGAH GTWECTDGLE
DLCLANMFQK GTTCPLTIRF STVGGESGSP DLARDPRGFA VKFRTAEGNW DFVANNTPVF
FLRDPAKFPH FIHTQKRDPA THLSGGDDST MFWDYLSQNP ESIHQVMILM SDRGIPAGWR
HMHGYYGHTL KIVNDNGDWV YAQFHLISDQ GNKFFTSEEA STKSPDWGQK DLYEAIGRGE
YPSWTMKVQV MTQEQAEEAW EKKRINVFDL THVWPHGDYP LRTIGKITLN ENPSNYFAEV
EQATFNPAHM IPGVEPSADP VLQARLFSYP DAHRHRVGAN YQQLPVNQSA TPYATGNFQR
DGAMAFYNQG GRPAYLSSIE PIKFREKRVN LNKVHGQFIG EAVSFLSEIR PEDFNAPRAL
WQKVFSDESK ERFIQTVAGH MSTCKRKEII ARQIAIFRQV SPDLGARLEK ATNVRGYGSI
EGMSFNGTHN GFGVKRGANG LRQDADVVFN NGAPQKTQRA R