Gene CNF03760 details

Gene Information

Locus tag: CNF03760
Symbol:
ID: 3258050
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1099595
End bp: 1101758
Gene Length: 2164 bp
Protein Length: 539 aa
Translation table:
GC content: 51%
IMG OID: 638257495
Product: carboxypeptidase C, putative
Protein accession: XP_571636
Protein GI: 58268960
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
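
The lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive coordinates (an assumption, though it reproduces the listed gene length) and that the coding region ends with a single stop codon. The function names `gene_span` and `min_coding_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 1099595
end_bp = 1101758
protein_length_aa = 539


def gene_span(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


span = gene_span(start_bp, end_bp)
coding = min_coding_length(protein_length_aa)

print(span)           # 2164, matches the listed Gene Length
print(coding)         # 1620
print(span - coding)  # 544 bp not accounted for by codons
```

The 544 bp left over after the codons is consistent with the gene being marked as spliced: the reported span would include introns and any flanking non-coding sequence in addition to the coding exons.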


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.343115
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTATACTCAC CGTCTTAGAT TCACAACTAG CCATGCGTCT CTCCACCTTG GCTCTCATGG 
CCACACCGGC GTTGGCATTG CCCAACTACC TCCGTCTCAG CAACGGTCCT TCCGAGCTTG
CGAGTGACCT TGCCACCCAT GCTATTTCAT CTGCTCAGGC ATGGCTTCAG GGCGCCGTCT
CCAGCACCAG GAGCAGCGTT CAGAACGGAT GGAAGGGCGT TCAGGATGGA TTGGACCAAG
GACTCAAAGT GGAGACCGTT GATGAGGCTG GCATCGAGTG TAAGTTCATC AGCTCATCGG
TGCACACTCT TGCAGGCTGA CTTGCCACAC CAGACCTTGC CCTCTCTCAC CCTGCCTTCC
CCCTTCACCG CTTGCGTGTC GTCGAGCCCA AACTCTGTGA CCCCTCTGTC AAGCAGCTCT
CAGGCTACCT TGATATCTCC GAGACTCGTC ATCTTTTCTT CTGGTTCCAA GAGTCTCGTG
AGAACCCTGA TGAGGACCCG CTCGTTCTTT GGCTTAATGG TGGCCCCGGA TGTTCTTCCA
CCACTGGTCT TCTCTTTGAG CTGGGCGGAT GCAACATTAG GGACAAAGGA GAGAACACCA
CCTTTAACGA GCACTCTTGG AATTCTGTCG CTAACGTCTT GTACTTGGAC CGTGCGTGCC
TATTATACGC TCAGCGTAGC ACACACTGAC GAAGCGTAGA GCCTATCGGC GTAGGCTATT
CTTACGCCGA CGAGGGCGAA GTGAACAACT CTCCAGCTGC TGCCGAAGAT GTCTATGCTT
TCCTCGTTTT GTTTATCTCC AAGGTAATAT TTATTTTCCC TGACGCTCAA TTGAGAGCTA
ACTTTCTTCA GTTCCGAGAG TACAGCAAAC TTGATTTCCA TGTTGCGGGT GAATCCTATG
CCGGTACTTA CATTCCCAAC ATTGCCAGCG TACGTATCTC ATATTTCAAT CAGAATGGTT
AAATTAATAT TACATTCTAG GTCGTCCACA AGAACAACAT CGCTCTTGAC CTGGTCCCTA
CTCCTTCCGT CCCCAAGATC AACCTCAAGT CGGTCATGAT TGGTAACGGT CTTACCGACC
CATATGCCCA ATTCGGCTCT GTCCCTGAGT ATGCCACTCT GATTTGATTT TTCAAGGGGT
CTCTACTGAC TGCTCAACAG CTGGGCATGC AACTCTCCTT ACGCTCCTTA CGACGATCCC
TCTCCCGAAT GCGATTCTCT TCGTACGCGC GCTAACCGCT GCCAAGGTTT GATCAGCGGA
TGTTACAAGA CAAACTCGAG ATTCACCTGT GTTCCCGCGG CTCTCTACTG CTGGTCCATG
TTCAACGAAT TACAAGATCT CGTGCGTAAT CGTTCCCAGA CCATTCTGAT GGAACCAGGA
ATGCGGGCTG ACAGGAGAAT AGGGTCGTAA CATGTACGAT GTCCGGAAGA CCTGTGACAA
GTCTCCCGAG AAGGATGGAC CTCTGTGCTA CCGCGAAATG GGCTGGATGG AGACTTACTT
GAACAAACCC GAGGTGAAGA AGGAGTTGGG TGCTCCCGAG AGAGTCACTT TCCAGAGCTG
CAACATGCAG ATCAACCAAA ACTTGTAAGT CTCCACATGC GGTTGTAAGC AAATTTCGAA
GCTGATTTAG TAATAGCTTG TTGCACGGTG ATGGGATGCA CTATGCTGGT GGCCTTCTTC
CCGACCTTGT TGAGGATGAT ATCCGAGTTT TGATCTATGC TGGTCAAGCC GACATGCGTA
AGTCTTGTTT TCCCACTTCA CAGGTCAAAT GGCTCAACAC ATCCATGCAC TAGTCGTCAA
CTACATCGGA TGTGCTTCCG TTCTTGACAA CCTTCAGACG AGCTACCTCG CTTCCTACCT
TGCTGCGCCC TTCGTCAACT TCACCAGCCC CGATGGTGAG GTGTCCGGAT ACACCAAATC
TGCCAGCAAG GACGGCAAGG GCTCGGGCAA CGTCGCGTTT GTGGCTTTCC ACAATGCGGG
ACACATGGTA CCTCATGACG ATCCCGAAGG AGCGTTAAGA ATGGTGGGCC GATGGTTGAA
GAACGAACCT TTGGCTGTTG CTGAAGACGA GTAGTGGTAA TCTCGTCGAA ATTTAGAGGA
AGGATAATGT CCTTGTAAGA TCGAGAAATA AATATGTATA GGACAATGCA TACAGTAGAT
CATG
 
Protein sequence
MRLSTLALMA TPALALPNYL RLSNGPSELA SDLATHAISS AQAWLQGAVS STRSSVQNGW 
KGVQDGLDQG LKVETVDEAG IEYLALSHPA FPLHRLRVVE PKLCDPSVKQ LSGYLDISET
RHLFFWFQES RENPDEDPLV LWLNGGPGCS STTGLLFELG GCNIRDKGEN TTFNEHSWNS
VANVLYLDQP IGVGYSYADE GEVNNSPAAA EDVYAFLVLF ISKFREYSKL DFHVAGESYA
GTYIPNIASV VHKNNIALDL VPTPSVPKIN LKSVMIGNGL TDPYAQFGSV PDWACNSPYA
PYDDPSPECD SLRTRANRCQ GLISGCYKTN SRFTCVPAAL YCWSMFNELQ DLGRNMYDVR
KTCDKSPEKD GPLCYREMGW METYLNKPEV KKELGAPERV TFQSCNMQIN QNFLLHGDGM
HYAGGLLPDL VEDDIRVLIY AGQADMLVNY IGCASVLDNL QTSYLASYLA APFVNFTSPD
GEVSGYTKSA SKDGKGSGNV AFVAFHNAGH MVPHDDPEGA LRMVGRWLKN EPLAVAEDE