Gene CNN01410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN01410 
Symbol 
ID3255484 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp415250 
End bp418793 
Gene Length3544 bp 
Protein Length883 aa 
Translation table 
GC content49% 
IMG OID638254556 
Productdipeptidyl-peptidase and tripeptidyl-peptidase, putative 
Protein accessionXP_568605 
Protein GI58262390 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGCTGCGA TGCCACCACA AGGCTATGAC TATGACCATC CACAGCGCTC TCCACGCTCA 
TCCACATCCA CCATATACCA TGACAACCTG GACATAGACC CATTCCAGGA AAAACACACG
GCTTTCAGAG ACGATCCCAC CATCGAACAA GGTATCCACG TCTATCCAGA CGACGACGAT
GGAGAGGGCT ACACTGTCGA ATCCAGCCGG GTAAGCTCTT TGCTCCTCTT CCCGCTTTCT
GTTACAGACT GACACACTCT GCCACCCAGA CACATCCTCG GAACAAGTCA CGCAAGGTAC
TCGCCGTCCT CGTCGTTATC GTCACTTTCG CAGGTATCAT CGGCGTGCTC GCTGCTTCAG
GATACTCGGT ACCCTCTTTC TCGTCCAAAG GTGGCACAAA GCATATCACC ATGGACCATG
TGTTTAACGG CACGTTCAAT GCCTACAGCA AGCAGATTGA CTGGGTCAAG GAGGGTAGGT
ACCGGGCAGA TGGAGAATCT TGCTTTCATT TACATGCTGA TCATGTTCGT CATACGGGTA
CACAGCGGAA GACGGCACGT TCTCACATAT CAACAAGGAA GGAAACATTG TCCTCAACAC
CGTCCGCAAC ATGACCACCG ACACCCTTCT CGTCAACGCA TCGCTTGTCC TTGACCTTGA
GGGCAATCGA CTCCCCTGGA CGGGATGGGC CCTCTCAGCT GATATGCAGT ACGTCCTCTT
CAGAACCGAC CATCTCAAAC AGTGGCGGCA CTCGTCATTC GGGAACTACT GGATACATCG
TCGACAAGAT TCGGCTACGT TTCCTGTCAT CCCGCCGACC AGCCCGCCGA CTATAGCAAA
GTGTACGTGG TCGCCTGTCG GGCATGCATT GGCGTTTGTC AGTAAGAACG ACGTCTATAT
CATCTCTGAA GATGACCTCT CCTCTGTCCC CTCCTCCTCC TCCTCCTCCC CTTCACATGT
AAGAGTAACA ACCGATGGAA GCCACACAAT CTTCAACGGC GTCCCCGACT GGGTATACGA
AGAAGAAGTG TTTGAAACGG ATACCGCCCT CTGGTGGAGT CCTGATGGCA AGAGGGTTGC
TTTTTTGAGA AGCGATGAGA GTAAAGTACA TGATTTCAAG TTGCAGTATT ATAACCCATC
GAATGATGCG TTCAAGGTGC ATCAGTACCA GACCGAGCTG GATATGAAGT AGGGGCCCCT
TTTCCTTTTT TCTTTCCGCT TGTGGTACTC TGGCTGACTT TTACAACAGA TACCCCAAAC
CTGGTACACC CAACCCCACC GTCACAGTCC ACACTTTTTC CCTCTCTTCC CTCTCCTCCT
CCGCCACCTC GGCCACCGCA CAACGTCTCA CCTGGCCAGG CGAGTTCCCT CTCCCAGACC
GCATCCTCAC CGAAATTGGC TGGGTAGCCG ATGATGCCCT CCTCGTTAAA GAGATTGATA
GAGCTGCGAG GGACGGGAAC GTCGTTCTTT TCCAGTTTGG GAGCGACCAT GAAGCAAAGG
TGGAAGGGGA GATTGTGAGG CGGTTAGGGA AGGACGGGGA GGAAGGCGAT GATGGATGGA
TCGATCATGC AAGTGCCCTC CTTTTCATAA CTTCCCACTT CCTCCCTAAA CAAACGAGCT
AACCTTGTAC AAAATTCTTT CTTTGTTTCT TTTTCTCATT TATAGGGCCA GAACGTCATC
CCCGTTAAAG GGCCAGTCCA AGGATACCTC GACATCGTCC CTAATCAAGG GTACAACCAC
ATTGCGCTGT TCACTCCGTT GAACGCTAGT AAGCCTCGGT GGATCACTGC AGGAGAATGG
GAAGTCACCG AGATATCGGG GGTTGATACC GAAAAGGGCC TTGTGTATGT TTTTTCTTTA
ACATCCCCCA CCCCACCTTT TTCCCCTCTC ACTCACCTGT GTCACATGAA TAAACATGTT
CATGTGCTGA TTCATCCCAT GACGATACAG TTACTTTACA GCTGCAACCC CTTCTATCGA
CAGACACATC TACTCTATCC CCTTACCCAC TCTTTCATCC ATCGATGATG AAGAGGATCA
AGACGCATCG ATGAGTAGCA TGGCCGCATT GACAGATACT ACTTCCCCTG GATATTTTGA
AGCTTCCTTC TCTCCCAAAG CGGGGTATTA TGTCTTGGGG TATAAAGGGC CAGAAGTACC
TTGGCAGAGG TTGATAGAAG CTGGGTCTGG AGAGAATCGT GAGTATCTTT ATGGGGGGAA
CGGGGGCTTC GATTACTTTT CTACTGGGAG GGAATGCTGA TGAGATGTTT TTCTCTTTTT
TCCCTTTTCT TTGGCTACGG GTTGGGATGA AAAGGAGCGA ATGTGTTATT GGAAGGTAAT
GCAGGACTGA ACAAGACAGT TTCAGAATTC TTGAAACCTC TTGTCACGAG GACGACTATC
GAAAATGATG GCTACGGTAA TGCACCCTTT CATCTCTCTC GTCCCGTCCA GTTTCCCACT
AACAACAGCC TCCTTTCTTC GGTATATGCA GAACTCAACA TGCTCGAGAT CCTTCCTCCC
AACCTCGACA TCACCGGCCG CAAAAAGTAC CCCGTCCTGA TCCGCGTCTA CGGCGGACCC
GGCTCTCAAA TGGTGTCTAA CCGATTCGAA AGGGATTGGC ATTCTTACCT CGCTGCCTCC
CAGCGGTACA TTATCATCAT GCTCGACGGT CGTGGAACGG GGTTTAGAGG TAGGAATCTG
AGGAACCCGG TGAGGGATGA TTTGGGGCAT TGGGAGGTGC AGGATCAGGT CGCGGCGGCG
AGGGAGATGG CGAAGAGGGT TTATGTGGAT AGATCGAGGA TTGGGATTTG GGGATGGGTG
AGTGCGGAAT TTGCTTTTCT TGGGAATGTC GCTCTTGGAG GAATAAAAAA AGAGCTGATT
GGGATGATTA GAGTTATGGA GGATACATGA CTTGTAAGGC TATTGAAGCC GATTCTGGTA
TATTCACTCT CGGAAGTAAG CCTTGCCCTG TATCCTTTCC ATCTTGGTAG AAATCATTCT
CACATTTTAT TTTTATTTTT ATTTTCACTA CACTATAGTG GCCGTCGCAC CCGTCACAGA
TTGGCTTTAC TACGACTCCA TCTACACCGA ACGATACATG TCAACTCCCT CCGCCAACAA
GGACGGGTAC ACCACCTCGG CAGTGAACAA TGTCACTTCC TTTTCGGGCG ATAAAGTCGA
TTTCATCTGG GCACATGGAA GCGGGGACGA TAATGTCCAC TATATGAATA GTGCGGCGCT
TTTGGATAAG TTGACGCAGG AGCAAGTGAG AGGGTGGAGG TTTAGGATGT TTACTGATTC
GTAAGTTCCG CCAAGCGTTT TTATCAACCC GAAAGAAAAA AGAAAGAAAA AAAAACATGT
CTAATTACTC TTTTCCCGCA ACGTTCAGCA ATCATTCAAT GGATAAGCGG ATGGCGTACC
GGGAAGTGTA CGAGTGGATG AATGATTTCT TGGAAGAAAA ATGGGGTAAA GGAGGGACTG
TACATCATTA GATAAAAACT ACATTTAGAT GTAGAAGGAG CACAAATGCA GCATGTTGTA
TAAA
 
Protein sequence
MPPQGYDYDH PQRSPRSSTS TIYHDNLDID PFQEKHTAFR DDPTIEQGIH VYPDDDDGEG 
YTVESSRTHP RNKSRKVLAV LVVIVTFAGI IGVLAASGYS VPSFSSKGGT KHITMDHVFN
GTFNAYSKQI DWVKEAEDGT FSHINKEGNI VLNTVRNMTT DTLLVNASLV LDLEGNRLPW
TGWALSADMQ YVLFRTDHLK QWRHSSFGNY WIHRRQDSAT FPVIPPTSPP TIAKCTWSPV
GHALAFVSKN DVYIISEDDL SSVPSSSSSS PSHVRVTTDG SHTIFNGVPD WVYEEEVFET
DTALWWSPDG KRVAFLRSDE SKVHDFKLQY YNPSNDAFKV HQYQTELDMK YPKPGTPNPT
VTVHTFSLSS LSSSATSATA QRLTWPGEFP LPDRILTEIG WVADDALLVK EIDRAARDGN
VVLFQFGSDH EAKVEGEIVR RLGKDGEEGD DGWIDHGQNV IPVKGPVQGY LDIVPNQGYN
HIALFTPLNA SKPRWITAGE WEVTEISGVD TEKGLVYFTA ATPSIDRHIY SIPLPTLSSI
DDEEDQDASM SSMAALTDTT SPGYFEASFS PKAGYYVLGY KGPEVPWQRL IEAGSGENRA
NVLLEGNAGL NKTVSEFLKP LVTRTTIEND GYELNMLEIL PPNLDITGRK KYPVLIRVYG
GPGSQMVSNR FERDWHSYLA ASQRYIIIML DGRGTGFRGR NLRNPVRDDL GHWEVQDQVA
AAREMAKRVY VDRSRIGIWG WSYGGYMTCK AIEADSGIFT LGMAVAPVTD WLYYDSIYTE
RYMSTPSANK DGYTTSAVNN VTSFSGDKVD FIWAHGSGDD NVHYMNSAAL LDKLTQEQVR
GWRFRMFTDS NHSMDKRMAY REVYEWMNDF LEEKWGKGGT VHH