Gene CNC02510 details

Gene Information

Locus tag: CNC02510
Symbol:
ID: 3256463
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 717602
End bp: 720718
Gene Length: 3117 bp
Protein Length: 819 aa
Translation table:
GC content: 47%
IMG OID: 638255471
Product: beta-glucosidase, putative
Protein accession: XP_569544
Protein GI: 58264776
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
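
The reported gene length can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record but is not stated on the page; the function name gene_length is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature in bp.

    Assumes 1-based, inclusive coordinates (an assumption, not
    something the record itself states).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record for CNC02510.
length = gene_length(717602, 720718)
print(length)  # 3117, matching the Gene Length field
```

Because the gene is marked as spliced, this span includes introns, so it is longer than three times the 819 aa protein length.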


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATAAACGTC TTAAACTGAA TTCTCTGTGG TGGGATACAT TATTTCCGTT CATCACAACC 
TAGCAGACCA TGGCGCACAT GGGTCTCAAT CAATCATCAG CTTCTCATTT GCCTTTCCAT
GGAAATCCCT ATATAAACAA ACGTCTTGAA TAGGTCTCCC CTTCTCTTCT TCGTCTCTTA
CGGATCTTCT TCTACGGCTC CTACTCTTCT CCCTTTTTTT CAATCAAGCT TTCTCAATCG
GTGTGGCCTG ACAGCTTCGT CACTGCCTTG CTACAACAAT GCTGGCCCTG CCTGTCACCC
TCCTATGGCT CAGCACCGGT TTTCCGCTTC TTACCTACGC CTCGCCGGTT ACATCCGATG
CTGACAATCA TACTATAAGT GCTACTGAAG CAAACAGTAG TAGCTCGACT TCATTCCTTC
CTCACAATAC CTCTCGACAT CATGAATCAA GTTCACAATC CCCCATCACA TATATTTCTC
CTCAGATAGC GGCCCCTATT AATGCCACCT ACGTTAATCC CTCTCACAAG TGGGAAAAGG
CTTATCGAAA GGCCAAAAAG TATCTTGACG ATTGGACATT AGAGGAGAAA GTACAGCTCA
CAACCGGCGT GGGATGGGAA AACGGTCGTT GTGTAGGCAA CATCGGGGCC ATACCAAGCA
GAAATTTTTC GGGTTTATGT CTTCAGGATT CGCCATTAGG TGTTAGGTTG GCGGATTTTG
TTTCTGCTTT TCCAGCTGGA ATTAATGCTG CCGCAACGTG AGACACCGTT TCAACTTTTT
TTTGCCTATG TATCTCACCT TCACAACATA GGTTTGACAA GGATCTTATC TACGCTCGGG
GCTATGCTTT GGGCCAAGAA TTCAAAGGCA AAGGTGTGCA TGTTGCTTTA GGTCCAATGA
CAAATATGGG ACGTGTGGCT GCTGGCGGCA GAAACTGGGA AGGCTTTGGT GGCGATCCCT
ACCTAAGTGG CTGGGCAACT GAAATGACTG TTCGAGGTAT ACAAGATGCT GGTGTACAAG
CCTGGTGAGC CTTTTTCTTC TGCAGCTTCC TATCTGACAG CTAATGTTAC TAGTGTCAAG
CACTACGTTG GAAATGAGCA GGAACGCAAT CGGACCACAG AATCCTCTAA TATCGACGAC
CGTACTCTTC GGGAGATATA CACCCATCCC TTCTTACGTG CAGTACAAGC GGATGTGGCT
TCTGTAATGT GTTCTTACAA CCTCATCAAT GGTTCATGGG CGTGCGAAAA CCCCAAAACG
CTCAATGGGG TACTGAAGAC TGACTTTGGT TTCCAAGGCT ACGTCTTGTC CGACTGGGGT
GCTCAGCATT CTGGAGTTGT CTCTGCCAAC AACGGTCTGG ATATGTCCAT GCCTGGAGAC
ATCGTTCTCG GAAGTCTGAC TAGCTACTGG GGTTCCAATT TGACAGAGTC AGTCAAAAAC
GGGAGCGTGA GCGAAGAAAG GCTCGACGAC ATGGCCGAAC GAATCATGGC GGCGTACTTC
CTAGTAGACC AAGATAAAGA TTATCCAGAA GTCAACTTCG ATTCCTTCCG TTTGTCTGGA
TCCAATAATT CTCACGTTGA TGTAAGGGGT GACCACTGGA AGTGAGTAAA AACAGAGCAA
ACATTAAGGT AACGAAGGCT TATCTTGTAT ACAGAATCAT TCGGCAAATG GGAGCCGCGT
CGACTGTCCT TCTCAAGAAT GTCGACCACG CCCTACCTCT TAGGAAGCCT CGAAGTATGA
CACTTATTGG TTCTGATCTT GGTCCCTCTC TCCGTGGGCC AAACGGGTTT TCCGACCGCG
GCGGGCTGTC TGGTACCCTT GCTATGGGCT GGGGTTCTGG CACAGCCGAA TTCCCTTATC
TTGTCGACCC TCTTTCCTCC ATATCCCTCC AAGCACGGGA AGATGGAACC ATTCTCAATT
GGTGGCTCGA TGACTGGAAC CTCTCGAATG CTTCCTATTG GGCGTCGGTA GCTGAGGTCG
CTATCGTTGG TATCAACAGC GACTCTGGCG AAGGCTACAT AACCGTTGAT GATAACGAGG
GTGATCGAAA CAACCTGACT GCTTGGAACA ATGGTGATGA GCTAGTAAAA GCTGTGGCTT
CTGCGAACAA CAACACCATC GTCATAGTTC ATTCTGTCGG GCCCATGATC ATTGAGGACT
GGATCGACCA CCCTAACATC ACTGCTGTCC TCTGGGCCGG TCTGCCGGGG CAAGAATCAG
GCAACTCTCT CGTCGATGTT CTTTACGGGG CATACAACCC TTCCGCTAGA CTTCCTTACA
CGATCGCTAA GAAACGGGAA GACTACTCAG CCGATATTGA CTATGTCACT TCTGATATTC
CCGCCATAAC TCAGGTCAAC TATACAGAAG GCTTGTTCAT CGATTATCGA CATTTTTTGG
CAAAGGACAT CACTCCTAGG TATGAGTTTG GCTTCGGGAT GAGTTACACG AGCTTTGAAT
TTGGAGACGT TTCACTGGAG GAGATCAAGG AAGAGGGTGC AGGAGACGAC ATTTACAACT
TCCAAGAGGT CGATGATGGG ACAGTAAAGG CCGGTCGCTT TTTGCTTGAC TAGTGAGTGT
TCTCTCTTGA TGGCCCAATT ACACCACTAA TAACATTGTT AGCTTGCACA AGGCTCAATG
GACTGTTACC ATCGATATCA CTAACACTGG TGGGGTCAAC GGCTGTGAGG TTCCTCAACT
CTACTTAGCA TATCCAGCCG ACTCTGGGGA GCCACCCAAG GTAATGAGAG ACTTTGCCAG
GATCAATCTT GATCCTGGAG CTAGCCAAAC AGTTACATTC AACTTATCGA GATATGACGT
TTCTATTTGG GATGTTGAAA GCCAGAAGTG GACTATCCCT GACGGTACGT TTGTCGTTGA
AGTGGGGAAG AGCAGTATGG ACAAGGATGC GAAGACAGCT TCATTCTGCC CGGGGAGCTG
TTAAAATATT GGTATCATTA TTGCTTGGCA TGTTTTTTTG TTTTGTTTTT GTTTTTAATA
GCTAAAATTT AAATGTGGAC ATTTGAAAAT CTTCGTACAT ATAAGTTAAA TATAGTGGTA
AAGAATACAG AATAAGTGAA TATAGTAGTA AACAATGCAT GATAAGTAGT AAAATTG
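
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any per-base calculation such as GC content. The following is a minimal sketch of that step; the helper names clean_sequence and gc_fraction are hypothetical, and the page does not say how its own GC content figure was computed.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a short demo.
fragment = clean_sequence("GATAAACGTC TTAAACTGAA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.3
```

Pasting the full gene sequence into clean_sequence should give a string of 3117 bases, which can then be passed to gc_fraction and compared with the GC content field.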
 
Protein sequence
MLALPVTLLW LSTGFPLLTY ASPVTSDADN HTISATEANS SSSTSFLPHN TSRHHESSSQ 
SPITYISPQI AAPINATYVN PSHKWEKAYR KAKKYLDDWT LEEKVQLTTG VGWENGRCVG
NIGAIPSRNF SGLCLQDSPL GVRLADFVSA FPAGINAAAT FDKDLIYARG YALGQEFKGK
GVHVALGPMT NMGRVAAGGR NWEGFGGDPY LSGWATEMTV RGIQDAGVQA CVKHYVGNEQ
ERNRTTESSN IDDRTLREIY THPFLRAVQA DVASVMCSYN LINGSWACEN PKTLNGVLKT
DFGFQGYVLS DWGAQHSGVV SANNGLDMSM PGDIVLGSLT SYWGSNLTES VKNGSVSEER
LDDMAERIMA AYFLVDQDKD YPEVNFDSFR LSGSNNSHVD VRGDHWKIIR QMGAASTVLL
KNVDHALPLR KPRSMTLIGS DLGPSLRGPN GFSDRGGLSG TLAMGWGSGT AEFPYLVDPL
SSISLQARED GTILNWWLDD WNLSNASYWA SVAEVAIVGI NSDSGEGYIT VDDNEGDRNN
LTAWNNGDEL VKAVASANNN TIVIVHSVGP MIIEDWIDHP NITAVLWAGL PGQESGNSLV
DVLYGAYNPS ARLPYTIAKK REDYSADIDY VTSDIPAITQ VNYTEGLFID YRHFLAKDIT
PRYEFGFGMS YTSFEFGDVS LEEIKEEGAG DDIYNFQEVD DGTVKAGRFL LDYLHKAQWT
VTIDITNTGG VNGCEVPQLY LAYPADSGEP PKVMRDFARI NLDPGASQTV TFNLSRYDVS
IWDVESQKWT IPDGTFVVEV GKSSMDKDAK TASFCPGSC