Gene Information |
Locus tag | CNC02510 |
Symbol | |
ID | 3256463 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 717602 |
End bp | 720718 |
Gene Length | 3117 bp |
Protein Length | 819 aa |
Translation table | |
GC content | 47% |
IMG OID | 638255471 |
Product | beta-glucosidase, putative |
Protein accession | XP_569544 |
Protein GI | 58264776 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
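The Start bp, End bp, and Gene Length fields above are related by simple coordinate arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that matches the values in this record, not something the record states). The function names `gene_length` and `gc_percent` are hypothetical helpers introduced only for illustration. The GC helper shows how a percentage like the GC content field could be computed from a sequence; it is not necessarily how the database derived its value.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp and End bp fields of this record.
print(gene_length(717602, 720718))  # 3117, matching the Gene Length field
```

Because `gc_percent` strips whitespace first, the space-separated 10-base blocks of the Gene sequence field can be passed to it unchanged.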
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATAAACGTC TTAAACTGAA TTCTCTGTGG TGGGATACAT TATTTCCGTT CATCACAACC TAGCAGACCA TGGCGCACAT GGGTCTCAAT CAATCATCAG CTTCTCATTT GCCTTTCCAT GGAAATCCCT ATATAAACAA ACGTCTTGAA TAGGTCTCCC CTTCTCTTCT TCGTCTCTTA CGGATCTTCT TCTACGGCTC CTACTCTTCT CCCTTTTTTT CAATCAAGCT TTCTCAATCG GTGTGGCCTG ACAGCTTCGT CACTGCCTTG CTACAACAAT GCTGGCCCTG CCTGTCACCC TCCTATGGCT CAGCACCGGT TTTCCGCTTC TTACCTACGC CTCGCCGGTT ACATCCGATG CTGACAATCA TACTATAAGT GCTACTGAAG CAAACAGTAG TAGCTCGACT TCATTCCTTC CTCACAATAC CTCTCGACAT CATGAATCAA GTTCACAATC CCCCATCACA TATATTTCTC CTCAGATAGC GGCCCCTATT AATGCCACCT ACGTTAATCC CTCTCACAAG TGGGAAAAGG CTTATCGAAA GGCCAAAAAG TATCTTGACG ATTGGACATT AGAGGAGAAA GTACAGCTCA CAACCGGCGT GGGATGGGAA AACGGTCGTT GTGTAGGCAA CATCGGGGCC ATACCAAGCA GAAATTTTTC GGGTTTATGT CTTCAGGATT CGCCATTAGG TGTTAGGTTG GCGGATTTTG TTTCTGCTTT TCCAGCTGGA ATTAATGCTG CCGCAACGTG AGACACCGTT TCAACTTTTT TTTGCCTATG TATCTCACCT TCACAACATA GGTTTGACAA GGATCTTATC TACGCTCGGG GCTATGCTTT GGGCCAAGAA TTCAAAGGCA AAGGTGTGCA TGTTGCTTTA GGTCCAATGA CAAATATGGG ACGTGTGGCT GCTGGCGGCA GAAACTGGGA AGGCTTTGGT GGCGATCCCT ACCTAAGTGG CTGGGCAACT GAAATGACTG TTCGAGGTAT ACAAGATGCT GGTGTACAAG CCTGGTGAGC CTTTTTCTTC TGCAGCTTCC TATCTGACAG CTAATGTTAC TAGTGTCAAG CACTACGTTG GAAATGAGCA GGAACGCAAT CGGACCACAG AATCCTCTAA TATCGACGAC CGTACTCTTC GGGAGATATA CACCCATCCC TTCTTACGTG CAGTACAAGC GGATGTGGCT TCTGTAATGT GTTCTTACAA CCTCATCAAT GGTTCATGGG CGTGCGAAAA CCCCAAAACG CTCAATGGGG TACTGAAGAC TGACTTTGGT TTCCAAGGCT ACGTCTTGTC CGACTGGGGT GCTCAGCATT CTGGAGTTGT CTCTGCCAAC AACGGTCTGG ATATGTCCAT GCCTGGAGAC ATCGTTCTCG GAAGTCTGAC TAGCTACTGG GGTTCCAATT TGACAGAGTC AGTCAAAAAC GGGAGCGTGA GCGAAGAAAG GCTCGACGAC ATGGCCGAAC GAATCATGGC GGCGTACTTC CTAGTAGACC AAGATAAAGA TTATCCAGAA GTCAACTTCG ATTCCTTCCG TTTGTCTGGA TCCAATAATT CTCACGTTGA TGTAAGGGGT GACCACTGGA AGTGAGTAAA AACAGAGCAA ACATTAAGGT AACGAAGGCT TATCTTGTAT ACAGAATCAT TCGGCAAATG GGAGCCGCGT CGACTGTCCT TCTCAAGAAT GTCGACCACG CCCTACCTCT TAGGAAGCCT CGAAGTATGA CACTTATTGG TTCTGATCTT GGTCCCTCTC TCCGTGGGCC AAACGGGTTT TCCGACCGCG 
GCGGGCTGTC TGGTACCCTT GCTATGGGCT GGGGTTCTGG CACAGCCGAA TTCCCTTATC TTGTCGACCC TCTTTCCTCC ATATCCCTCC AAGCACGGGA AGATGGAACC ATTCTCAATT GGTGGCTCGA TGACTGGAAC CTCTCGAATG CTTCCTATTG GGCGTCGGTA GCTGAGGTCG CTATCGTTGG TATCAACAGC GACTCTGGCG AAGGCTACAT AACCGTTGAT GATAACGAGG GTGATCGAAA CAACCTGACT GCTTGGAACA ATGGTGATGA GCTAGTAAAA GCTGTGGCTT CTGCGAACAA CAACACCATC GTCATAGTTC ATTCTGTCGG GCCCATGATC ATTGAGGACT GGATCGACCA CCCTAACATC ACTGCTGTCC TCTGGGCCGG TCTGCCGGGG CAAGAATCAG GCAACTCTCT CGTCGATGTT CTTTACGGGG CATACAACCC TTCCGCTAGA CTTCCTTACA CGATCGCTAA GAAACGGGAA GACTACTCAG CCGATATTGA CTATGTCACT TCTGATATTC CCGCCATAAC TCAGGTCAAC TATACAGAAG GCTTGTTCAT CGATTATCGA CATTTTTTGG CAAAGGACAT CACTCCTAGG TATGAGTTTG GCTTCGGGAT GAGTTACACG AGCTTTGAAT TTGGAGACGT TTCACTGGAG GAGATCAAGG AAGAGGGTGC AGGAGACGAC ATTTACAACT TCCAAGAGGT CGATGATGGG ACAGTAAAGG CCGGTCGCTT TTTGCTTGAC TAGTGAGTGT TCTCTCTTGA TGGCCCAATT ACACCACTAA TAACATTGTT AGCTTGCACA AGGCTCAATG GACTGTTACC ATCGATATCA CTAACACTGG TGGGGTCAAC GGCTGTGAGG TTCCTCAACT CTACTTAGCA TATCCAGCCG ACTCTGGGGA GCCACCCAAG GTAATGAGAG ACTTTGCCAG GATCAATCTT GATCCTGGAG CTAGCCAAAC AGTTACATTC AACTTATCGA GATATGACGT TTCTATTTGG GATGTTGAAA GCCAGAAGTG GACTATCCCT GACGGTACGT TTGTCGTTGA AGTGGGGAAG AGCAGTATGG ACAAGGATGC GAAGACAGCT TCATTCTGCC CGGGGAGCTG TTAAAATATT GGTATCATTA TTGCTTGGCA TGTTTTTTTG TTTTGTTTTT GTTTTTAATA GCTAAAATTT AAATGTGGAC ATTTGAAAAT CTTCGTACAT ATAAGTTAAA TATAGTGGTA AAGAATACAG AATAAGTGAA TATAGTAGTA AACAATGCAT GATAAGTAGT AAAATTG
|
Protein sequence | MLALPVTLLW LSTGFPLLTY ASPVTSDADN HTISATEANS SSSTSFLPHN TSRHHESSSQ SPITYISPQI AAPINATYVN PSHKWEKAYR KAKKYLDDWT LEEKVQLTTG VGWENGRCVG NIGAIPSRNF SGLCLQDSPL GVRLADFVSA FPAGINAAAT FDKDLIYARG YALGQEFKGK GVHVALGPMT NMGRVAAGGR NWEGFGGDPY LSGWATEMTV RGIQDAGVQA CVKHYVGNEQ ERNRTTESSN IDDRTLREIY THPFLRAVQA DVASVMCSYN LINGSWACEN PKTLNGVLKT DFGFQGYVLS DWGAQHSGVV SANNGLDMSM PGDIVLGSLT SYWGSNLTES VKNGSVSEER LDDMAERIMA AYFLVDQDKD YPEVNFDSFR LSGSNNSHVD VRGDHWKIIR QMGAASTVLL KNVDHALPLR KPRSMTLIGS DLGPSLRGPN GFSDRGGLSG TLAMGWGSGT AEFPYLVDPL SSISLQARED GTILNWWLDD WNLSNASYWA SVAEVAIVGI NSDSGEGYIT VDDNEGDRNN LTAWNNGDEL VKAVASANNN TIVIVHSVGP MIIEDWIDHP NITAVLWAGL PGQESGNSLV DVLYGAYNPS ARLPYTIAKK REDYSADIDY VTSDIPAITQ VNYTEGLFID YRHFLAKDIT PRYEFGFGMS YTSFEFGDVS LEEIKEEGAG DDIYNFQEVD DGTVKAGRFL LDYLHKAQWT VTIDITNTGG VNGCEVPQLY LAYPADSGEP PKVMRDFARI NLDPGASQTV TFNLSRYDVS IWDVESQKWT IPDGTFVVEV GKSSMDKDAK TASFCPGSC