Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00370 |
Symbol | |
ID | 3259150 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 1082822 |
End bp | 1085696 |
Gene Length | 2875 bp |
Protein Length | 852 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258449 |
Product | beta-glucosidase, putative |
Protein accession | XP_572229 |
Protein GI | 58270146 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACGA ACAACACAAA CCTTGACAGG TCGTTCTTGA CCGCAAATAT CGATGATCTC CTTAAGCAGC TCACCACAGA AGAGAAGATC TCCCTCCTTG CAGGCAAGGA TTGGTGGACG TGAGTCTTGC TTGCTTGCTT GCTTTGATAG ACATAGGCTG ACGTACATTG TATAAAAGCA CTGTACCGAT TCCAAGGCTC AACATCCCTT CGTGAGGTGC TTATGCTACT TAGAAGTGCG GGTATACTGA TGTTGGAGTA GCATCAAGAT GACCGATGGC CCTGGTGGCG CTCGAGGAGA CTCATTTTAC CATATGAGTG AGTACCATGG ACTTGCTGGT TAAATGTTCA GGGCTGATCA GAATTAGCAC CGGCTTGTGC CCTTCCAAGC GCTACTTCTC TGGCCTCCAC TTTTTCTCCC GATTTAATCC ATTCAGCCGG TAATCTCCTA GCACTTGAAA CCCTTGCTCG CAATGCTGTC TGCCTTCTTG CTCCCACTAT CAACATCCAA CGTTCCCCGC TCGGTGGTCG AGCATTCGAA TCCTTCTCTG AAGACCCCAC TTTGTCAGGC TTGATTGCGG CCGCTTATGT TGCTGGTCTG CAGGAAGGTG GTGTTAGTGC GGCGATCAAG CATTTCGTGG GGAATGATCA AGAGCATGAA CGGATGGGCG AGGACTCTGT CATCGCGCCT AGGGCGTTGA GGGAGATTTA CCTTCGACCA TTCCAGATTG CACTCAAGAA GTCTAAACCA CAAGCATTCA TGACGGCTTA TAACAAACTC AATGGGACGC ATTGTTCGGA GAACGAATGG TTACTTGAGG AGCTGCTGAG AAAAGAATGG GGGTTCGATG GGCTGGTAAT GAGTGATTGG TATGGTACCT ATTCCATTTC CGAAAGTATC AACGCCGGGC TCAACCTCGA GATGCCCGGG GCCACCCGAT GGCGTCCCAA CGGACTAGTG ACTCACCTTA TCAAAGCGCA CAAGATCGAC CCCAGGCAGT TGGACAAGGT TGCAGGTGGT GTATTAAGAT GGGTACAGAA GCTTGTGAAG AAAAACGAGG AGTTGGTATA TTCACCGCCT GGCAAAGAGA AGACTAGAAC TGAGGATCAA GCAGAAGACG CCAAGTTGCT TCGTCGTTTG GCTGGGGAAA GCATTGTACT TCTCAAAAAT GAGTTGAATG TTCTCCCAAT CAGGGAACCC AAAAAGATCG CCGTCATCGG TCCCAACGCC AAGGCCAAGG TCCTCACCGG CGGTGGATCT GCGCAACTTC GCTCAGCCTG GTCGTCAACT CCCTGGCAAG GTCTCTCCGA TAATGCTCCC CAAGGTGTTG AACTTTCCTA TTCCCTTGGA TGTCACTCCG ACAAGTTCCT GCCTATACTT GACGACAGTT TTACCTGTCC TGATGGCTCA CCCGGCTTCC AACTTTCACA CTTCCCCATC ACCTCCTCAG GCGATAAAGC TGAGGAACCC GTACATGTCG AGACATGGGA CTCATCCGAC ATGTTCCTTG CTGACTTTAC CGCTCCTGGT CTGACTAAGG AGTACTTTAC CCAGCTTGAT GCCGTCTGGA CACCGGTGGA GGATGGAGAG TATGAATTTG GAGTGGTCGT TACTGGCAAG GGGTGGTTCT GGATCAATGG AGAACTTGTT ATCGATGCGT CAAGGGAAGA TGAAAGGTCA ACGAGTTTTT TTAACTTGGG AACGAAGGAA ATCAAAGGCC GAACCAGGGC TACAAAGAAC AAGGTGAGTT GTCCAACTGA CGTTACATTG TCACGGAAGA GAATTGATAT TGCTTGTCGT AGAGATATGA CATTCGCTTC CTCCACGACA CCCGGCCATA CTCAGTCAAC AACATTAACA CTCCCATCGC AAGTGCCGGT ATGCGTCTTG GCTACATCCA GGTTATCCCC GCTTCGACTC TCCTCTCTAA CGCCGTCTCT CTCGCTGCAT CCTCAGACGT CGCCTTGCTC GTTATCGGGC TTAACTCGGG CTGGGAATCA GAAGGATACG ACCGTCCTGA CCTTTCTTTA CCGATGGACA CAGACAAGCT CGTCAATGCC GTTGCGGAAG CGAACCCCAA CACGATTGTT GTCATTCAAG CGGGTTCTGC CGTTTCAATG CCGTGGCTCG ACAAAGTGAA AGGTGTGGTG TTTGCTTGGT ACCTGGGAAA CGAGACTGGT AATGCCATCG CCGACATTAT CTACGGGTAT ACCAACCCTT CTGGTCGCCT CCCGATGACA TTCCCCAAAC GAGAGCTTGA TATTCCCGCC AATCTGAACT ACAAATCGGC ACGCACCAGG GTTTATTACG ACGAAGGTAT CTGGGTAGGG TACAAGCATT TTAATGCAAG AGGTATCGAC CCTCTCTTTC CCTTCGGCCA TGGTCTTTCC TACACCACTT TTGCTTATTC CGGCCTTCAC ATCTCTCAAG TACCAGAATC ACCAAAGAAC GTCGGTGCTG ATGGATGGAG AGTGGAAGTC GGGGTGCAAG TGGAAAATAT TGGCATGGAA GAGGGAGCGC ACACAGTCAT GTTCTGGCTG AGTCCGCCAC CTGAGAGCCC GAACGGACTG AAGCACCCGA AGTGGACTCT GCAAGGGTTT CAAAAGGTTT ATGGGCTCAA ACCAGGTGCA AAAAGAGAGA TCAAGGTCAC GTTTGATAAG TGTAAGTTTT CTCTAGAAAC CTACAGCTAA GAAGACAGCT TATGAGTATT AGATGCGGTC TCACACTGGG ATGAGCTCTG GAACACTTGG AGGGCGGAGT TGGGAGAGTG GACTGTTCGG GTTGGCTTAG ACGCACAAAA CATCAGTGGC GAGAAGGCGA CATTCAAGAT TGAGGATGAT CTTGAGTGGA GAGGCTTGTA AAGACAGTTC AATAT
|
Protein sequence | MPTNNTNLDR SFLTANIDDL LKQLTTEEKI SLLAGKDWWT IKMTDGPGGA RGDSFYHMTP ACALPSATSL ASTFSPDLIH SAGNLLALET LARNAVCLLA PTINIQRSPL GGRAFESFSE DPTLSGLIAA AYVAGLQEGG VSAAIKHFVG NDQEHERMGE DSVIAPRALR EIYLRPFQIA LKKSKPQAFM TAYNKLNGTH CSENEWLLEE LLRKEWGFDG LVMSDWYGTY SISESINAGL NLEMPGATRW RPNGLVTHLI KAHKIDPRQL DKVAGGVLRW VQKLVKKNEE LVYSPPGKEK TRTEDQAEDA KLLRRLAGES IVLLKNELNV LPIREPKKIA VIGPNAKAKV LTGGGSAQLR SAWSSTPWQG LSDNAPQGVE LSYSLGCHSD KFLPILDDSF TCPDGSPGFQ LSHFPITSSG DKAEEPVHVE TWDSSDMFLA DFTAPGLTKE YFTQLDAVWT PVEDGEYEFG VVVTGKGWFW INGELVIDAS REDERSTSFF NLGTKEIKGR TRATKNKRYD IRFLHDTRPY SVNNINTPIA SAGMRLGYIQ VIPASTLLSN AVSLAASSDV ALLVIGLNSG WESEGYDRPD LSLPMDTDKL VNAVAEANPN TIVVIQAGSA VSMPWLDKVK GVVFAWYLGN ETGNAIADII YGYTNPSGRL PMTFPKRELD IPANLNYKSA RTRVYYDEGI WVGYKHFNAR GIDPLFPFGH GLSYTTFAYS GLHISQVPES PKNVGADGWR VEVGVQVENI GMEEGAHTVM FWLSPPPESP NGLKHPKWTL QGFQKVYGLK PGAKREIKVT FDKYAVSHWD ELWNTWRAEL GEWTVRVGLD AQNISGEKAT FKIEDDLEWR GL
|
| |