Gene CNH00370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00370 
Symbol 
ID3259150 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp1082822 
End bp1085696 
Gene Length2875 bp 
Protein Length852 aa 
Translation table 
GC content50% 
IMG OID638258449 
Productbeta-glucosidase, putative 
Protein accessionXP_572229 
Protein GI58270146 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACGA ACAACACAAA CCTTGACAGG TCGTTCTTGA CCGCAAATAT CGATGATCTC 
CTTAAGCAGC TCACCACAGA AGAGAAGATC TCCCTCCTTG CAGGCAAGGA TTGGTGGACG
TGAGTCTTGC TTGCTTGCTT GCTTTGATAG ACATAGGCTG ACGTACATTG TATAAAAGCA
CTGTACCGAT TCCAAGGCTC AACATCCCTT CGTGAGGTGC TTATGCTACT TAGAAGTGCG
GGTATACTGA TGTTGGAGTA GCATCAAGAT GACCGATGGC CCTGGTGGCG CTCGAGGAGA
CTCATTTTAC CATATGAGTG AGTACCATGG ACTTGCTGGT TAAATGTTCA GGGCTGATCA
GAATTAGCAC CGGCTTGTGC CCTTCCAAGC GCTACTTCTC TGGCCTCCAC TTTTTCTCCC
GATTTAATCC ATTCAGCCGG TAATCTCCTA GCACTTGAAA CCCTTGCTCG CAATGCTGTC
TGCCTTCTTG CTCCCACTAT CAACATCCAA CGTTCCCCGC TCGGTGGTCG AGCATTCGAA
TCCTTCTCTG AAGACCCCAC TTTGTCAGGC TTGATTGCGG CCGCTTATGT TGCTGGTCTG
CAGGAAGGTG GTGTTAGTGC GGCGATCAAG CATTTCGTGG GGAATGATCA AGAGCATGAA
CGGATGGGCG AGGACTCTGT CATCGCGCCT AGGGCGTTGA GGGAGATTTA CCTTCGACCA
TTCCAGATTG CACTCAAGAA GTCTAAACCA CAAGCATTCA TGACGGCTTA TAACAAACTC
AATGGGACGC ATTGTTCGGA GAACGAATGG TTACTTGAGG AGCTGCTGAG AAAAGAATGG
GGGTTCGATG GGCTGGTAAT GAGTGATTGG TATGGTACCT ATTCCATTTC CGAAAGTATC
AACGCCGGGC TCAACCTCGA GATGCCCGGG GCCACCCGAT GGCGTCCCAA CGGACTAGTG
ACTCACCTTA TCAAAGCGCA CAAGATCGAC CCCAGGCAGT TGGACAAGGT TGCAGGTGGT
GTATTAAGAT GGGTACAGAA GCTTGTGAAG AAAAACGAGG AGTTGGTATA TTCACCGCCT
GGCAAAGAGA AGACTAGAAC TGAGGATCAA GCAGAAGACG CCAAGTTGCT TCGTCGTTTG
GCTGGGGAAA GCATTGTACT TCTCAAAAAT GAGTTGAATG TTCTCCCAAT CAGGGAACCC
AAAAAGATCG CCGTCATCGG TCCCAACGCC AAGGCCAAGG TCCTCACCGG CGGTGGATCT
GCGCAACTTC GCTCAGCCTG GTCGTCAACT CCCTGGCAAG GTCTCTCCGA TAATGCTCCC
CAAGGTGTTG AACTTTCCTA TTCCCTTGGA TGTCACTCCG ACAAGTTCCT GCCTATACTT
GACGACAGTT TTACCTGTCC TGATGGCTCA CCCGGCTTCC AACTTTCACA CTTCCCCATC
ACCTCCTCAG GCGATAAAGC TGAGGAACCC GTACATGTCG AGACATGGGA CTCATCCGAC
ATGTTCCTTG CTGACTTTAC CGCTCCTGGT CTGACTAAGG AGTACTTTAC CCAGCTTGAT
GCCGTCTGGA CACCGGTGGA GGATGGAGAG TATGAATTTG GAGTGGTCGT TACTGGCAAG
GGGTGGTTCT GGATCAATGG AGAACTTGTT ATCGATGCGT CAAGGGAAGA TGAAAGGTCA
ACGAGTTTTT TTAACTTGGG AACGAAGGAA ATCAAAGGCC GAACCAGGGC TACAAAGAAC
AAGGTGAGTT GTCCAACTGA CGTTACATTG TCACGGAAGA GAATTGATAT TGCTTGTCGT
AGAGATATGA CATTCGCTTC CTCCACGACA CCCGGCCATA CTCAGTCAAC AACATTAACA
CTCCCATCGC AAGTGCCGGT ATGCGTCTTG GCTACATCCA GGTTATCCCC GCTTCGACTC
TCCTCTCTAA CGCCGTCTCT CTCGCTGCAT CCTCAGACGT CGCCTTGCTC GTTATCGGGC
TTAACTCGGG CTGGGAATCA GAAGGATACG ACCGTCCTGA CCTTTCTTTA CCGATGGACA
CAGACAAGCT CGTCAATGCC GTTGCGGAAG CGAACCCCAA CACGATTGTT GTCATTCAAG
CGGGTTCTGC CGTTTCAATG CCGTGGCTCG ACAAAGTGAA AGGTGTGGTG TTTGCTTGGT
ACCTGGGAAA CGAGACTGGT AATGCCATCG CCGACATTAT CTACGGGTAT ACCAACCCTT
CTGGTCGCCT CCCGATGACA TTCCCCAAAC GAGAGCTTGA TATTCCCGCC AATCTGAACT
ACAAATCGGC ACGCACCAGG GTTTATTACG ACGAAGGTAT CTGGGTAGGG TACAAGCATT
TTAATGCAAG AGGTATCGAC CCTCTCTTTC CCTTCGGCCA TGGTCTTTCC TACACCACTT
TTGCTTATTC CGGCCTTCAC ATCTCTCAAG TACCAGAATC ACCAAAGAAC GTCGGTGCTG
ATGGATGGAG AGTGGAAGTC GGGGTGCAAG TGGAAAATAT TGGCATGGAA GAGGGAGCGC
ACACAGTCAT GTTCTGGCTG AGTCCGCCAC CTGAGAGCCC GAACGGACTG AAGCACCCGA
AGTGGACTCT GCAAGGGTTT CAAAAGGTTT ATGGGCTCAA ACCAGGTGCA AAAAGAGAGA
TCAAGGTCAC GTTTGATAAG TGTAAGTTTT CTCTAGAAAC CTACAGCTAA GAAGACAGCT
TATGAGTATT AGATGCGGTC TCACACTGGG ATGAGCTCTG GAACACTTGG AGGGCGGAGT
TGGGAGAGTG GACTGTTCGG GTTGGCTTAG ACGCACAAAA CATCAGTGGC GAGAAGGCGA
CATTCAAGAT TGAGGATGAT CTTGAGTGGA GAGGCTTGTA AAGACAGTTC AATAT
 
Protein sequence
MPTNNTNLDR SFLTANIDDL LKQLTTEEKI SLLAGKDWWT IKMTDGPGGA RGDSFYHMTP 
ACALPSATSL ASTFSPDLIH SAGNLLALET LARNAVCLLA PTINIQRSPL GGRAFESFSE
DPTLSGLIAA AYVAGLQEGG VSAAIKHFVG NDQEHERMGE DSVIAPRALR EIYLRPFQIA
LKKSKPQAFM TAYNKLNGTH CSENEWLLEE LLRKEWGFDG LVMSDWYGTY SISESINAGL
NLEMPGATRW RPNGLVTHLI KAHKIDPRQL DKVAGGVLRW VQKLVKKNEE LVYSPPGKEK
TRTEDQAEDA KLLRRLAGES IVLLKNELNV LPIREPKKIA VIGPNAKAKV LTGGGSAQLR
SAWSSTPWQG LSDNAPQGVE LSYSLGCHSD KFLPILDDSF TCPDGSPGFQ LSHFPITSSG
DKAEEPVHVE TWDSSDMFLA DFTAPGLTKE YFTQLDAVWT PVEDGEYEFG VVVTGKGWFW
INGELVIDAS REDERSTSFF NLGTKEIKGR TRATKNKRYD IRFLHDTRPY SVNNINTPIA
SAGMRLGYIQ VIPASTLLSN AVSLAASSDV ALLVIGLNSG WESEGYDRPD LSLPMDTDKL
VNAVAEANPN TIVVIQAGSA VSMPWLDKVK GVVFAWYLGN ETGNAIADII YGYTNPSGRL
PMTFPKRELD IPANLNYKSA RTRVYYDEGI WVGYKHFNAR GIDPLFPFGH GLSYTTFAYS
GLHISQVPES PKNVGADGWR VEVGVQVENI GMEEGAHTVM FWLSPPPESP NGLKHPKWTL
QGFQKVYGLK PGAKREIKVT FDKYAVSHWD ELWNTWRAEL GEWTVRVGLD AQNISGEKAT
FKIEDDLEWR GL