Gene CNC01640 details


Gene Information

Locus tag: CNC01640
Symbol:
ID: 3256516
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 451769
End bp: 454703
Gene Length: 2935 bp
Protein Length: 892 aa
Translation table:
GC content: 49%
IMG OID: 638255383
Product: glicosidase, putative
Protein accession: XP_569969
Protein GI: 58265626
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
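
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, that the gene span holds only coding sequence plus introns (no UTRs), and that the stop codon is counted in the span. The function names are hypothetical, and the intron total is only what those assumptions imply, not a value the record states.

```python
# Values copied from the Gene Information fields above.
START_BP = 451769
END_BP = 454703
PROTEIN_AA = 892


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)   # matches "Gene Length: 2935 bp"
cds_len = coding_length(PROTEIN_AA)        # 892 codons plus a stop codon
intron_len = gene_len - cds_len            # bases not accounted for by coding sequence

print(gene_len, cds_len, intron_len)       # prints: 2935 2679 256
```

Under these assumptions, the 256 bp difference between the gene span and the coding length is consistent with the record's "Is gene spliced: Yes".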


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTATCG TTGAGCGCGA ACTTTTTGGG TTCCAGCTTG CTACCAGTCC CTCCATAGGG 
GATTCAATCA CCTCGTTCTC TCTCGTCTCC TCCCCGGAGA GCATCTATAA AGACTTCACC
TACAGCATCT CTTTCCCTTT ACCCAATGTG TACCGCATTC TCCTCACTGG ACCCAATAGG
ACCAGACCGC TACAGGACAA TGTTGTACTA GATACCTCCG AAAGCAAGTT CAAGGTGATC
GCCTTGGATA ATGATGGGTG CGAAGCCACT TTCGATTTCC CTAGCCCCAT CTCCGAGAGC
TTAGACGGAG CCGAAAAGCG ACGCCGCCAA CTCCATCTTT CATGGAAAGA GCACATCACT
CTCACAACAT ACGAGACCCT TCTCTCCACG GGACAACAGC TTCGTCTTTT GGGAGACCTT
CCGAACAGGT CTTATGCCCT TACCGAACAT GGTGTAATGA GACACTGGTG GGTTGAAATT
GACAATCTCC ATTTGGGCTT GGGGGAGAAG GCAGCGCCTC TAGATCTATC AAACAGGTCT
TTCATGAATC ACGGCTCCGA CTCTGCTGCC TATGACGCCT ATCAAACCGA TCCATTGTAC
AAGCATACAC CCTACTTAAT CTCCACACCG AAGCCCACGT CCGAAGGCGA AGAACTCCCT
TCCACTTATG CCATTTATCA TCCTACAAAC GCTGGAGGGG AGTGGAATGT TCGCAGGCTT
CATGATGACC CTTGGGGGTA TTTCAAGTCT TACACTCAGG ATTATGGAGG ACTTGAAGAA
TGGGTCTTAG TGGGAAAGGG AGTCGAGCAG GTTGTAAGAA CATTTGCTGA GATTGTTGGA
AGGCCAAGAC TGGTGGGCAG GGATTGGTTA GGATATCTTG GTGAGTCTCA TCGAACCAAG
TATGTCATCT CCGATGCTGA CTCAATGAAA GCCTCCGGTA TGGGTCTTGG AGAAAGTGAC
GATCCTCCCG CTCAGGAATT GCTTTCTACC TGGCCTGAAC TATGCCGAAA ATACGACATA
CCTTGCTCAG CTATGCATGT AAGTCTCGTC AAAGGCTTTA CTATGTAAAT TAACTGACAT
GGGCCGGCAT TAGCTATCGT CGGGCTATAC GGCAGACAAA GATGGAAATC GCTGTGTTTT
CACCATGAAT ACAAAGCGAT ATCCAGACTT CAAGGGGATG GTCGCTCATT TTCATAAAGC
AGGGATCAAA GTGGTTCCCA ACATCAAACC ATGTGATTTC GTATTCGCTA TCTTAATGAT
GCTTTTTGAC TGACCCCGAG AAATAGATGT TCTTCAGACG CATCCTCATT ACAAGGATCT
CCATTCATCC AACGCGCTGT TCTACGATCC TTACTCTAAA TCTCCTGTGG TCACACGTAT
CTGGTCGTCA GGTGTGGGTG ATAATGAAAA GGGCAGCTGG GTGGATATGA CAAGTGAGGA
GGGACGGCAA TGGTGGGCTG ACGGAGTGAA GAGTCTGATT GATCTAGGGG TCGATGGGAT
GTGGGAGTAA GCCATTTGTC TTGAAGCCAA AGAGTTCAGC TAATGAAAAT CAGTGATAAC
AACGAGTATT ATCTCCACGA TGATTCCTTT GTTTGTGCGA CTTCTTTGCC CCACGAATTT
ACGTCCTCTC CTTCTGGACC GGTGACCATC GGCGAAATAG GTCGCATGAT CAATACTGAG
CTTGTCAACT ATGTTTCGCA CACGACCCTA GAAAAAGCCC ACCCAACCCG ACGAACCTAT
GTCCTGACCC GTTCCGGGAA TGTCGGCACA TTCAAATATG CCAACTCGAC CTGGTCAGGT
GACAATTACA CTTCCTGGCA CAACCTTCGT GGATCTCAAG CTATTCAGTT CAATGCCGGT
ATGTCGCTTA TGCAAAGCTA TGGATCCGAC ATTGGTGGGT TTGGAGGACC ATTACCGGGA
GAGGAGATGT TTGTGAGGTG GGTGCAGTTG GGGGTGACAC ATTCGAGGTT TTGTATCCAT
AGCTTCAAGC CCGATCAAAG TGATATTAGC GGAGTGGGAG CGACAAACAC CCCTTGGATG
GTGAGTACAT TGTCAAATTC GTCTAGAGCC AGTGCTGATG ACCACGCAGT ATCCGGCTGT
GTTGCCTATC ATTAGGGAGG CGATTAAATG GCGATACGAG CATCTTCCTT TCTTGTAAGC
CACCTTATCC GGTCCGTTCC AACTTCAGTA ACTTATGAAA TTTAGTAACT CACTCATGTG
GGAGTCTCAT CTTCGCGCTA CTCCCACTAC GACCTGGCTC GGTTTTGGCC CATTCGCGTC
TGATCCTGCA CTCTACGAAG CCTCCATCTT AGAAGGCTTT GACGCTTGGC TCGGCACTGG
TCGAATCCTA GTCTGCCCTG CGCTATTTGA AGGTCAACTC ACTCGAGAAG TGTACTTCCC
CAAAGCATGC ATGGACGACA AGAGCTTGTA CTTTGATCTC CATGCACCAT ACAGAACGTA
TAAAGCGGGA GACAGAGTAA TGATCGCAAC GCCAATGGAG CATATGGGCC TGTTTGCAAG
GGAGGGTGCG GTCGTGCCAG TAGGCAAGAG ATATCACACT GTCACTCAAA AGCAAGGTCC
AGCAAGGCAA ACTCCAGATG GGGTTGATGT TGTGCTCGAA GACGAAGGTG GTGTAGTCGG
ACTTGATGAC TGGCGAGGCG TGAAGATCTT CCCTGGCCAC GAAGGAAATG CCTACAGAGG
CCAGTGGACA GAGGACGATG GCATTTCAGC GACACCGGAC AAAACAGTGG TCGAAGTGGA
GTATGTGGGT GGAAAGGACG AGGTGACAAT TGGATTCAAG TTGACGGAGC ATAAGTTCAA
GACGCTGTGG GGTCGCAAAT TAGCGGTCCT GTTGCCAGTT GGGGACGAAA GATCAGTGAA
GGGTGGGAAG GAAGATGTAT GGGAAGGACG AAAAGTTTGG AATATTGAAT TTTAG
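
The gene sequence above is laid out in blocks of 10 bases, six blocks per line, so it has to be joined before it can be measured. The sketch below shows one way to do that and to compute GC content for comparison with the "GC content" field. The helper names are hypothetical, and only the first line of the sequence is used as sample input; pasting in the full sequence would be needed to check the reported 49%.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGCCTATCG TTGAGCGCGA ACTTTTTGGG TTCCAGCTTG CTACCAGTCC CTCCATAGGG"

seq = clean_sequence(first_line)
print(len(seq))                      # six blocks of 10 bases
print(round(gc_percent(seq), 1))     # GC percentage of this one line only
```

The same `clean_sequence` helper works for the protein sequence, since it only strips whitespace.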
 
Protein sequence
MPIVERELFG FQLATSPSIG DSITSFSLVS SPESIYKDFT YSISFPLPNV YRILLTGPNR 
TRPLQDNVVL DTSESKFKVI ALDNDGCEAT FDFPSPISES LDGAEKRRRQ LHLSWKEHIT
LTTYETLLST GQQLRLLGDL PNRSYALTEH GVMRHWWVEI DNLHLGLGEK AAPLDLSNRS
FMNHGSDSAA YDAYQTDPLY KHTPYLISTP KPTSEGEELP STYAIYHPTN AGGEWNVRRL
HDDPWGYFKS YTQDYGGLEE WVLVGKGVEQ VVRTFAEIVG RPRLVGRDWL GYLGESHRTK
YVISDADSMK ASGMGLGESD DPPAQELLST WPELCRKYDI PCSAMHLSSG YTADKDGNRC
VFTMNTKRYP DFKGMVAHFH KAGIKVVPNI KPYVLQTHPH YKDLHSSNAL FYDPYSKSPV
VTRIWSSGVG DNEKGSWVDM TSEEGRQWWA DGVKSLIDLG VDGMWDDNNE YYLHDDSFVC
ATSLPHEFTS SPSGPVTIGE IGRMINTELV NYVSHTTLEK AHPTRRTYVL TRSGNVGTFK
YANSTWSGDN YTSWHNLRGS QAIQFNAGMS LMQSYGSDIG GFGGPLPGEE MFVRWVQLGV
THSRFCIHSF KPDQSDISGV GATNTPWMYP AVLPIIREAI KWRYEHLPFF NSLMWESHLR
ATPTTTWLGF GPFASDPALY EASILEGFDA WLGTGRILVC PALFEGQLTR EVYFPKACMD
DKSLYFDLHA PYRTYKAGDR VMIATPMEHM GLFAREGAVV PVGKRYHTVT QKQGPARQTP
DGVDVVLEDE GGVVGLDDWR GVKIFPGHEG NAYRGQWTED DGISATPDKT VVEVEYVGGK
DEVTIGFKLT EHKFKTLWGR KLAVLLPVGD ERSVKGGKED VWEGRKVWNI EF