Gene CNB05150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB05150 
Symbol 
ID3255555 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1466406 
End bp1469820 
Gene Length3415 bp 
Protein Length956 aa 
Translation table 
GC content49% 
IMG OID638255160 
Productalpha glucosidase, putative 
Protein accessionXP_569264 
Protein GI58264216 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.133027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGCAATCACA GCCCAAATAG GACCATCATG ATCCGCTCGT CGCTATTGTG GCTGACCGTC 
ATAGCCACTC TGGCGCTTCT ACCCCTCACT CTCGCTGTCA AGTCAGAGGA CTTTAAGCAA
TGTTCACAGA CCTCGTTCTG TCGACGTCTC AGATCAATCG CGTCAAAGCA GGAAGCTGCA
CCGGAGGGTA CTTTCAACTC TCCTTATTCT CTGGGAACAG CTGTTCCCAT ACAAGATGGC
GGCTCTTGGA AGTGGCCTAT TCAGTCGTCG CTTTATCCCG AAATAAACTT TGAGTTACGC
GTTGACGTGC TCAAGGAAGG CAATGGGATT GCCCGCGTCA GGATTGATGA GGTAAGGTCT
TCTACTCCTT TTAAAAGGTA CGACGAAACC GCCAAATGGG CATTGCTCAA TGCCGAACCT
GACGTCGACC CTTCTGCCTC GCTCAAAATG GCCAACGGTA AGAGCGTCAT CACGTATGGC
CCTTCTTCGT CCATCTCCCT CGAAGTCATC CACTCCCCAT TGAAGATCAC CCAATTGCGT
AATGGCAAAC CCGAGATCGT GTTCAACGAC CGATCCCTCT TCCACATGGA ACACTTCCGT
GTTAAGGACG TGGAATCTGC TAAAGAAGTC TTGGGAGAGG GAGAGCAGGC TGTCCTTGGA
GGCGATGCTC TCGATCGCAG CTGGTTTGAA GAGTCCGATT CCGATGCTTT CCAAGAGAAG
TGGAAGAAAT GGGTCGACAC CAAGCCCAAA GGGCCTGAGG GTTTTGCTCT CGACATTTCC
TTCCCAGGTG TCCAACACGT ATACGGACTT CCCGAACACG CTTCACCACT TTCACTCCCT
GACACTGTCG GTCCTAACGC GTACTACTCT GATCCTTACC GTCTATTTAA CGTTGACATC
TTCGAGTACC TTGCCGACTC TCCCATGTCT TTGTACGGTG CTATCCCTTT ACTCCATGCT
CATTCCAAGA CCCATTCTGT TGGTGTACTG AACCTTGTCG GCTCTGAAAC CTGGGTTGAC
GTCCTCCACA CCTCAACAGA TGTCAAGACC CATTGGATTT CCGAGTCGGG TATTCTCGAC
CTCCTCCTTC TTCCCGGTCC GTCCCCCACC GATCTCTTCA AGCAATACGC CATTCTCACG
GGACCCACTC CGCTTCCTCC CCAGTGGTCA ACGGCGTATC ACCAGTGCAG ATGGAACTAT
AATGATCAAG ACGACGTACT TGAAGTGGAC GCAAAGTTCG ATGAGGCGGA TATGCCCCTT
GATGTCACTT GGCTTGATAT TGAATATGCT GAGGAGCATA GATATTTTGA CTGGGACAAG
AAGCATTTCC CTAATCCTAA TGCTATGCTC GATGCCGTGG CTTCCAAAGG CCGAAAGGTA
TGTTTTCGCT TCACTTCATA CTGGACAAAG CTGACTCGGA AAATCAAGAT GGTTGCTATT
ATTGACCCTC ATATCAAGAG GACCGACTCT TTCAGGATCT ACTCCGACTC CAAAGACCTT
GACATACTCG TCAAGAAATC GGATGGCAAC AACTTTGAAG GATGGTGCTG GACCGGTAGC
AGCGCATGGG TGGACTTTTT CAACCCCAAG AGTTGGGATT GGTGGACCAA GATGTTTGAC
TTCAAGGTCT GGAAGGTGTG TTCTTGACGT CTCCGTATTG TAACTAAGGG AATGGTGACT
GAACCTTGAA GCAGGACTCT ACCAATGCGC TTTTCATCTG GAATGACATG AACGAACCTT
CCGTTTTTGA CGGCCCTGAA ATAAGTATGC CCAGGGACAA CATTCATGCT GGGGGATGGG
AGCACCGAGA TGTGCACAAC ATCAACGGTA TGCTCTTTGT ATGTATGGAA TACCTTGTCT
TCTGTGTTTG TACTCATTCT GGTGTCGTTC ACTAGCATAA GCAAACTTCC CAAGCTCTGA
TCAAACGGGA AAAACCCGCT CAGAGACCTT TCGTCCTTTC CCGATCTTTC TTTGCCGGTT
CTCAACGTTA CGGCGCCATC TGGTGACTAT GCATCAGTAT GACGACAATT GGACGAACGC
TGACATTGAT ATCAGGACCG GTGATAATCT TGGTGACTGG GAACACCTAG CTGGTGAGAC
AGCCATGCTC CTCTCCAACA ACATTGCTGG CATGTCCTTC TGCGGCGGTA CGTGCCTTTA
TTCCGTAGTA CGTATCTACC CATACTAATG CTTCTTCATT TTGGCATAGC CGATGTTGGC
GGCTTCTTTG GTAACCCCTC ACACGAACTC CTTGTCCGCT GGTATCAAGC GGGCGCTTTC
ATGCCTTTCT TCCGAGCCCA TGCTCACCTC GACACCAAGC GCCGTGAACC CTATCTCTTC
GAAGAACCCA TCAGGAGCTA CCTCAAGGAC GCTCTGAGGT TGAGATACGC CCTGTTACCT
GTTTGGTACA ATGCTTTCAA GGAGGCTAGT GTCTGGGGCT TGCCCATTAT GAGGCCGCAA
TATGCAGTCT TCCCCGGAGA TGAGAAAGGT TTCAAGATTG ATGATCAGTA TTACATTGGT
GGGGAGGGGC TTTTGTTCAA GCCTGTTGTT CAGGAAGGTG CTGTGACGAC TGACGTTTAC
ATTTCTGATG ATCAGGTAGG TTCATACATG ACTACTGTTT ATAGCTAATA TTTTTAATCA
TTTTTGTAGC CCTATTACGA TTACTTTACC CACCGCCTCT ACCCTTCCTC CCCCCAAACC
ACTTTGACAC TCCACACACC ACTTTCTACT TTCCCTCTCC TCCTTCGCGG TGGTCACATC
ATCCCCATCC GTCCTCGTCC CCGTCGATCA TCCCCTCTCA TGTGGCAAGA TCCCTTCACT
CTCATCATCG CCGTTGGTAA AGATGGCAAA GCGAAGGGCC AAGTGTATCT TGACGATGGG
GTTGGTTATG CGTATGAGAG TGGGGAGTTT GTCTGGAGGG TGTTTGAGCT TGATGGCAAA
GTGCTTCGCT CAAAGTCACA TGAAGAGGGC ATAAAGAGTA AGGAGACAGG CGTAGCTGTG
TATAAGCAAG AAAATGCCTT TGCCGAGGCT GTCTCGCACG TCAAGATCAA CTCAATTGTC
TTCCTTGGTC TGTCCGACAA ACCAACTAGC ATCAAATCGA ATGGTATAAA GCTCGAGTAT
GAATGGGAGG AAGGGGAGGA TGCGAAGGGC AAGAAGGAGG GTAGGGCCAG TGAACTTAGG
GTGAAGAACC CCGGTGTGGG AGTTGTGGAA GACTGGGAAA TTGTTTTTGC GTAGAGCAGT
GTTACGAAGA TACTCAGTAG ATGTAGGATA ACGAAGGAGC GGAAGGGAAA TTGTGTATAG
AAGTCGAAGA TATTTACAAA TTATTATCAA ATCAGCCTTT CACAGAAAGA ACAAATGGAA
GACTTTACTA TCATTGTTCT TGGGGTTAGC TAGCTATGTC ATGGCTTCCT GCAGT
 
Protein sequence
MIRSSLLWLT VIATLALLPL TLAVKSEDFK QCSQTSFCRR LRSIASKQEA APEGTFNSPY 
SLGTAVPIQD GGSWKWPIQS SLYPEINFEL RVDVLKEGNG IARVRIDEVR SSTPFKRYDE
TAKWALLNAE PDVDPSASLK MANGKSVITY GPSSSISLEV IHSPLKITQL RNGKPEIVFN
DRSLFHMEHF RVKDVESAKE VLGEGEQAVL GGDALDRSWF EESDSDAFQE KWKKWVDTKP
KGPEGFALDI SFPGVQHVYG LPEHASPLSL PDTVGPNAYY SDPYRLFNVD IFEYLADSPM
SLYGAIPLLH AHSKTHSVGV LNLVGSETWV DVLHTSTDVK THWISESGIL DLLLLPGPSP
TDLFKQYAIL TGPTPLPPQW STAYHQCRWN YNDQDDVLEV DAKFDEADMP LDVTWLDIEY
AEEHRYFDWD KKHFPNPNAM LDAVASKGRK MVAIIDPHIK RTDSFRIYSD SKDLDILVKK
SDGNNFEGWC WTGSSAWVDF FNPKSWDWWT KMFDFKVWKD STNALFIWND MNEPSVFDGP
EISMPRDNIH AGGWEHRDVH NINGMLFHKQ TSQALIKREK PAQRPFVLSR SFFAGSQRYG
AIWTGDNLGD WEHLAGETAM LLSNNIAGMS FCGADVGGFF GNPSHELLVR WYQAGAFMPF
FRAHAHLDTK RREPYLFEEP IRSYLKDALR LRYALLPVWY NAFKEASVWG LPIMRPQYAV
FPGDEKGFKI DDQYYIGGEG LLFKPVVQEG AVTTDVYISD DQPYYDYFTH RLYPSSPQTT
LTLHTPLSTF PLLLRGGHII PIRPRPRRSS PLMWQDPFTL IIAVGKDGKA KGQVYLDDGV
GYAYESGEFV WRVFELDGKV LRSKSHEEGI KSKETGVAVY KQENAFAEAV SHVKINSIVF
LGLSDKPTSI KSNGIKLEYE WEEGEDAKGK KEGRASELRV KNPGVGVVED WEIVFA