Gene Information |
Locus tag | CNB05150 |
Symbol | |
ID | 3255555 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1466406 |
End bp | 1469820 |
Gene Length | 3415 bp |
Protein Length | 956 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255160 |
Product | alpha glucosidase, putative |
Protein accession | XP_569264 |
Protein GI | 58264216 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.133027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCAATCACA GCCCAAATAG GACCATCATG ATCCGCTCGT CGCTATTGTG GCTGACCGTC ATAGCCACTC TGGCGCTTCT ACCCCTCACT CTCGCTGTCA AGTCAGAGGA CTTTAAGCAA TGTTCACAGA CCTCGTTCTG TCGACGTCTC AGATCAATCG CGTCAAAGCA GGAAGCTGCA CCGGAGGGTA CTTTCAACTC TCCTTATTCT CTGGGAACAG CTGTTCCCAT ACAAGATGGC GGCTCTTGGA AGTGGCCTAT TCAGTCGTCG CTTTATCCCG AAATAAACTT TGAGTTACGC GTTGACGTGC TCAAGGAAGG CAATGGGATT GCCCGCGTCA GGATTGATGA GGTAAGGTCT TCTACTCCTT TTAAAAGGTA CGACGAAACC GCCAAATGGG CATTGCTCAA TGCCGAACCT GACGTCGACC CTTCTGCCTC GCTCAAAATG GCCAACGGTA AGAGCGTCAT CACGTATGGC CCTTCTTCGT CCATCTCCCT CGAAGTCATC CACTCCCCAT TGAAGATCAC CCAATTGCGT AATGGCAAAC CCGAGATCGT GTTCAACGAC CGATCCCTCT TCCACATGGA ACACTTCCGT GTTAAGGACG TGGAATCTGC TAAAGAAGTC TTGGGAGAGG GAGAGCAGGC TGTCCTTGGA GGCGATGCTC TCGATCGCAG CTGGTTTGAA GAGTCCGATT CCGATGCTTT CCAAGAGAAG TGGAAGAAAT GGGTCGACAC CAAGCCCAAA GGGCCTGAGG GTTTTGCTCT CGACATTTCC TTCCCAGGTG TCCAACACGT ATACGGACTT CCCGAACACG CTTCACCACT TTCACTCCCT GACACTGTCG GTCCTAACGC GTACTACTCT GATCCTTACC GTCTATTTAA CGTTGACATC TTCGAGTACC TTGCCGACTC TCCCATGTCT TTGTACGGTG CTATCCCTTT ACTCCATGCT CATTCCAAGA CCCATTCTGT TGGTGTACTG AACCTTGTCG GCTCTGAAAC CTGGGTTGAC GTCCTCCACA CCTCAACAGA TGTCAAGACC CATTGGATTT CCGAGTCGGG TATTCTCGAC CTCCTCCTTC TTCCCGGTCC GTCCCCCACC GATCTCTTCA AGCAATACGC CATTCTCACG GGACCCACTC CGCTTCCTCC CCAGTGGTCA ACGGCGTATC ACCAGTGCAG ATGGAACTAT AATGATCAAG ACGACGTACT TGAAGTGGAC GCAAAGTTCG ATGAGGCGGA TATGCCCCTT GATGTCACTT GGCTTGATAT TGAATATGCT GAGGAGCATA GATATTTTGA CTGGGACAAG AAGCATTTCC CTAATCCTAA TGCTATGCTC GATGCCGTGG CTTCCAAAGG CCGAAAGGTA TGTTTTCGCT TCACTTCATA CTGGACAAAG CTGACTCGGA AAATCAAGAT GGTTGCTATT ATTGACCCTC ATATCAAGAG GACCGACTCT TTCAGGATCT ACTCCGACTC CAAAGACCTT GACATACTCG TCAAGAAATC GGATGGCAAC AACTTTGAAG GATGGTGCTG GACCGGTAGC AGCGCATGGG TGGACTTTTT CAACCCCAAG AGTTGGGATT GGTGGACCAA GATGTTTGAC TTCAAGGTCT GGAAGGTGTG TTCTTGACGT CTCCGTATTG TAACTAAGGG AATGGTGACT GAACCTTGAA GCAGGACTCT ACCAATGCGC TTTTCATCTG GAATGACATG AACGAACCTT CCGTTTTTGA CGGCCCTGAA ATAAGTATGC CCAGGGACAA CATTCATGCT GGGGGATGGG 
AGCACCGAGA TGTGCACAAC ATCAACGGTA TGCTCTTTGT ATGTATGGAA TACCTTGTCT TCTGTGTTTG TACTCATTCT GGTGTCGTTC ACTAGCATAA GCAAACTTCC CAAGCTCTGA TCAAACGGGA AAAACCCGCT CAGAGACCTT TCGTCCTTTC CCGATCTTTC TTTGCCGGTT CTCAACGTTA CGGCGCCATC TGGTGACTAT GCATCAGTAT GACGACAATT GGACGAACGC TGACATTGAT ATCAGGACCG GTGATAATCT TGGTGACTGG GAACACCTAG CTGGTGAGAC AGCCATGCTC CTCTCCAACA ACATTGCTGG CATGTCCTTC TGCGGCGGTA CGTGCCTTTA TTCCGTAGTA CGTATCTACC CATACTAATG CTTCTTCATT TTGGCATAGC CGATGTTGGC GGCTTCTTTG GTAACCCCTC ACACGAACTC CTTGTCCGCT GGTATCAAGC GGGCGCTTTC ATGCCTTTCT TCCGAGCCCA TGCTCACCTC GACACCAAGC GCCGTGAACC CTATCTCTTC GAAGAACCCA TCAGGAGCTA CCTCAAGGAC GCTCTGAGGT TGAGATACGC CCTGTTACCT GTTTGGTACA ATGCTTTCAA GGAGGCTAGT GTCTGGGGCT TGCCCATTAT GAGGCCGCAA TATGCAGTCT TCCCCGGAGA TGAGAAAGGT TTCAAGATTG ATGATCAGTA TTACATTGGT GGGGAGGGGC TTTTGTTCAA GCCTGTTGTT CAGGAAGGTG CTGTGACGAC TGACGTTTAC ATTTCTGATG ATCAGGTAGG TTCATACATG ACTACTGTTT ATAGCTAATA TTTTTAATCA TTTTTGTAGC CCTATTACGA TTACTTTACC CACCGCCTCT ACCCTTCCTC CCCCCAAACC ACTTTGACAC TCCACACACC ACTTTCTACT TTCCCTCTCC TCCTTCGCGG TGGTCACATC ATCCCCATCC GTCCTCGTCC CCGTCGATCA TCCCCTCTCA TGTGGCAAGA TCCCTTCACT CTCATCATCG CCGTTGGTAA AGATGGCAAA GCGAAGGGCC AAGTGTATCT TGACGATGGG GTTGGTTATG CGTATGAGAG TGGGGAGTTT GTCTGGAGGG TGTTTGAGCT TGATGGCAAA GTGCTTCGCT CAAAGTCACA TGAAGAGGGC ATAAAGAGTA AGGAGACAGG CGTAGCTGTG TATAAGCAAG AAAATGCCTT TGCCGAGGCT GTCTCGCACG TCAAGATCAA CTCAATTGTC TTCCTTGGTC TGTCCGACAA ACCAACTAGC ATCAAATCGA ATGGTATAAA GCTCGAGTAT GAATGGGAGG AAGGGGAGGA TGCGAAGGGC AAGAAGGAGG GTAGGGCCAG TGAACTTAGG GTGAAGAACC CCGGTGTGGG AGTTGTGGAA GACTGGGAAA TTGTTTTTGC GTAGAGCAGT GTTACGAAGA TACTCAGTAG ATGTAGGATA ACGAAGGAGC GGAAGGGAAA TTGTGTATAG AAGTCGAAGA TATTTACAAA TTATTATCAA ATCAGCCTTT CACAGAAAGA ACAAATGGAA GACTTTACTA TCATTGTTCT TGGGGTTAGC TAGCTATGTC ATGGCTTCCT GCAGT
|
Protein sequence | MIRSSLLWLT VIATLALLPL TLAVKSEDFK QCSQTSFCRR LRSIASKQEA APEGTFNSPY SLGTAVPIQD GGSWKWPIQS SLYPEINFEL RVDVLKEGNG IARVRIDEVR SSTPFKRYDE TAKWALLNAE PDVDPSASLK MANGKSVITY GPSSSISLEV IHSPLKITQL RNGKPEIVFN DRSLFHMEHF RVKDVESAKE VLGEGEQAVL GGDALDRSWF EESDSDAFQE KWKKWVDTKP KGPEGFALDI SFPGVQHVYG LPEHASPLSL PDTVGPNAYY SDPYRLFNVD IFEYLADSPM SLYGAIPLLH AHSKTHSVGV LNLVGSETWV DVLHTSTDVK THWISESGIL DLLLLPGPSP TDLFKQYAIL TGPTPLPPQW STAYHQCRWN YNDQDDVLEV DAKFDEADMP LDVTWLDIEY AEEHRYFDWD KKHFPNPNAM LDAVASKGRK MVAIIDPHIK RTDSFRIYSD SKDLDILVKK SDGNNFEGWC WTGSSAWVDF FNPKSWDWWT KMFDFKVWKD STNALFIWND MNEPSVFDGP EISMPRDNIH AGGWEHRDVH NINGMLFHKQ TSQALIKREK PAQRPFVLSR SFFAGSQRYG AIWTGDNLGD WEHLAGETAM LLSNNIAGMS FCGADVGGFF GNPSHELLVR WYQAGAFMPF FRAHAHLDTK RREPYLFEEP IRSYLKDALR LRYALLPVWY NAFKEASVWG LPIMRPQYAV FPGDEKGFKI DDQYYIGGEG LLFKPVVQEG AVTTDVYISD DQPYYDYFTH RLYPSSPQTT LTLHTPLSTF PLLLRGGHII PIRPRPRRSS PLMWQDPFTL IIAVGKDGKA KGQVYLDDGV GYAYESGEFV WRVFELDGKV LRSKSHEEGI KSKETGVAVY KQENAFAEAV SHVKINSIVF LGLSDKPTSI KSNGIKLEYE WEEGEDAKGK KEGRASELRV KNPGVGVVED WEIVFA
|
| |
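The derived fields in this record (gene length, GC content) can be recomputed from the coordinates and the sequence text. The following is a minimal sketch, not the database's own method. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the listed values (1466406, 1469820, 3415 bp), and that the sequences are displayed in space-separated blocks of 10 residues. The function names `gene_length`, `strip_blocks`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def strip_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (blocks allowed)."""
    seq = strip_blocks(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(1466406, 1469820))  # 3415

    # First three blocks of the gene sequence, used here only as a short sample.
    sample = "AGCAATCACA GCCCAAATAG GACCATCATG"
    print(len(strip_blocks(sample)))  # 30
    print(gc_percent(sample))
```

Passing the full gene sequence text to `gc_percent` gives a value to compare against the GC content field. Because the gene is spliced, the gene length covers introns as well as coding sequence, so it is not expected to equal three times the protein length.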