Gene Information |
Locus tag | CNC04500 |
Symbol | |
ID | 3256569 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1361743 |
End bp | 1368054 |
Gene Length | 6312 bp |
Protein Length | 1581 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255671 |
Product | UDP-glucose:sterol glucosyltransferase, putative |
Protein accession | XP_570011 |
Protein GI | 58265710 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCAC CCATATCCCC CACCCCCCCT CCGCTTCAGC CTCCGTTCCC ACCCACTGCC ATCGCTCGTG GTCCCGATCG TCCCGATCCT CCTCCCCAAC ACCAGCAGGC CGCCGAGTCA CTCGTCAATG CCGCCGCCCA GCATGTCGCC CCGACTTGTC CTCCGACATC CGATGAGTTA CCGCAGATGG AAGACCAAGC CACAAATTCT TCTAATGACT CTTTAATACC AAGTCGGCAA GCCCCCGACC AAGAAGAGAC AGAAAATGCG ATCACAGCGG GGACTCCGCC CGATGAGATG GAACCAACAA AGGACGCCCA AACTGTACGA TTCTCATCCT CTTCCCCGGC ATCCTACTCT ACACACGAAT ATCCCACAGA AGGAATCAAT GAACCGCGAA CGTCTTCTCG GGCGCCGAAT ACAGCTTCTT CCCAGATGGC TGAGTCCAGC TGTGATTTTA GATCTTCCCG AGATATTGGT TCCGTAAGTT CCTTTCTTGT GCAGTCACTT TTTTGTGTGC TCTGTACGGC TGTCAGGTGT GAAGCATTCT CCTGGACCTT TACACTTATG ATGCTCACGT GTCCGCTTCC CCCTACAAGT GGTAATGGGC GCAGTGAACC GTTTCAGATT TGCTGTTAAT TGAAGATTTC CGAGATCTTT GTTGGCCTAC ATGCAAAAAG CGTGCAATAT CAATGCATTG CTTGAGGGCC TGCTTCTTTT GCCAAAAAAC AGTGGCGACC GGTGGACGTT GGTCATATCG ACCGAAGTCG ACGTCACCTA TTACGGTACC CACGGTCTAT GCCCGAAGGC CGTCCTTTGT GTTTGTTGTC TGTCATCTTA TCCTTTTGTT ATTTTTCAGA GGTGGACGAG TGCTGACCGA AGCCCGCCCT TCTTTATTCT TGCCACTCTG TGTATACCCG TCATCCACAA TTAATTGGCT ACTCCACCGA CCTCGGTCCA TGTACAACAG ATCCGCATGG GCGCTTCAGC TCTTGTATCT GCCCTCAATG CTCTTCCTTG GGAGGAAGAT GACGACTCTG ATGATGGGGA GGACGATGAT GAATTTATAG AGCCAGCCCG AGGCTCTTCT TCAACGATAT ACGAGAGAAA ACAACGGCCA CAAAGTAAGT CAATCTGCTA CTTTTCATTG GATCCATGCT CACGCTATTC GCTATTTGTC TAGTTTCTCG CCTTGGGTCG TCTCTCAATA CCATCAGACC GATGCCTTCG TCTAGCACGA CTCATGCTAC AGCACCTTCT ACTTCCCATG GCTTTCATCC CACACACACT CTTCATTTCC CTTTCCGCCA AAACGCTATC GCACGCAGAG CTTGCCAGCC TGGCACAACA GAGCTAGATT ACCAATACGC TACTCCCGAG ACATCGTCTA GAAGGACGAG TGCGGCTGGG AGCGAGTCAA GCTCGGAGGG TGAGGTTCCA CTTCCAAAAG GCTTTGTCTC TCATCCCAAC CTTATTGTAC CGAGTGGGGA AGGCGAAGCT GCAGCACACC CTGATCCCAA GCTGATCAGC GATAGGATAA CGAAAGAGCA GCAGATTGCA GATGTAGAGG AACAAGCAGA AATTCTCAGG TCTGCCGAGG AACAAGAAAT GAGGCTGGGA AAGGAGTTTG TACCGCCCAA AAGCCGCGAC TCGGCGGATT TAAATGTAGA CGCTGCTTTG AGAGAAGGTG GAAGCGAGAG GGAGGATGTG ATTGAAGAGC AGATGCAGAC CAACGAGGCA GAGAAGAGGC TCACAAGAAA TGAGAAATTG GCAGAACGGC TTATGGAGGT GTTTGGTCTT 
GAAGAACGGG AAGAGGTATT GGAAGAGATG AAATGTTGGT TGCTGAGGAG CGTCAGTAAG TAGCCGATCC CTTATGCCAT GACGAGGTGG TTGATGTTCC ATTAGTGCTT AAAGGTTACA TGTACCTCAC GAAAAGGCAT ATATGCTTCT TCGCCAACAT GCCCAATGAG AATGTACGTC AAGAACTGCA GTGCGATAAG CTAATGATGA TGAAAGAACC TGCTCGTTAA ATCCGGTCCT TTGCACAAGA AAGCATCCCG TTCAAAACTG AATACAAAGT TCTGGGTTGT ACTGAAGAAC GACGTTCTTA GTTGGTATGA GAGTACCTCA GATCCTTATT TCCCAAAAGG CAACATTTCT CTCCAATACT GCCATAGCTG TGACGCTGTG AGCGGCACTA GGTTCAAAGT TCGAACATCG GAGAGAAATT ACACTTTCAC GGCGGATACC GAGTCAAGTA GGGACGAGTG GGTCAAGGCA ATCCAAAAGG TCATGTTTAA AACGCAGCAC GAGGGCGAAA CTATAAAGGT CATTTATTTC AAGTGTTATT ACCTGAAGGC GTGTGCTAAA TTATTTAATA GCTCATTATC CCTCTTGAAG CAATAGTAGA TGTCGAAAAG AGTCCGACTC TAGAGTTTAC AGAGACAATA GAGGTTAGCC TCCCGTTTCA TCTCAAATGC CGATCGTTGA CCAATCATTT CATTACCAGG TCAAATGCAT TGACGCGGAA GATCAAATGT CCGTCGATAG CTACTTTTTC GCATCATTTC CCGATAACGA CTATGCCTTC TCTGCCATCC AAAAGCTTGT GCGAGAACGA CCATCACCAC CTGAGCTTCC CCGCATATCC TCTGTCACTA CAATCCATGC TAATCAGGAG CCGCTTGACA CCAGTCATGC TACTATCAAG CGCCATGGGA CGGATTCCTC AGCGGAAAAA CTGGGGATGG CTAGTCATCG GCCTTTCCGG AAAATTAGTT CAGTTTTGAA ACCATTGATT TTGAAATCCA GTGATGGAGA ACCACTTGAG GAGCACAGCC AAGGTCCTCA TCATAATGAC GAAGACGCAT CGCACCTACC TCATATAGAG GCCATTTCAA ATAGGCGACG GTCAGAAGAA GAGTCCGACA ACGATTATTT CGACGGCTAT CCACCTCGGC AGGTTGGCCC TCCTCCTCCG TCAATGAATG ATGACGCTCG TAATTGGAGG CCCTCATGGA TTCGTAAACC GGCATCCAAA TTGTTCGGTA GTTCTCCAAG CGGTTCGTTC GTCTCCCACC CTGGTCGATT GCCCACGGAT AGTTCGACTA CTGTCACCGA GTCAGGTCCT AGCTTACGTA GTCGAACTGG TCGGACAAAG CAAGCATCCG TAACAGAAGT GATGGAGCCT CCTATTCAGT ATGAAGAAGA GGTCAGTGAG GACGAAATGT CAAACAAACC TTCTGTTGTA GACAGTAATT CAGCGGAGAC GGCGAGGAAG AGAGCGGCAA GGTTATCTTG GACGTCGGAA ACTTCTAGTG GAAGTCAAAT GGTCAAGAGT AAATCGGACT TTTCAATGCT GGGCTCTGAG AGTGGGCATA GTGAGAGCGC GGAAACGGTA AGAAAGTTTA GGACATTCTT TGCCTTGAGT GACAAGGAGG AATTGATTGA TCGTGGGTGT CTCTTTCAGA GAAATCTGCT CTTACTCTCT AACGCTGGCT ATCGCCTAAG ACTTTCCTGG CTACCTTTAC CGAGTATTGC CTGTATCAGG AAGGTTTTTC ATCTCTACCA ATTACTTTTG CTTTCGGTCA TCTCAACTGC TCTATAAAAC 
AAAGGAAAGT TTTAGACTGG TTGACAACTG CATAAGGCAG CTGATGGTGT ATGTGTAGAT GATCATTCCG ATTCGCGACC TTTACGGCCT CAAAGCACAG AAAGCTTTCC GCTTTGGACA TTCTGGTCTT ACTGTGGTCA TTAAAGGGCA CGAGGAGATT TTCATTGAGT TCCGTTCCGC CAGTCGTCGA AAAGCCTGCA TCGCCCTACT TGAAGAACGT ATGGAGGCTG TTCGACTCAG TGGAGAAAAC ACTATTGTTG ACTCTCACAA GATTGAAGCA CGTATAATGG AGGACCTTGA TGAGTCTACA CCGGTTGAAC CCAAGTCCCC TTGGCCTGTG TCACCTTCGC CCCTCTTTGG TTCCACAACT TCCACCTCAT TCCTGGAGTT TAAACCAGAG CCAATGAAGA TCACTTGCTT GACTATTGGC AGCCGGGGAG ATGTCCAGCC ATACATCGCG TTGTGCAAAG GATTACAGGC AGAAGGCCAC ATTACCAAAA TTGCTACTCA CGGAGAGTAC AAAGCATGGG TTGAAGGAGT GAGTTCAGTT CCGTGCTCTT TCTATTGGCA GGCTTGGCTA ATCTATGGAT AGCATGGCAT CGCCTTCGAA AGTGTTGGTG GTGATCCTGC TGAGCTCATG CAAATGTGTG TAGATAACGG AATGTTCACG GTCTCTTTCC TCAAGGAAGG TCTCCAAAAG GTATCTTTCA TTCTGCTTTT GCACGGCTTT TGCTCACGCA ATCATGCAGT TCCGAGGGTG GTTGGATGAC CTTCTCAATT CTTCATGGGA AGCTTGCCAA GGATCTGACT TACTGATTGA ATCTCCAAGC GCAATGAGCG GCATCCACGT CGCAGAGGCA TTAAGAATAC CGTATTACCG AGCTTTCACG GTGTGTGCCT ACCATTGGGA AAGTATGGCG ATGACTGATC AACCGAGATA GATGCCTTGG ACCCGGACGA GAGCCTATCC TGTGAGCAGC CATGGACGCC ATTCTTGCAA CCTCTTCTGA CATATTCGCA GCACGCGTTT GCTGTTCCTG AACATGGTCG CGGAGGCCCA TATAACTATA TGACTTACAC AATGTTCGAT CAAGTCTTCT GGCGAGCAAT CTCTGGACAA GTAAATCGGT GGAGACGTAA CGTCCTGGGC CTTGATGCCA CTACATTTGA CAAAATGGAG CAGCACAAGG TTCCCTTCCT GTATAACTTC TCGCCTACTG TGGTACCACC CCCTCTTGAT TGGACAGAGT GGATTCATGT CACTGGATAT TGGTTCCTGG ATAAAGCAGA CGAGAAGCAG GGGGAGAAGA GTTGGACACC TCCGCAGGGC CTTGTTGACT TTATTGACAA GGCGCATGGG GAGGAAAAGA AAGTGGTATA CATGTGAGTC GTCGTTAGCA CCACCTTGTC TAGTTTACTG ACTAGAATAA CATCGAATTT AGCGGGTTTG GTTCGATTGT TGTGTCTGAT CCAGAAGAAA TGACTCGCTG TGTTGTGGAA GCAGTAGTGA ATAGCGGCGT GTGCGCAATT TTGTCAAAGG GCTGGTCTGA TCGAGGGTCG AAAAAGGGGG AGCCGAAAGG GGATTCAGAG GGAGCTGATG GTGTCAAATA TCCCCCTGAA ATTTTGTAAG CACCGTTTGC CAATGGTCAA ATTTGAAAAC TAATCATCAT CGCCTGTTCT AGCGCTATCG ACTCCATAGA TCATGGTTGG CTGTTCCCAC GAATTGATGC AGCTTGTCAT CATGGCGGTG CTGGGACAAC GGGAGCAAGT TTGAGGGGTA AGCTCTTCTT TTTGGGATTG TAGAAAGATT 
ACTGATTTTG TGTTGCAGCT GGAATTGTAA GTAGGATTGA TTATAAAAGT ATTTGTTCCA CTTCGAACTG ACCTTTTCTT AGCCTACAAT CATCAAACCG TTCTTCGGCG ATCAAGCGTT CTGGGCAGAA CGGGTTGAAA GTCTGAATGT TGGCTCGTCC ATCCGCAGGT TGACGTCTCA TCAACTTGCA TCTGCCTTGA TCAAGGCGAC TACCGATGAG AAGCAGATTT CGAAAGCGAG AGTAGTTGGT GAAATGATCA GGAAGGAAAA TGGTATAACG AGGGCTATAG AAGCCATCTA CCGTGATCTG GTGAGTCGGT CAATTAGTGC GGAACATGTC TGACTTTGGT GTTAGGAATA CGCCAAATCA ATCATCAAAT CCCTTCCATC TACCGATGAC AGAACCCCGG AGAGGATTTC TAGTCACCTT CATCCTTTAA CCACTGCCGA CCTCAGCTTT AACCGGGTAC GTTCACGATC ACGCTCTCGT TCTCGTTCTT CACAAGGTAG ATTTTCCCCA CGGCGTCACA CTGTGGACGA TGATGGCTGG TCAGTTGTCT CTGGCGGTAG TCGAAGTCGT AGCGGCAGTG CTAGTGCTGT GACCAGTCCT GAACGCCGTC CACTGAATAT TGGATCTGCA CTTGGAAGTC ATGTTTTTAA AACTGCCTTA TTACCTAATA CATTTGGGAA ATGGAGGAAC TTGGAGGAAG GGGATGACAG ATAGATAATG TTTTTATTAT TATGGAACTG TACAAATATA ATTAGCAATG CCAGGTTGTA AAGCTTTAGT CAATATGAAT CTCTTGTATG TATACAGAGG TATGAGCACA AAATGGCGGT GCCCTTGTAC CTGTGTCTGT TATCCGCTGC GAGGTATGGT TTTAACGAAC GA
|
Protein sequence | MSPPISPTPP PLQPPFPPTA IARGPDRPDP PPQHQQAAES LVNAAAQHVA PTCPPTSDEL PQMEDQATNS SNDSLIPSRQ APDQEETENA ITAGTPPDEM EPTKDAQTVR FSSSSPASYS THEYPTEGIN EPRTSSRAPN TASSQMAESS CDFRSSRDIG SIRMGASALV SALNALPWEE DDDSDDGEDD DEFIEPARGS SSTIYERKQR PQTPSTSHGF HPTHTLHFPF RQNAIARRAC QPGTTELDYQ YATPETSSRR TSAAGSESSS EGEVPLPKGF VSHPNLIVPS GEGEAAAHPD PKLISDRITK EQQIADVEEQ AEILRSAEEQ EMRLGKEFVP PKSRDSADLN VDAALREGGS EREDVIEEQM QTNEAEKRLT RNEKLAERLM EVFGLEEREE VLEEMKCWLL RSVMLKGYMY LTKRHICFFA NMPNENNLLV KSGPLHKKAS RSKLNTKFWV VLKNDVLSWY ESTSDPYFPK GNISLQYCHS CDAVSGTRFK VRTSERNYTF TADTESSRDE WVKAIQKVMF KTQHEGETIK LIIPLEAIVD VEKSPTLEFT ETIEVKCIDA EDQIYFFASF PDNDYAFSAI QKLVRERPSP PELPRISSVT TIHANQEPLD TSHATIKRHG TDSSAEKLGM ASHRPFRKIS SVLKPLILKS SDGEPLEEHS QGPHHNDEDA SHLPHIEAIS NRRRSEEESD NDYFDGYPPR QVGPPPPSMN DDARNWRPSW IRKPASKLFG SSPSGSFVSH PGRLPTDSST TVTESGPSLR SRTGRTKQAS VTEVMEPPIQ YEEEVSEDEM SNKPSVVDSN SAETARKRAA RLSWTSETSS GSQMVKSKSD FSMLGSESGH SESAETVRKF RTFFALSDKE ELIDHFPGYL YRVLPVSGRF FISTNYFCFR SSQLLYKTKE SFRLMIIPIR DLYGLKAQKA FRFGHSGLTV VIKGHEEIFI EFRSASRRKA CIALLEERME AVRLSGENTI VDSHKIEARI MEDLDESTPV EPKSPWPVSP SPLFGSTTST SFLEFKPEPM KITCLTIGSR GDVQPYIALC KGLQAEGHIT KIATHGEYKA WVEGHGIAFE SVGGDPAELM QMCVDNGMFT VSFLKEGLQK FRGWLDDLLN SSWEACQGSD LLIESPSAMS GIHVAEALRI PYYRAFTMPW TRTRAYPHAF AVPEHGRGGP YNYMTYTMFD QVFWRAISGQ VNRWRRNVLG LDATTFDKME QHKVPFLYNF SPTVVPPPLD WTEWIHVTGY WFLDKADEKQ GEKSWTPPQG LVDFIDKAHG EEKKVVYIGF GSIVVSDPEE MTRCVVEAVV NSGVCAILSK GWSDRGSKKG EPKGDSEGAD GVKYPPEIFA IDSIDHGWLF PRIDAACHHG GAGTTGASLR AGIPTIIKPF FGDQAFWAER VESLNVGSSI RRLTSHQLAS ALIKATTDEK QISKARVVGE MIRKENGITR AIEAIYRDLE YAKSIIKSLP STDDRTPERI SSHLHPLTTA DLSFNRVRSR SRSRSRSSQG RFSPRRHTVD DDGWSVVSGG SRSRSGSASA VTSPERRPLN IGSALGSHVF KTALLPNTFG KWRNLEEGDD R