Gene Information |
Locus tag | CNK00070 |
Symbol | |
ID | 3254444 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 19050 |
End bp | 22023 |
Gene Length | 2974 bp |
Protein Length | 834 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253501 |
Product | phosphoketolase, putative |
Protein accession | XP_567776 |
Protein GI | 58260732 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3957] Phosphoketolase |
TIGRFAM ID | |
|
|
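The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (an assumption)."""
    return 3 * protein_aa + 3


# Values copied from the record above.
START_BP, END_BP = 19050, 22023
PROTEIN_AA = 834

span = gene_length(START_BP, END_BP)       # compare with the "Gene Length" field
coding = coding_nucleotides(PROTEIN_AA)    # minimum coding length for 834 aa
non_coding = span - coding                 # remainder: introns and flanking sequence

print(f"gene span: {span} bp")
print(f"coding sequence incl. stop: {coding} nt")
print(f"non-coding within span: {non_coding} nt")
```

Under these assumptions the span matches the listed Gene Length of 2974 bp, and the gap between the span and the coding length is what one would expect for a gene marked as spliced, since the listed span also covers intronic and flanking sequence.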
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTGCATTTA CAGCCAGTTC ACTCAATACA ACTTCTCAGC ATGGCTGAAG AAACATCCTC CCTCACCTCT TTTGGTCAGG CTCGCTCGAC TGTCAAGGAC CAGCCCTTGA CAGTGGAGGA GCTCAAGAAG ATCGATGCCT ACATGCGGGC CTCCCTTTAC CTCTGTCTTG GTATGCTCTA TCTCCGTCAA AATCCACTTC TCAAGGAGCC TCTCAAGAAG GAACATCTCA AGGCTCGTCT TTTGGGTCAC TGGGGTTCCG ACGCTGGTCA GATTTTCACC TACATCCACA TGAACCGTCT CATTAAGAAG TATGATTTGG ATGCTCTTTT CGTTTCTGGT CCTGGTGAGT CCCTTCATTT TGTTCTATAA TTTCTGGTAT CTGACGACAC TTTAGGTCAC GGTGCCCCCG CTGTCCTCTC TCAGTCCTAT CTCGAAGGTG TCTACACTGA AGTCTACCCT AACATAACCG AAGATGTCGA AGGCATGCGA CGCTTCTTCA AGCAATTCTC ATTCCCCGGC GGGGTTGGTT CTCACGCTAC TCCCGAGACT CCTGGATCTC TTCACGAAGG TGGTGAACTC GGTTACTCCA TTTCCCACGC CTTTGGTACG GTCTTTGACA ATCCCAACCT CATTACTCTT ACTATGGTCG GTGATGGTGA AAGTGAGACT GGCCCTCTGG CTGCTTCTTG GCACAGTACT AAGTTTTTGA ATCCTATCAC CGATGGCGCT GTTTTGCCTG TCTTACATCT CAACGGGTAG GCCACAATTT CCTTTCTCGA TTAGTATACA ACTGACATTC TTGACTTAGT TATAAGATCA ACAACCCCAC CGTCCTTGCT CGTATCTCCC ACGAAGAGAT TGAAGCTCTC TTTATCGGCT ATGGCTGGAA GCCCTACTTT GTCGAAGGCT CTGACCTTAC CTCTATGCAC CAAGCTATGG CTGCTACTCT CGAAAAGGCT GTCCTTGAGA TCAAGGCCTA CCAGAAGCAA GCTCGAGACT CTGGCAAGGC TTTCCGTCCT CGTTGGCCTA TGATAATTCT CCGATCCCCC AAGGGTTGGA CTGCTCCTCG AAACGTCTCT GGCCACCACC TCGAAGGCTA TTGGCGTGCC CACCAGATCC CCCTTGCCGA CGTTGCTTCC AATTCTGAAC ACCTCAAGCT CCTCGAGGAC TGGATGCGAT CTTACAAGCC CGAAGAGCTC TTCACTGAGG ATGGTAAACT TATCCCTGAG CTCAAGGCTC TTCCTCCTGC AGGTCAAGCC CGTATGTCTG CCAATCCTGT GTCTAACGGC GGTTTGGTCC GCAAGGCATT GAATCTTCCC GACTTCAAGG ACTACGCCAT CAAGGACATC GCTCCTGGTG TGACTCTTGC CCCCAGCATG TCAAACATGG CGCTCTTTGT TAGGGACGTG ATCAAGAAGA ACCAGACCAA CTTCCGTCTA TTCGGCCCTG ATGAGACTGA GTCCAACAAA CTCGCAGCTG TGTATGAGGC AGGCAAGAAG GTTTGGATGG GCGAGTACTT GCCCGAGGAC ACTGATGGTG GTAACCTCGC TCACGCGGGT CGGGTGATGG AGATCTTGTC TGAGCACACA GTTGAAGGAT GGCTTGAGGG TTATGTCTTA TCTGGTCGAC ATGGTCTTGT GAGTGCTATC TTTTATACTT TTCCTTAACT CATGACTGAC ATCTCCTTCT AGCTGAATTC CTACGAGCCC TTCATCCACA TCATCGACTC TATGGTTAAC CAACACTGCA AGTGGATCGA AAAGTGTCTC GAAGTCGAAT GGCGTGTCAA AGTCTCCTCC 
CTTAACATTC TTCTTACCGC CACGGTCTGG CGTCAGGACC ATAACGGTTT TACTCACCAA GACCCCGGTT TCCTCGACGT CGTCGCCAAT AAGTCCCCCG AGGTCGTCCG CATCTATCTC CCTCCCGACG GTAACTGTCT TCTCTCTGTC ATGAATCACT GCTTCGACAG CAAAAACTAT GTCAACGTTG TCGTTGCCGA CAAACAGGAT CACTTGCAGT ATCTCGACAT GGAGGCTGCT GTTGCGCACT GTACCAAGGG TCTTGGTATT TGGGAATGGG CGTGTGTGGG TGACCCCAAC GAGAACCCTG ACCTCGTTAT GGCTTGTTGC GGTGATGTCC CTACTATGGA GTCTCTTGCT GCCACTGCTC TCCTGAAAGA GTACCTCCCA GAGCTCAAGA TTCGTTTTGT TAACGTCGTT GACCTCTTCA AGCTTATCTC TCACGTGGAC CATCCGTAGG TTCATCTCAC GAGATCCGAC CCTTGAACAT GTACTGACTT GCTTAATTAG TCACGGTCTT ACTGATCGTC AGTGGGTCTC TTACTTCACC GAAGACACTC CAATCATCTT CAACTTCCAC TCTTACCCCT GGCTCATCCA CCGACTCACT TATAAGCGCC CTGGTTCTCA GAACATCCAC GTCCGAGGTT ACAAGGAGAA AGGTAACATC GACACTCCTC TCGAGCTCGC TATTCGTAAT GAGACCGACC GATACAGTCT CGCGATGGAC GCTATTGATC GTCTGCCCCA CCTCAAAAAC AAGGGTTCTA TGGCGAGGGA GAAGCTGTAT GATGCCCAGA TTAAGGCGAG GGACTGGGCC TTTGAGCATG GTATTGACCC GGAGGACGTC AGGAAGTGGA AGTGGCCTTA TGGTCCCAAG ACTGAGGGTA TTGCGAGCAA GCTTGGGTTC GGAGGAGAGA ACAAGCAACA GGTTGCATCC GTTGGTACTA GCGAATAAGG GTTGATCGCA GTCAAAAGTG TAATGTCGTA AGAAGGAAGT GTATTGTACA AACGAAAAAA GAAAATGAAT TGTCTGTTGG TTCTTGTTCT TAGTTTGAGT GGCCTCCGGG ATACTTTATA ATGTTTCGCC TTCGGCACTG GTGAATTACC AGCGAAATGT TTTTCATATC TTTTGCATAT TTCCAATAAC TCTGGAGATT TTCACTCATT CCCA
|
Protein sequence | MAEETSSLTS FGQARSTVKD QPLTVEELKK IDAYMRASLY LCLGMLYLRQ NPLLKEPLKK EHLKARLLGH WGSDAGQIFT YIHMNRLIKK YDLDALFVSG PGHGAPAVLS QSYLEGVYTE VYPNITEDVE GMRRFFKQFS FPGGVGSHAT PETPGSLHEG GELGYSISHA FGTVFDNPNL ITLTMVGDGE SETGPLAASW HSTKFLNPIT DGAVLPVLHL NGYKINNPTV LARISHEEIE ALFIGYGWKP YFVEGSDLTS MHQAMAATLE KAVLEIKAYQ KQARDSGKAF RPRWPMIILR SPKGWTAPRN VSGHHLEGYW RAHQIPLADV ASNSEHLKLL EDWMRSYKPE ELFTEDGKLI PELKALPPAG QARMSANPVS NGGLVRKALN LPDFKDYAIK DIAPGVTLAP SMSNMALFVR DVIKKNQTNF RLFGPDETES NKLAAVYEAG KKVWMGEYLP EDTDGGNLAH AGRVMEILSE HTVEGWLEGY VLSGRHGLLN SYEPFIHIID SMVNQHCKWI EKCLEVEWRV KVSSLNILLT ATVWRQDHNG FTHQDPGFLD VVANKSPEVV RIYLPPDGNC LLSVMNHCFD SKNYVNVVVA DKQDHLQYLD MEAAVAHCTK GLGIWEWACV GDPNENPDLV MACCGDVPTM ESLAATALLK EYLPELKIRF VNVVDLFKLI SHVDHPHGLT DRQWVSYFTE DTPIIFNFHS YPWLIHRLTY KRPGSQNIHV RGYKEKGNID TPLELAIRNE TDRYSLAMDA IDRLPHLKNK GSMAREKLYD AQIKARDWAF EHGIDPEDVR KWKWPYGPKT EGIASKLGFG GENKQQVASV GTSE
|
| |