Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH03380 |
Symbol | |
ID | 3259237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 141783 |
End bp | 145371 |
Gene Length | 3589 bp |
Protein Length | 930 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258146 |
Product | glycogenin glucosyltransferase, putative |
Protein accession | XP_572506 |
Protein GI | 58270700 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.337452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGCCTTTCC CTCGGGCGAA TCCAAAAGCA AAGAGTGTAT CTCATCTACT GAGACCGCTT GCAAAGAAAA CGCCTTCCGC ATCCTCAACT CAAACAAGCG GAACTCGGTC TCGGTCTTGT TGTTGTTTGT TTCTTCTTTC CAGCGGCGTA GACAAATAAC AGGTGAGTCT CCACTTGAGT CACCACATCA TACACAGCTC ACAACCTATC AGTCCAGACA TGTCTCTTCC CAACGCCTTC GTCACACTTC TAACGACGTC CTCGTATCTC CCCGGAGCCC TCGTGCTTCT CCATGCCCTC CAGGATCTTC ACCCAGCGCC GCGGGACTTT CAGATTGTAG CCCTCGTGAC TCCAGAGACG GTCGATGCTG CTACTATCGG CGAACTACGA AGGGCAGGGT ATGATTTGGT GATTGGCGTG GAACCTATTG GCAGTGGGAA AGCCGGTCAA GTCGGGTTAG AGCTCATGGG TAAGGTCGTC TCCGTTCGGT CTGTGGCCTG CCAAGGAAGA TTGTCCATAA TGCTAACGGA TGCGGCTAAC AGGCCGACCG GATTTGAATT TTGCATTAAC AAAGCTTCAT CTCTTCCGCC TCGCCCCGTT CTTCTCTACT CTTATCTACC TCGATGCGGA CATCCTCCCC CTCCGACCCA TTTCGCACCT GTTCACCTCG ACCGCCCCCC ATGTGTTTTC TGCATGCCCT GATACTGGCT GGCCAGACTG TTTCAACTCT GGATTCATGG TCATCCGTCC TAGAGAGAGC GATTGGGATG GGCTTAAAGG GATGTTGAAA GATGGTGAAG GGGAAGATGG GCTATATAGA GAAGCAGGGA ATGGGAGCTT TGATGGTGCT GACCAAGGGT TGTTGAATGA GTGGTTCAGT GAAGAAGGTG GTGGCGGGGA CTGGAACAGA CTGTCCTTCA CGTACGTTCC AGATTTCAGT TTTCCGTCAA TCAAGGATTG TTGCTGACAC CATTCTCGAC AGATACAATG TTACCCCCTC TGCGGCGTAT ACCTGGGCCC CAGCGTATAA ACGTTTCGGC CACAAGATCA GCAATGTCCA TTTTATTGGA CCTAATAAAC CGTGGACGAG CCTACCCGGA AGACCGGCCG GTGTGTCAAA TGTCAAAGGA AAAGAGAACT CTTACGACTG TAAGTCATTC CTGCCTGTCC AATGAATTGC CCAACTCATT GACGTCTTCT TCTTCTATCT AGACTTGTCT CTTATAGACC GGTGGTTTGC AGTGTACGAC AAACATGTCC GCCCAGCTTC CGCTCTCGAC CCCGACATCT CAAGGCGTTT TGCTGTCCCA CAAACCATAG CTGCTTGGGA TTCCCATGCG AACCGAGCAA GAGCAGCTGC CACTGTCCTG TCTGAAGACA AACTTGAACT GTCCGAGCTC AAAGCTGCCA CTGAAAGGGG CGTGAATGCT TTCAAACCGG GACAGTACAC TTCCCTTCCG CTCGAAGGAC GAGTAGATCT GATCATGCCC AAACCTAAGC CCGTACCTAG AGCTGCCATC TCCCAACTGG CTGCCGCAAG TACGGTAGCG CCATCCGTTA GCCCCTCAGC TCTTACTCCG CCTCCCGCAG ATGCGGTCCC AGCCCCTGCG CCTGCTCCTG TCCTGACTCA GACAGAGCAA CAGACCGCCC AACCTTTTGT CTGGGACGCC CAACGTTCCT CCCCACCAGC AAGCGCTCCA CCGGAAATGT CCGTCCCCCA CACATACTAC CGCAACGCAT GGGAAGCTCC CCTTTCCCAA CAATCATCCT ATTACGCCCA CCCCGAGTCT CACCAACCCC AGGCTGAACA CAAAGAACCA GAGTATCCTA CATTGCCAAA GGAAGTGACG GGGGATAGTT GGTATGCGAG GTTTGCGACG AGCACGCCGG ATAAGCGTGC AGTAAGTGCA GTTTTCCCGT GGGAAGAGAA AACCGGCAGC CATGGGTATG GACATGGGTC CAGACCGAAA CCGGAGAGGG TGTTCCCGAA AGGAGAGGAA CCGCTACCCC CTCTTGTGCA GCAGTTGATC CATCCTTTGC AACCGCCGTC CATCTCGATA CAGTACGCCA CTCCTACCGA CTCTTACCAA TCCCAGCATC CCTCTCACGC AACGGGGATG GGAATGGCAG GACAAGCACA AGCACCGAAA AGTCCTTCCC CGCCACCAAG ACACGTGTCC ATGGTAGAAG CTATGGCATC GTATAAGAAT GTGTGGGATG ATATTCCGCA GATAGGGAAG TATGTGGATA TCATGAGCGG GAAGACTGGT GGAAGATCTG TCAGAGGTTT GTCTACCCGT GGACACGGAC ATGGGCAAGG TCATATTCAA GGTCATAGTC AAGGGCAAAA GCAGTCACAG GCGCAAAGCC ATGAGAGGAA CGTCTCTCTC CAGTCCCTTC AGTCTGTCCC GGGTACACCC CGTACTCAAT ATTCCACTTT CGGTAAATCC CCGCGGCTTA CTAATGCGAG GGATCTAGAA AGACGAGGAA GTTTGGAGCA ACCTGAAGAT TCCGCTGATG GGGATGATGA AAACTCTACT TCGGCCTCGG AAGAGGAAGG TGGAAAAGGC GGGGAAGGAA AATCGTCGAA GCCGTATAAG GGCAATAGGA AATATAGAGA TCGGTGGGCA CAGACGGATA GAGTGAAGAC GGTTGATGAG ACAGTCCAGA CGCAGACGCA TGCTGGGGAA GAGATGGCTG GCGGAGGGTT GAAAATGTGG GGGCTGCCTC ATGCTTCTGC CCATGGACGG AAGAGCAGCA AGGAGATTCC TTTCCCCAGT GGAAGTGGGA ATGGAAGAGC AGGAGGGGGA GGGCAGCGAG AAGCTCAGAC TCAACACCAA TCGACTTATT ACGAGTACCA ACAACAACAC CCCCACTCCC AGCAGCCTCG ACAAGGGAGT ACGGCAAGCC CGAGCCAGAA ACCCGAGTTA AACGCTCGTC TTCCGGATTA TTCGTTTGAT TTCAAGGGGG CTACTTCCCA TGCGCAAGGG ATTGCCCAAG CACAGGCACA AGCCCAAAGT CAGGCTCAAG GACAGGGGGC CAACTCCAAC CTGAACGCAC AGCATAGGCA CAAGCCGAGC GGATCATTCT CGACCATTTA TGGTGGTCGA GGGAGGGTTT GGGATCCGAA CACGGATGTG GAGGTAAGGA GACGGGATAG TCAAGAGGTC CTGGCGAGAT TTATGCAAGG CAACTTGGGG AGAGGATGAT TAGATCTTCT CTTTTTTCCT TGAAACGTTG TGCGGAGCGC AGGGAGCGTA TCTAGGAGCC TCCTGTTCCC CGTATGCGTC ATAACAATGT CCCGACAGGA CTGGATTGTA TCTTGTGTTG ACCTCGATAT ATCGAGGAAA CCTATCCATT AGAGGAGTTG TGCAAAAACA AAAAAATATC AAAAAAAAAC GAGACAGCCT GAGGTTGTAG CTATATTTTA TCTTTTTTAT ACAATCTGTC TGGTTTAATT ATCCATCTGT TTATCTCTGC CAATTATTTC TCTTTTGCGA TGTGTTTAGA TTTGGGTTTT CTTTTTTTCT TTTTTTAATG GGGATGTCTT ATATCAACAT AAATATACCT ACATTTGGGA TAAACAAAC
|
Protein sequence | MSLPNAFVTL LTTSSYLPGA LVLLHALQDL HPAPRDFQIV ALVTPETVDA ATIGELRRAG YDLVIGVEPI GSGKAGQVGL ELMGRPDLNF ALTKLHLFRL APFFSTLIYL DADILPLRPI SHLFTSTAPH VFSACPDTGW PDCFNSGFMV IRPRESDWDG LKGMLKDGEG EDGLYREAGN GSFDGADQGL LNEWFSEEGG GGDWNRLSFT YNVTPSAAYT WAPAYKRFGH KISNVHFIGP NKPWTSLPGR PAGVSNVKGK ENSYDYLSLI DRWFAVYDKH VRPASALDPD ISRRFAVPQT IAAWDSHANR ARAAATVLSE DKLELSELKA ATERGVNAFK PGQYTSLPLE GRVDLIMPKP KPVPRAAISQ LAAASTVAPS VSPSALTPPP ADAVPAPAPA PVLTQTEQQT AQPFVWDAQR SSPPASAPPE MSVPHTYYRN AWEAPLSQQS SYYAHPESHQ PQAEHKEPEY PTLPKEVTGD SWYARFATST PDKRAVSAVF PWEEKTGSHG YGHGSRPKPE RVFPKGEEPL PPLVQQLIHP LQPPSISIQY ATPTDSYQSQ HPSHATGMGM AGQAQAPKSP SPPPRHVSMV EAMASYKNVW DDIPQIGKYV DIMSGKTGGR SVRGLSTRGH GHGQGHIQGH SQGQKQSQAQ SHERNVSLQS LQSVPGTPRT QYSTFGKSPR LTNARDLERR GSLEQPEDSA DGDDENSTSA SEEEGGKGGE GKSSKPYKGN RKYRDRWAQT DRVKTVDETV QTQTHAGEEM AGGGLKMWGL PHASAHGRKS SKEIPFPSGS GNGRAGGGGQ REAQTQHQST YYEYQQQHPH SQQPRQGSTA SPSQKPELNA RLPDYSFDFK GATSHAQGIA QAQAQAQSQA QGQGANSNLN AQHRHKPSGS FSTIYGGRGR VWDPNTDVEV RRRDSQEVLA RFMQGNLGRG
|
| |