Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG04310 |
Symbol | |
ID | 3258616 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 1217350 |
End bp | 1221104 |
Gene Length | 3755 bp |
Protein Length | 796 aa |
Translation table | |
GC content | 46% |
IMG OID | 638258055 |
Product | UDP-glucose,sterol transferase |
Protein accession | XP_572103 |
Protein GI | 58269894 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000406767 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCACCTTGT TCCCCCCCCA CCTTCCATCT CAGGCTCAAA GGCGGTGTCA AAAGGCATGA CCATGTTTAT ATCAATCTAG CAATTGCCTC TTTATAATCT TCAACGATAA TCAGCCATGG AATCGCCACC ACCATACGAA GCTCAGGGCG GGCCATCAAC AACGCCCGCT GCGAGCAACA ATTATGTCGC AGTGCCCACT GACGCCGGTA AATATACAGA TCCAGCGAAA AAATGTTTTG AAGACTAATT CAGTATAGAT CGCAGCGCAT TCATTAGCTC TGGATCAGGT CTCCACGTCC GTTCCTCTTT GACGACTTAT GGGGATGTCA ATGCCTGGAT TGATGTCCAG GAGAATCTCG GCGATCTGCC TGCTGCTGTT GCTCCAAGAG TCAAGGAATA TGCGTTGGAT CCACAAGGAG CTGTGCCATC ATTCAACATT GTGATGTTTG CAATCGGCGA TGAAGGTAAA TTCCTGGTTT AATATTAAGG AACAATCGAC TAAATTGATG CTTGTAGATG ATCTTCGCCA ATTCATATCT CTTGCCATTG AACTCATCGT CTCACATTCT CATCGAATCC GCATTGTTAC ATCGGAATTT TATGAAGACC TAATTACTCA AGCTAAGAAT AATCTGTCAG GGAGGACAGG TAAAGATGGC CGAGTTGGGC TGCATGACAA GCTGGAAATG TATCCTCTCT CCGCACCAGC AGATGCGAAC CTGTCTACTT GGACAAAGGG TGGGTATTTA ACTGTCAACG CGTGTGTATC ATGCTCATGA TCATATCAGA CCAGAGAACT ATGGAACTCA CATTAATATC ACTATATCGC TCGACCTTCA GCCCCTCTGC TGTACCTACA AATCCACATT TTGCCGCTGA TCTAATCATT TCGGCGCCCA ATGTTCCCTG CCACGTATCC ATCGCTGAGC TTCTTGGTCT TCCTCTACAT ATCCTTTCGA GTATGATTAT TGCCGACCTT AAGCCGGGAA TAAAGCTGAT GAGCATCTAG CCAACCCTTG TTCACCCACT ATCACTCTTC CACATCCCGG AACCATCATC CAACGATCCA ACACCAACGC CTCACTTACT AATTATCTCA GCTATCCCAT CTATGAGAAT CAGTACATCT GCTTTGTTGG CCAATATATC ATTATCGCTG ATGGTTCTAT AGGGTTTGGC ATTCGCTTGG AAGAGTCATC AACGAGTTTC GAGTTGCCAG CCTTGGTCTA CCAACCCTCA CCAAGATGGA GGGGCCAGGT GTTCTGGATA GACTGAAAGT ACCTTTCACC TATTGCTGGA GTCCCTCTCT TTTGAAAAAA CCAGAAGATT GGAGAGAACA TATTGGTGAG TATAAAGACA TGCAAGCTTG TCAAGTCGCT AAGTCGATAG CAGACGTGAC TGGCTTCATA TTTGATCACC GGGAGCAGAT AGATTTCCAC CCCTCTGACG ACCTTTTGTA CTTCCTCAAA AACGGAAAGG AACCTGTATA TGTCAAGTGA GCCAAATTCC TGAGATTCAA CCCTGACATA CTGATTGTCT TTTAGGCTTG ACCTATCAAA TTCAGACTCC ACGAATATCA TAAGCATGCC ATCCGTTCTT TCATTCTAGG CTACGCTTAC CCAGTACAGA CTCCTTCATC ACGGCTTTCT TGAAGTCCAA TAATAGGGCC ATTGTGGATA TCAAAGGCAT ACAGATGAAG AATGGTGAAA ATCCCGATAT CTTCATCGTG TCTGGCAAGT TCCCAGGAGT CAACTTTGTT AATCTTGTAA AGAGCTAACC AGCTGGTTCA TAGAGGTGGC ACCCGTTCCA TACCAGTGGC TTTTGTCAGA GAGGAGGATA TCAGCAATCT GTCACGCCGA TAGTAAGTGA TTCCGTTTAT CGAAGCTTGC AAACTCTCAT ACAGGTTGTT AGGCGCTTCG CTCAACCTGG CTGCTATGAG AGCTGGAATA CCAGCCATCA TCGTGGCAAT TGAACGTCCT TTAAGGTAAT TCACCTTCGT TTTTGAGCAT ACTGCCCTCA CTAACCCATG TCTATAGCTT CTGGGGCAGA CAGATCCATC AAGTGGGCAT TGCTGCTTTC ATCTCGTCAG ACGTCCTTAG CTGTGAAGAG ATCACCTCTG CTTTGGAAGA GGCGTTATCT CCGAGAGTTC AATCTGCCGC CAGAGAGTAT GGATCACAAT TGTCTACTGA GGACGGTACA AAGGGGGCTG CGGAGACCAT CCATAAGCAC CTTCCCTTGC TTAGCATGAG GTGAATTGTT AGATATGGCG CATTGAGAAT CAGCTAACCA CTTAGTAGAT GCGATATCAT TCCTTCTAGA GCTGCTATTT GGTATTCCCC TGAGTACAAC CTTCATCTTT CGGGGATTGC TGCTGGAGTG TTGGTCGATG AGGGGAAGTT GTCTTTCAAA ACGCTTGAAC CGAATCGTAA CCTTTTTTCA TCAACAAACC TGATCATTTC TAATCGCTAG TGCAGGATCA AAGGAATACC CAGTGAATAT TGCCGACTCG GATCCTATCA TTGGAGGTAC CCAAGCGTTC TTCCTTGCTT TGACTGCATC TGTGTTGAAT GTGTTGCACA TGTTCAACCA ACCGGTAAGT TTCTTTTGTG AAGATTTGAG CCACAGCTGA TCGGACTTTT CCCAAAGGTT CCTCAGAGAG AAGTCGACAT CTCAGCACAG CAGCCAGTCA TCATTTCCCA AGTCCGAAAC CCTTCGGGCG GTTGGACGTG GTCATACGAA CAAGCTACAC GCCAGAGAGT TCCCATTACC GACTTTAAGT CAGGGATGAA GGAAGCAAGG GATGAACTTA CCACTGGTGT AAAGGATGGT ATGAAAGCGC TAGTCATGGA GCCTCTGTAT GGATTCAAGG AGGGAGTGAG TTGCGATGGC GGTCGGCACT TTTATAGAGA TCCTGACAAC CAGACATAGG GTCCAGTTGG TGGGGTCTTT GGACTCGTGA GAGGCGGTAA GTTTATGACT TACAATGTAA ATGACAAAAG CTAACGTCTA CAAGGTGTGT CTCTCGTTAC ACGACCTTTG GGTAGTGGTA TATCGGCAGT TCGATATGGT GCGCAAGGCG CAATACGCGA GGTAGACGGC CGAGCCACAA AACTCTTCAC TCTCGATTAC TCTTCCCCAG CCGAATCACT CCGACCTTCA CGAAAAGCAG CTAGTATCGA AGAACTCAGG AAGATAACCC AAGAAGACCG AAAGCGGATT TTGGAAGAAT TCAAGCATGC GAAATCTGAC GAAGCGACAG AATCGAGAAA GGTAAAGGAG GAGGCTATAT CGCTAGGGAA AATGCCGGAG CGTGCAAGGG GTGGCATTGA ACTTGGCAGA TATACTTCGC CTTTGAGTAA TGAAGCTGAC AAGTCGAGTA AAAAGTGGTG GAAGGGAAAG GGAAAGGGCA AAGAAAGGGC ATCTGAGACG ACTTTGGGGC CCCAGCAACC GCTTCAGTCA AGCTCTGGCT TGACATCTCC GAGTTCGTCC TCGCACACAG ACGAGAAGCT TTGGCCCAGC GAAAGGAAAT AAGGTATAAA TAATCTAAAT AATCTTTGTA CTGTATATCA TTAGCAAATA CACGGCAAAA TGGGAGAGCC ACTGAAACTT ATTTATTGTT CTTGATGGAA AATATGTGTA GGTTCGGAAG AAAGCATACG TAACCATATA AAATCCAACA TTCGAGCAAA AAAATAGTAT CATAAGGGAT TCATAAGTCA ATCTATATAA TTTTAGGATG CATAAGAAGC AAGCA
|
Protein sequence | MESPPPYEAQ GGPSTTPAAS NNYVAVPTDA DRSAFISSGS GLHVRSSLTT YGDVNAWIDV QENLGDLPAA VAPRVKEYAL DPQGAVPSFN IVMFAIGDED DLRQFISLAI ELIVSHSHRI RIVTSEFYED LITQAKNNLS GRTGKDGRVG LHDKLEMYPL SAPADANLST WTKDQRTMEL TLISLYRSTF SPSAVPTNPH FAADLIISAP NVPCHVSIAE LLGLPLHILS TNPCSPTITL PHPGTIIQRS NTNASLTNYL SYPIYENQVW HSLGRVINEF RVASLGLPTL TKMEGPGVLD RLKVPFTYCW SPSLLKKPED WREHIDVTGF IFDHREQIDF HPSDDLLYFL KNGKEPVYVK GGTRSIPVAF VREEDISNLS RRYFWGRQIH QVGIAAFISS DVLSCEEITS ALEEALSPRV QSAAREYGSQ LSTEDGTKGA AETIHKHLPL LSMRCDIIPS RAAIWYSPEY NLHLSGIAAG VLVDEGKLSF KTLEPNRSKE YPVNIADSDP IIGGTQAFFL ALTASVLNVL HMFNQPVPQR EVDISAQQPV IISQVRNPSG GWTWSYEQAT RQRVPITDFK SGMKEARDEL TTGVKDGMKA LVMEPLYGFK EGGPVGGVFG LVRGGVSLVT RPLGSGISAV RYGAQGAIRE VDGRATKLFT LDYSSPAESL RPSRKAASIE ELRKITQEDR KRILEEFKHA KSDEATESRK VKEEAISLGK MPERARGGIE LGRYTSPLSN EADKSSKKWW KGKGKGKERA SETTLGPQQP LQSSSGLTSP SSSSHTDEKL WPSERK
|
| |