Gene CNG04310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG04310 
Symbol 
ID3258616 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp1217350 
End bp1221104 
Gene Length3755 bp 
Protein Length796 aa 
Translation table 
GC content46% 
IMG OID638258055 
ProductUDP-glucose,sterol transferase 
Protein accessionXP_572103 
Protein GI58269894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000406767 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCACCTTGT TCCCCCCCCA CCTTCCATCT CAGGCTCAAA GGCGGTGTCA AAAGGCATGA 
CCATGTTTAT ATCAATCTAG CAATTGCCTC TTTATAATCT TCAACGATAA TCAGCCATGG
AATCGCCACC ACCATACGAA GCTCAGGGCG GGCCATCAAC AACGCCCGCT GCGAGCAACA
ATTATGTCGC AGTGCCCACT GACGCCGGTA AATATACAGA TCCAGCGAAA AAATGTTTTG
AAGACTAATT CAGTATAGAT CGCAGCGCAT TCATTAGCTC TGGATCAGGT CTCCACGTCC
GTTCCTCTTT GACGACTTAT GGGGATGTCA ATGCCTGGAT TGATGTCCAG GAGAATCTCG
GCGATCTGCC TGCTGCTGTT GCTCCAAGAG TCAAGGAATA TGCGTTGGAT CCACAAGGAG
CTGTGCCATC ATTCAACATT GTGATGTTTG CAATCGGCGA TGAAGGTAAA TTCCTGGTTT
AATATTAAGG AACAATCGAC TAAATTGATG CTTGTAGATG ATCTTCGCCA ATTCATATCT
CTTGCCATTG AACTCATCGT CTCACATTCT CATCGAATCC GCATTGTTAC ATCGGAATTT
TATGAAGACC TAATTACTCA AGCTAAGAAT AATCTGTCAG GGAGGACAGG TAAAGATGGC
CGAGTTGGGC TGCATGACAA GCTGGAAATG TATCCTCTCT CCGCACCAGC AGATGCGAAC
CTGTCTACTT GGACAAAGGG TGGGTATTTA ACTGTCAACG CGTGTGTATC ATGCTCATGA
TCATATCAGA CCAGAGAACT ATGGAACTCA CATTAATATC ACTATATCGC TCGACCTTCA
GCCCCTCTGC TGTACCTACA AATCCACATT TTGCCGCTGA TCTAATCATT TCGGCGCCCA
ATGTTCCCTG CCACGTATCC ATCGCTGAGC TTCTTGGTCT TCCTCTACAT ATCCTTTCGA
GTATGATTAT TGCCGACCTT AAGCCGGGAA TAAAGCTGAT GAGCATCTAG CCAACCCTTG
TTCACCCACT ATCACTCTTC CACATCCCGG AACCATCATC CAACGATCCA ACACCAACGC
CTCACTTACT AATTATCTCA GCTATCCCAT CTATGAGAAT CAGTACATCT GCTTTGTTGG
CCAATATATC ATTATCGCTG ATGGTTCTAT AGGGTTTGGC ATTCGCTTGG AAGAGTCATC
AACGAGTTTC GAGTTGCCAG CCTTGGTCTA CCAACCCTCA CCAAGATGGA GGGGCCAGGT
GTTCTGGATA GACTGAAAGT ACCTTTCACC TATTGCTGGA GTCCCTCTCT TTTGAAAAAA
CCAGAAGATT GGAGAGAACA TATTGGTGAG TATAAAGACA TGCAAGCTTG TCAAGTCGCT
AAGTCGATAG CAGACGTGAC TGGCTTCATA TTTGATCACC GGGAGCAGAT AGATTTCCAC
CCCTCTGACG ACCTTTTGTA CTTCCTCAAA AACGGAAAGG AACCTGTATA TGTCAAGTGA
GCCAAATTCC TGAGATTCAA CCCTGACATA CTGATTGTCT TTTAGGCTTG ACCTATCAAA
TTCAGACTCC ACGAATATCA TAAGCATGCC ATCCGTTCTT TCATTCTAGG CTACGCTTAC
CCAGTACAGA CTCCTTCATC ACGGCTTTCT TGAAGTCCAA TAATAGGGCC ATTGTGGATA
TCAAAGGCAT ACAGATGAAG AATGGTGAAA ATCCCGATAT CTTCATCGTG TCTGGCAAGT
TCCCAGGAGT CAACTTTGTT AATCTTGTAA AGAGCTAACC AGCTGGTTCA TAGAGGTGGC
ACCCGTTCCA TACCAGTGGC TTTTGTCAGA GAGGAGGATA TCAGCAATCT GTCACGCCGA
TAGTAAGTGA TTCCGTTTAT CGAAGCTTGC AAACTCTCAT ACAGGTTGTT AGGCGCTTCG
CTCAACCTGG CTGCTATGAG AGCTGGAATA CCAGCCATCA TCGTGGCAAT TGAACGTCCT
TTAAGGTAAT TCACCTTCGT TTTTGAGCAT ACTGCCCTCA CTAACCCATG TCTATAGCTT
CTGGGGCAGA CAGATCCATC AAGTGGGCAT TGCTGCTTTC ATCTCGTCAG ACGTCCTTAG
CTGTGAAGAG ATCACCTCTG CTTTGGAAGA GGCGTTATCT CCGAGAGTTC AATCTGCCGC
CAGAGAGTAT GGATCACAAT TGTCTACTGA GGACGGTACA AAGGGGGCTG CGGAGACCAT
CCATAAGCAC CTTCCCTTGC TTAGCATGAG GTGAATTGTT AGATATGGCG CATTGAGAAT
CAGCTAACCA CTTAGTAGAT GCGATATCAT TCCTTCTAGA GCTGCTATTT GGTATTCCCC
TGAGTACAAC CTTCATCTTT CGGGGATTGC TGCTGGAGTG TTGGTCGATG AGGGGAAGTT
GTCTTTCAAA ACGCTTGAAC CGAATCGTAA CCTTTTTTCA TCAACAAACC TGATCATTTC
TAATCGCTAG TGCAGGATCA AAGGAATACC CAGTGAATAT TGCCGACTCG GATCCTATCA
TTGGAGGTAC CCAAGCGTTC TTCCTTGCTT TGACTGCATC TGTGTTGAAT GTGTTGCACA
TGTTCAACCA ACCGGTAAGT TTCTTTTGTG AAGATTTGAG CCACAGCTGA TCGGACTTTT
CCCAAAGGTT CCTCAGAGAG AAGTCGACAT CTCAGCACAG CAGCCAGTCA TCATTTCCCA
AGTCCGAAAC CCTTCGGGCG GTTGGACGTG GTCATACGAA CAAGCTACAC GCCAGAGAGT
TCCCATTACC GACTTTAAGT CAGGGATGAA GGAAGCAAGG GATGAACTTA CCACTGGTGT
AAAGGATGGT ATGAAAGCGC TAGTCATGGA GCCTCTGTAT GGATTCAAGG AGGGAGTGAG
TTGCGATGGC GGTCGGCACT TTTATAGAGA TCCTGACAAC CAGACATAGG GTCCAGTTGG
TGGGGTCTTT GGACTCGTGA GAGGCGGTAA GTTTATGACT TACAATGTAA ATGACAAAAG
CTAACGTCTA CAAGGTGTGT CTCTCGTTAC ACGACCTTTG GGTAGTGGTA TATCGGCAGT
TCGATATGGT GCGCAAGGCG CAATACGCGA GGTAGACGGC CGAGCCACAA AACTCTTCAC
TCTCGATTAC TCTTCCCCAG CCGAATCACT CCGACCTTCA CGAAAAGCAG CTAGTATCGA
AGAACTCAGG AAGATAACCC AAGAAGACCG AAAGCGGATT TTGGAAGAAT TCAAGCATGC
GAAATCTGAC GAAGCGACAG AATCGAGAAA GGTAAAGGAG GAGGCTATAT CGCTAGGGAA
AATGCCGGAG CGTGCAAGGG GTGGCATTGA ACTTGGCAGA TATACTTCGC CTTTGAGTAA
TGAAGCTGAC AAGTCGAGTA AAAAGTGGTG GAAGGGAAAG GGAAAGGGCA AAGAAAGGGC
ATCTGAGACG ACTTTGGGGC CCCAGCAACC GCTTCAGTCA AGCTCTGGCT TGACATCTCC
GAGTTCGTCC TCGCACACAG ACGAGAAGCT TTGGCCCAGC GAAAGGAAAT AAGGTATAAA
TAATCTAAAT AATCTTTGTA CTGTATATCA TTAGCAAATA CACGGCAAAA TGGGAGAGCC
ACTGAAACTT ATTTATTGTT CTTGATGGAA AATATGTGTA GGTTCGGAAG AAAGCATACG
TAACCATATA AAATCCAACA TTCGAGCAAA AAAATAGTAT CATAAGGGAT TCATAAGTCA
ATCTATATAA TTTTAGGATG CATAAGAAGC AAGCA
 
Protein sequence
MESPPPYEAQ GGPSTTPAAS NNYVAVPTDA DRSAFISSGS GLHVRSSLTT YGDVNAWIDV 
QENLGDLPAA VAPRVKEYAL DPQGAVPSFN IVMFAIGDED DLRQFISLAI ELIVSHSHRI
RIVTSEFYED LITQAKNNLS GRTGKDGRVG LHDKLEMYPL SAPADANLST WTKDQRTMEL
TLISLYRSTF SPSAVPTNPH FAADLIISAP NVPCHVSIAE LLGLPLHILS TNPCSPTITL
PHPGTIIQRS NTNASLTNYL SYPIYENQVW HSLGRVINEF RVASLGLPTL TKMEGPGVLD
RLKVPFTYCW SPSLLKKPED WREHIDVTGF IFDHREQIDF HPSDDLLYFL KNGKEPVYVK
GGTRSIPVAF VREEDISNLS RRYFWGRQIH QVGIAAFISS DVLSCEEITS ALEEALSPRV
QSAAREYGSQ LSTEDGTKGA AETIHKHLPL LSMRCDIIPS RAAIWYSPEY NLHLSGIAAG
VLVDEGKLSF KTLEPNRSKE YPVNIADSDP IIGGTQAFFL ALTASVLNVL HMFNQPVPQR
EVDISAQQPV IISQVRNPSG GWTWSYEQAT RQRVPITDFK SGMKEARDEL TTGVKDGMKA
LVMEPLYGFK EGGPVGGVFG LVRGGVSLVT RPLGSGISAV RYGAQGAIRE VDGRATKLFT
LDYSSPAESL RPSRKAASIE ELRKITQEDR KRILEEFKHA KSDEATESRK VKEEAISLGK
MPERARGGIE LGRYTSPLSN EADKSSKKWW KGKGKGKERA SETTLGPQQP LQSSSGLTSP
SSSSHTDEKL WPSERK