Gene CNA01850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA01850 
Symbol 
ID3253769 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp498616 
End bp500840 
Gene Length2225 bp 
Protein Length519 aa 
Translation table 
GC content48% 
IMG OID638252518 
Productalcohol dehydrogenase, putative 
Protein accessionXP_566553 
Protein GI58258281 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0520343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCTTCTGC CACACTAAGC CTCGTCTCGC CCCCTTATAT TATAGGCCAT CCTCCACCGG 
CCTGCACCTA ACACCACATC CACAGCAACA CAAATATGCC CCCTCCAGCA TCCAGAGCCT
CCATCGTCCG ACTCCTCTCG CTCACCGCTA ACCAAGGCTG TAGAGGCTGT GGACGTACCC
ATTCAGTGCT TGACCATGCC CATAACCATG GACACACCCA ATCCCACGTC CCACTTGGGC
TGCGTGGAAT GGCAACCCCT GTAGACTTGC CAATCAGAGG CGGGCCCCCT CCTGGTAACA
CCGACTACGC TTTTGAGGTG AATATTCTCT AACGCATCGT TTGGTTTTCC CTGACCTAAG
ATTAACATCT CTAGATGGCC GCATCAAACC TTCGGTTTGG TGCCCATGCA ACCAGCGAAG
TGGGCATGGA TTTTGCCAAT CTCATAAAGG AAATGCCTGC TGTAGACCGC TCTCAGGCTA
AAATCGGTGT TTTTACGGAT CCCAATGTAG CGAAACTCCC CGTGATAGAG ATTGTGGAAG
AGAGTCTGTT ACGAGCGGGT TTGAACTTCG TAATGTGGGA TAAATGTGCA GTGGAGCCTA
CAGATAAGAG CTGGCAAGTA AGTTTGAAAC ATTAAGCCGA AGGAGAAAAA AAGCACACGG
TGCTTCGTGG ATTCAGTTGA CATAATCTTG TGGCCATTGC ACCGCTTAGG AAGCAATCGA
CTTTTCCAGG TCTTCACATC TCACTCATTT TTTGGCCGTT GGTGGTGGAT CATCTATGGA
CACTGCTAAG GCTGCCAATT TGTTCACGAA TTATCCAAAA GCCGATCTCT TTGAATTCAT
CAACGCTCCC ATCGGTAAAG GTACACCCAT CACCAAGAAA CTCAGCCCTT TGATTGCCAG
TACGTTAGCC GGTGACCTTT CTCCCGGATT GATGCTGATG CTGATATGAT AAACAGTCCC
GACTACCGCT GTGAGCTTAA TTGGGCCGTG TAGCAGTATA CCATGAAAAA GCTGATGGCC
AAGTCATAGG GAACAGGATC TGAGACTACT GGCACTGCCA TCCTAGACAT TCCGTCTCGC
AAATTCAAAA CTGGAATTGC ATCGCGGGCT CTCAAGCCAA CTTTGGGTAT TGTCGATCTG
TTGAACACCA CTACGTGTCC GAAGGAAGTG GCCATCGCAG CCGGGCTGGA TGTCCTCTTT
CATTCGCTCG AAAGTAAGAC CTGATATCGC AAAGCTGAAT AGATCATTGA TTTGTCTCTT
CTTATGTAGG CTGGACTGCC GTGCCCTACC ATCAGAGGAC ACCTAGGCCT GCAAACCCCA
TTAACCGACG TATGTGATAT TATTATCTAA TGTTCTGATT TGAATATTAA ATGCCCGTCT
GCCTGCCATA TAGCTGCCTA TCAGGGCTCC AACCCTATCT CTGATATCTT TTCCAAATGG
GCTTTGGAAA CTACAGTCAA GTATCTTCCT CGAATTGCTC GAGACCCCTT TGGTGACGAA
GAGGCCCGAG CGCAAATGCT TCTAGCTGCT TCTACTGCAG GTATTGGTTT TGGTAATGCA
GGAGTGCACA TGTGTGTGAG TAGTCCAGTT TCAGCTATTA CGATCAATGT TAGGAGCATA
GACTAACTTC TCTAACCTCA AGCACGCCTT CTCCTACCCC ATTTCTTCAC TTAACAAAGG
CAGACCTAAG GAGACACAGT ATCATCACCC ATCTTATAAT CCCGACGTCC CCCTCATTCC
TCACGGCGTG GCTGTTTCTC TGACAGCTCC CGCCGTATTC AATTTCACGG CTCCTTCTTC
TCCTGATAGA CACCGCGAAG CATTGTTGGT CTTCCTTGGG AAAGACAGGG CCCATGAGGC
GACTGGTCTA AAGGATGAGG ACCTGGGACC TAAGTTGAGT GAGGAGATCC GAAGATTTTT
GGATATCGTG GAAGTTCCTA GAGGGTTGAG TAAAGTTGGC TACACAGGAA ACGACATCAC
TTCCGTAAGT TTTTTACTGT CCATTTATAG GCCGAAGTAT ATTGACCAAT TCTGATGTAG
CTTGTCGATG GCTGTCTTCC TCAACGAAGA GTTCTGGATC TTGCTCCTGT TTTGGCCAAG
AACAATAATG CTGAAGAGAG AGAACAACTT GCTCACATCG TCGAACTTTC AATGAACTGG
TAGATGAGAT AGTATGTAGG GTACAGGCAA GTGAAGACAG GGGAATGCAT ATTAATTGCG
AGCTT
 
Protein sequence
MPPPASRASI VRLLSLTANQ GCRGCGRTHS VLDHAHNHGH TQSHVPLGLR GMATPVDLPI 
RGGPPPGNTD YAFEMAASNL RFGAHATSEV GMDFANLIKE MPAVDRSQAK IGVFTDPNVA
KLPVIEIVEE SLLRAGLNFV MWDKCAVEPT DKSWQEAIDF SRSSHLTHFL AVGGGSSMDT
AKAANLFTNY PKADLFEFIN APIGKGTPIT KKLSPLIAIP TTAGTGSETT GTAILDIPSR
KFKTGIASRA LKPTLGIVDL LNTTTCPKEV AIAAGLDVLF HSLESWTAVP YHQRTPRPAN
PINRPAYQGS NPISDIFSKW ALETTVKYLP RIARDPFGDE EARAQMLLAA STAGIGFGNA
GVHMCHAFSY PISSLNKGRP KETQYHHPSY NPDVPLIPHG VAVSLTAPAV FNFTAPSSPD
RHREALLVFL GKDRAHEATG LKDEDLGPKL SEEIRRFLDI VEVPRGLSKV GYTGNDITSL
VDGCLPQRRV LDLAPVLAKN NNAEEREQLA HIVELSMNW