Gene CNG03080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG03080 
Symbol 
ID3258570 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp857074 
End bp859600 
Gene Length2527 bp 
Protein Length519 aa 
Translation table 
GC content47% 
IMG OID638257930 
Productaldehyde dehydrogenase, putative 
Protein accessionXP_572008 
Protein GI58269704 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.611345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGACATCGA AAGAAGGAAA CAGATACTTC TTGCCCCTCT AGTCGATCCA AGCGCAATGC 
TCGTCAAATC CCGTCCCCGT ATATCACCTT TATTCGCTGG TATGCGCTAT TATTCCCAGA
TCAGCCGGGT TCCTCTTTGG ATTGGAGGAA AGCAAGTAAC GTCAACTAGC CGCGGAACAA
TCATTCACAA GCACCCGAGA ACAGGGCAGA AAAGCTGTGA GATGGTTATT GCCGGTGAAC
TGGAAACGTG AGCTATAAGT TCGTTTTGAA CTTTCGTCGA TAAAAGCTAA AGGGAAGGAC
ACATCATGAT ATCAGGAGCG AAGCAATTCG TCACAGTCAC GAAGCTTATA AGTCATGGAG
TATGGTCTCC GGTTGGGAAA GACGGTCCAT ATTACAAAAT GTACGTTTAC TCAATGTACT
GTCCTGCTGT GAATATACAG TTTCAGTGGC TGATCGTTCT CTGACTCAGG CCTTACGATT
GCTGAAGGAG CGGTCGGCAA ATATTGCGAG CCTGCTTCGT GCAGATGCCG ACTTTTCAGA
CCTTCTTGTT CACAGCGATA TAAACTCTTC TATAAACTTA TTGGACGGTT CTGCTGAAAC
GTGAGCTGTT TCCCAGAAAT ATATACTTGT GGCCCATTGC TAATATGCTA ATAGAGCAAT
ATCTATTGAA GCAAGGCAAA AACTGTCATC GCTCAGGGTC TTCGGCTGAT GCAGGCTTTT
ACCTAGGGAT ATATGCCACA GACCGTAGAT GGTAGCTTGG CCATGGTAAT GAAGGAACCA
CACGGGCCGG TTCTCAGCAT ACCTGCATTC AATTTCCCAC TCACATTGGC AATGCGTAGC
ATAGGTAAGC ATGTATCGGC TATCCTCAGA TAGTCATGCG CGAACACTCA ACGGCGCAAT
ATCATGACTT ACAGTCTATC CCATCGCTTG CAGTAACACT GTGGTCATGA AGGGTGAGCA
TGGAATACTT TCCTCCAATA CCGATGGGAA AATTGACCTT CTCTCATAGC ATCACCACTT
GTACCTCAGC TTTCAACCTT CATAGCTGCG CTATTCAATG ACGCAGGGCT TCCTCCCGGT
GTGCTTCAAA TATTGAGTTT CTCCGAAGAT GAAGTGGGCA AAAGAGTCAA GCAGCTTATT
TCTCACGATG ACATACGTGT ACGACTATAC CGGCGAAGGC AACTTCCTCT AAAGGTGACT
GCTTCTCGCT GACATGATGA ATACAGTTTG TCAACTTCAC TGGTTCAGTC AGTTTAGGCA
AACAGCTCGC AAGCCTTTGT GGACAATATC TTAAGTATGT GATAATATTG GGAACAAGTC
AGGTGTTTCG GTCTAGACAC TGACAATACT GTATTGAAGA CCTTCAGTCA TGGAGCTGGG
GGGTAAAGCC CCAGCCATTG TTTTACCCTC CGCTAATTTA CAGTTGGCGG CCAATCACGT
GAGTTTCGTC TCTATTATTT TTTCCACTAT GATTTTTGGT GTTGAGAAGT GAGGCTTCTG
CAGATTTTAT TTGGGGCATT TCTTAACTCG GGACAAGTCT GCATGTCAAT GGAAAGAGTC
ATCGTCCATG AGAAAATTGC TGAAGAATTC GAGCAAGTTC TCAAATCAGA GGCCATCAAG
GCAGGCTGGG CTGGGGGCAT GGAGTTGGTC AGATCGGGAG CAGGAGAAAG AGCAAAACAT
ATGGTTGACC AAGCTGTGAG GATGGTAAGA GTATTCTGCA AGCTGTTCAA TTACATGCGG
CTAACAGAAT ATCTCGCTAG GGAGCCAGAG TGATTTACAG CGCAGGAGCA GACGGCTCAG
CCGCGTCACT TTCGTCGAAG TCTGCATACC CGCCAACGAT CCTAGCTGAT GTTCGCCGCG
ATGCGGATCT CTTTCAGGTC GAGTCATTTG CCCCCATTTT GACGGTCCAG TCGGGGTCTG
ATTTATCTTC AATAATTGCC ATGGCGAACT CTCACGAAAC AGGACTTTCG GCGTCTGTCT
TTTGTCAAGA CCTCGCTTTG GCACTCAAAG TGGCACAAAG TTTACAGTCT GGGGCCGTCC
ATATAAATGG TATGAGCGTC CACGATGAGC ATGGACTGCC TCATGGGGGC ACAAAAAGCA
GCGGATGGAG CAGATTCAAT GGCAAAGGCG CCATTGAAAG TTTTACTCAG ACGAAAGTCA
TTCGAATTAA TGGAACAAAT CTCTCATTGC CATTGTCAGC GCTCTACCGT GGCTTACCAG
AGAATGGTTC TAGTGAAATT TAATCTATTG CGGCCACAGT ACATTACCCC AGTCATATTT
GCTTTTGGTC CGCTTTTGGC CTGGCGAGTA CTTGCTGGGC ACTCCGGTGT GCAGTTGAAA
GAAAAATTAT TTCAAGATAG TAGAGCAACA TTGTCAGTCA TCGACGTCGG GCAATGTTGC
AGGTGGTGGG CTAAGTTTCC ACAGTGATGT TATCATGTTG ACGCTGTCCG GCACTGTCAT
TATGCGTGCG TGACTGAAGT CTCATAGCAG GCAGCATCAT GCATCATGCA TCATTCGTTG
ACACTTA
 
Protein sequence
MLVKSRPRIS PLFAGMRYYS QISRVPLWIG GKQVTSTSRG TIIHKHPRTG QKSCEMVIAG 
ELETSEAIRH SHEAYKSWSM VSGWERRSIL QNALRLLKER SANIASLLRA DADFSDLLVH
SDINSSINLL DGSAETAISI EGYMPQTVDG SLAMVMKEPH GPVLSIPAFN FPLTLAMRSI
VYPIACSNTV VMKASPLVPQ LSTFIAALFN DAGLPPGVLQ ILSFSEDEVG KRVKQLISHD
DIRFVNFTGS VSLGKQLASL CGQYLKPSVM ELGGKAPAIV LPSANLQLAA NHILFGAFLN
SGQVCMSMER VIVHEKIAEE FEQVLKSEAI KAGWAGGMEL VRSGAGERAK HMVDQAVRMG
ARVIYSAGAD GSAASLSSKS AYPPTILADV RRDADLFQVE SFAPILTVQS GSDLSSIIAM
ANSHETGLSA SVFCQDLALA LKVAQSLQSG AVHINGMSVH DEHGLPHGGT KSSGWSRFNG
KGAIESFTQT KVIRINGTNL SLPLSALYRG LPENGSSEI