Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG03080 |
Symbol | |
ID | 3258570 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 857074 |
End bp | 859600 |
Gene Length | 2527 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257930 |
Product | aldehyde dehydrogenase, putative |
Protein accession | XP_572008 |
Protein GI | 58269704 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.611345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGACATCGA AAGAAGGAAA CAGATACTTC TTGCCCCTCT AGTCGATCCA AGCGCAATGC TCGTCAAATC CCGTCCCCGT ATATCACCTT TATTCGCTGG TATGCGCTAT TATTCCCAGA TCAGCCGGGT TCCTCTTTGG ATTGGAGGAA AGCAAGTAAC GTCAACTAGC CGCGGAACAA TCATTCACAA GCACCCGAGA ACAGGGCAGA AAAGCTGTGA GATGGTTATT GCCGGTGAAC TGGAAACGTG AGCTATAAGT TCGTTTTGAA CTTTCGTCGA TAAAAGCTAA AGGGAAGGAC ACATCATGAT ATCAGGAGCG AAGCAATTCG TCACAGTCAC GAAGCTTATA AGTCATGGAG TATGGTCTCC GGTTGGGAAA GACGGTCCAT ATTACAAAAT GTACGTTTAC TCAATGTACT GTCCTGCTGT GAATATACAG TTTCAGTGGC TGATCGTTCT CTGACTCAGG CCTTACGATT GCTGAAGGAG CGGTCGGCAA ATATTGCGAG CCTGCTTCGT GCAGATGCCG ACTTTTCAGA CCTTCTTGTT CACAGCGATA TAAACTCTTC TATAAACTTA TTGGACGGTT CTGCTGAAAC GTGAGCTGTT TCCCAGAAAT ATATACTTGT GGCCCATTGC TAATATGCTA ATAGAGCAAT ATCTATTGAA GCAAGGCAAA AACTGTCATC GCTCAGGGTC TTCGGCTGAT GCAGGCTTTT ACCTAGGGAT ATATGCCACA GACCGTAGAT GGTAGCTTGG CCATGGTAAT GAAGGAACCA CACGGGCCGG TTCTCAGCAT ACCTGCATTC AATTTCCCAC TCACATTGGC AATGCGTAGC ATAGGTAAGC ATGTATCGGC TATCCTCAGA TAGTCATGCG CGAACACTCA ACGGCGCAAT ATCATGACTT ACAGTCTATC CCATCGCTTG CAGTAACACT GTGGTCATGA AGGGTGAGCA TGGAATACTT TCCTCCAATA CCGATGGGAA AATTGACCTT CTCTCATAGC ATCACCACTT GTACCTCAGC TTTCAACCTT CATAGCTGCG CTATTCAATG ACGCAGGGCT TCCTCCCGGT GTGCTTCAAA TATTGAGTTT CTCCGAAGAT GAAGTGGGCA AAAGAGTCAA GCAGCTTATT TCTCACGATG ACATACGTGT ACGACTATAC CGGCGAAGGC AACTTCCTCT AAAGGTGACT GCTTCTCGCT GACATGATGA ATACAGTTTG TCAACTTCAC TGGTTCAGTC AGTTTAGGCA AACAGCTCGC AAGCCTTTGT GGACAATATC TTAAGTATGT GATAATATTG GGAACAAGTC AGGTGTTTCG GTCTAGACAC TGACAATACT GTATTGAAGA CCTTCAGTCA TGGAGCTGGG GGGTAAAGCC CCAGCCATTG TTTTACCCTC CGCTAATTTA CAGTTGGCGG CCAATCACGT GAGTTTCGTC TCTATTATTT TTTCCACTAT GATTTTTGGT GTTGAGAAGT GAGGCTTCTG CAGATTTTAT TTGGGGCATT TCTTAACTCG GGACAAGTCT GCATGTCAAT GGAAAGAGTC ATCGTCCATG AGAAAATTGC TGAAGAATTC GAGCAAGTTC TCAAATCAGA GGCCATCAAG GCAGGCTGGG CTGGGGGCAT GGAGTTGGTC AGATCGGGAG CAGGAGAAAG AGCAAAACAT ATGGTTGACC AAGCTGTGAG GATGGTAAGA GTATTCTGCA AGCTGTTCAA TTACATGCGG CTAACAGAAT ATCTCGCTAG GGAGCCAGAG TGATTTACAG CGCAGGAGCA GACGGCTCAG CCGCGTCACT TTCGTCGAAG TCTGCATACC CGCCAACGAT CCTAGCTGAT GTTCGCCGCG ATGCGGATCT CTTTCAGGTC GAGTCATTTG CCCCCATTTT GACGGTCCAG TCGGGGTCTG ATTTATCTTC AATAATTGCC ATGGCGAACT CTCACGAAAC AGGACTTTCG GCGTCTGTCT TTTGTCAAGA CCTCGCTTTG GCACTCAAAG TGGCACAAAG TTTACAGTCT GGGGCCGTCC ATATAAATGG TATGAGCGTC CACGATGAGC ATGGACTGCC TCATGGGGGC ACAAAAAGCA GCGGATGGAG CAGATTCAAT GGCAAAGGCG CCATTGAAAG TTTTACTCAG ACGAAAGTCA TTCGAATTAA TGGAACAAAT CTCTCATTGC CATTGTCAGC GCTCTACCGT GGCTTACCAG AGAATGGTTC TAGTGAAATT TAATCTATTG CGGCCACAGT ACATTACCCC AGTCATATTT GCTTTTGGTC CGCTTTTGGC CTGGCGAGTA CTTGCTGGGC ACTCCGGTGT GCAGTTGAAA GAAAAATTAT TTCAAGATAG TAGAGCAACA TTGTCAGTCA TCGACGTCGG GCAATGTTGC AGGTGGTGGG CTAAGTTTCC ACAGTGATGT TATCATGTTG ACGCTGTCCG GCACTGTCAT TATGCGTGCG TGACTGAAGT CTCATAGCAG GCAGCATCAT GCATCATGCA TCATTCGTTG ACACTTA
|
Protein sequence | MLVKSRPRIS PLFAGMRYYS QISRVPLWIG GKQVTSTSRG TIIHKHPRTG QKSCEMVIAG ELETSEAIRH SHEAYKSWSM VSGWERRSIL QNALRLLKER SANIASLLRA DADFSDLLVH SDINSSINLL DGSAETAISI EGYMPQTVDG SLAMVMKEPH GPVLSIPAFN FPLTLAMRSI VYPIACSNTV VMKASPLVPQ LSTFIAALFN DAGLPPGVLQ ILSFSEDEVG KRVKQLISHD DIRFVNFTGS VSLGKQLASL CGQYLKPSVM ELGGKAPAIV LPSANLQLAA NHILFGAFLN SGQVCMSMER VIVHEKIAEE FEQVLKSEAI KAGWAGGMEL VRSGAGERAK HMVDQAVRMG ARVIYSAGAD GSAASLSSKS AYPPTILADV RRDADLFQVE SFAPILTVQS GSDLSSIIAM ANSHETGLSA SVFCQDLALA LKVAQSLQSG AVHINGMSVH DEHGLPHGGT KSSGWSRFNG KGAIESFTQT KVIRINGTNL SLPLSALYRG LPENGSSEI
|
| |