Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC05340 |
Symbol | |
ID | 3256278 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1595654 |
End bp | 1597523 |
Gene Length | 1870 bp |
Protein Length | 355 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255752 |
Product | d-arabinitol 2-dehydrogenase, putative |
Protein accession | XP_569729 |
Protein GI | 58265146 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.656485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCCAAGTC ACTTTTGTAC CCTCTTCACA ACTTTTAACC ATGTCCTTCA TCCGCTCTAG CCTTTTCAAG GCCACTGCCA ATCCCATCAG GCGATCTGCC TTTGCTACCA CTCCACTTCG AGCCTTCACC AGGTCCGCTC TTGTCAGCAA CAACAAGAAG GACGATGGTT ACGAGGAGCA TCGAGTCGAG ATTGAGCCCA AGATCGCTGC TGTCGACGAG AGTTTCACGT TTGAACACCC TGAGGTGAGC TTTTGAGCTT TTCCATCCAT GCAATCCATG TCGGTCGGTG AGATTCCCAA CACCTGGTTG TGGAGTCCCG GTCACGCATG TTTCTTTTGT TTCAAAGGAT CTCGTACCCC AGTGCGCTGT TTCTGCACCT GTGTTGACAC GTGATATGAT GTAGAAATGG GTAGACAAGC ATCCTGGTCA TGATATGCAG CGAGGTGATT TTGGTCGACA CACCAAGCGA ACTCTTGCAT CTTTCTCTAT GGACGGCAAG GTCTGCCTTG TCACTGGTGC AGCTCGAGGT CTTGGTAACA TGATGGCCAG GACTTTTGTT GAATCGTGAG TACCGATGCT TTTTTTCATG GCCCCAATCC CGGCGCTTTT GGACTTTCCA TACCGATATG TGCGGCAAAG GGACGAATGA GGCGGCGGCC ATCGCCGATT TCTTAAACTG CCGCGACGTT TGGTCCCGGG CCTTTTTTCG GTCGTTGGCA CATGCCATTG ACTTGTTCTG TTCCTGATCG GTGCTGATGG CGACTTGAAA GCGGCGCGAA CGCCATTGTC CTTGTCGATC TCAAGAAGGA GGATGCCGAG CGTGCAGCCA AGGAGCTCGT TGACTGGTTT GGTGAGTATT GTATTCTCTT ATTCGCCGTG AACTCTGTTA ACGTGTCATC ATGTAGTCGA GAACGGTGAA GCCGAGAAGG GTGAAATTGA GGCTATTGGT CTCGGTTGCG ACGTTTCCGA CGAGGCCTCT GTCAAGCAGG TCTTTAGCAC CGTCAAGGAG AGATTCGGCC GGCTTGACGC TGTCGTCACT GCTGCCGGTA TTGTCGAAAA CTTTGTCGCT CACGAGTACC CCATCGATAA GATCAAGAAG CTGTTGGACA TCAACATTAT GGGTACTTGG TATTGCGCAC TTGAGGCTGC CAAGCTTATG CCTGAAGGTG GTTCCATTAC CCTCGTCGCA TCTATGAGCG GTAGCGTAAG CCTATTCACT TACCGCACTT TGATATCTGC TAACTGGATA ATCACAGATT GTCAACGTTC CTCAACCTCA AACCCCTTAC AACTTTTCCA GTGGGTCTTT TTTGATCCAT GAATAACGTG TTTGAATGCT GACTGTGACG CAGAGGCTGC TGTGCGACAC ATGGCTCGAT CCCTCGCCGT CGAATGGGCT CTCAAGGGTA TCCGGTATGT TAGTTTCTCC TGTGACCGAT GACCAAAAAT TAACCACCAT GCAGTGTCAA CGCTCTTAGT CCGGGTTACG TCCTCACCAA CTTGACTAAG GTCATTCTCG ACGCCAACCC CGTTCTCCGT GACGAGTGGC TCAACCGTAT CCCCATGGGT CGAATGGCCG ACCCTTCTGA TCTCAAGGGT GCCGTCATTT ACCTTGCTTC TGACAGCTCC AAGTACACCA CTGGTGCTGA GATCATGATT GACGGCGGTT ACACTTGCTT GTAAGCGGTG ATCTCCAAAG AAGTGAATGC ACGTTGGGAT TTTACAGGAC GAGAGAACTT CATGGACATT GTTGCTTGGC TCGATAGGAG GATGTGTATG GTTTTAAAAG TAATAGAAAG GGCGGTACAT AATTGTGAAT CCGTAAATTA GAGAATACAT TGGAATCGGC AGAATGCATA CATAGAATAA
|
Protein sequence | MSFIRSSLFK ATANPIRRSA FATTPLRAFT RSALVSNNKK DDGYEEHRVE IEPKIAAVDE SFTFEHPEKW VDKHPGHDMQ RGDFGRHTKR TLASFSMDGK VCLVTGAARG LGNMMARTFV ESGANAIVLV DLKKEDAERA AKELVDWFVE NGEAEKGEIE AIGLGCDVSD EASVKQVFST VKERFGRLDA VVTAAGIVEN FVAHEYPIDK IKKLLDINIM GTWYCALEAA KLMPEGGSIT LVASMSGSIV NVPQPQTPYN FSKAAVRHMA RSLAVEWALK GIRVNALSPG YVLTNLTKVI LDANPVLRDE WLNRIPMGRM ADPSDLKGAV IYLASDSSKY TTGAEIMIDG GYTCL
|
| |