Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA01050 |
Symbol | |
ID | 3253440 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 287322 |
End bp | 288844 |
Gene Length | 1523 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252437 |
Product | sorbitol dehydrogenase, putative |
Protein accession | XP_566539 |
Protein GI | 58258253 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.917438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGGTCT GATACCACGA TGTCTACCGA ACTCAACCCA GACAACACCA GCTTTGTCCT CCACGGCGTC GAAGACGTCA GGTTCGACCA GGTACGTCAT CATGGACACT CCTCTCGGAA CATCCACGTA TTGCTTACGC CCCATTCTCC CGCAGCGTCC CATCCCCGAG GTCCACAATG ACCAAGTCCT CATCAAGGTC GTCAAGACTG GTATCTGCGG CTCTGATGTG CACTACCTTC AGCATGGACG TATCGGCTCT TTTGTCCTCG AGGAACCCAT GTGTCTGGGT CACGAGTCAG CTGGTGTCGT CGTCAAGCTC GGTCCTAACG TGAGAGAGGA TCTAGGTGTC GAAGTTGGCA CCAGAGTGGC TATGGAGCCT GGTGTTTGCT GTAGGTCTTG TGCCAATTGT AAAGCTGGCT TGTACGAGGT AAGTTTGAAT TTAAGCAAAG CTTTTAAAAG GCGTCCGTGG TCCGCCCCGC TGATGAAACG CAGCTCTGTC CTTACATGAG CTTTGCCGCT ACTCCCCCTA CCATCTTTGG TACACTCTGT CGATACTATG TGCTCCCTGC TGACCTTGTC CACCCTCTTC CCGAATCCGT TTCCTTTGAG GATGGTGCTA TGATGGAACC CCTCTCCGTC GGTGTCCACT CTGTGGCCAC CTTGGGAGGG TGCAAGTCTG ACCAGACAGT CATTGTCTTT GGTGCCGGAC CCGTTGGACT GTTGTGTATG GCTGTTGCCA AGGCCCTGGG AGCGAGGAGG ATTATTGCTG TGGATATCAA CAAGGAAAGA CTGGAATTCG CCAAGAGTTA CGCTGCCACT GATGTCTGCA TACCTGTAAG TGCCCTATCG TTTTTAAAGT AGTTGTCATA AGTAATGGGA AGAGGTAGGG TTCTAAATTG GACGGCGAAG ACGGAGAAGC GTACACCGCC CGAATAGCTG GTGAACTTCG TCAGGAGCTC GGCATTCCCG AGCGAGGAAA GGGTGCCATC GATCTCGCCA TCGAAGCATC CGGTGCGCCT ACTTGTGTTC AAATCGGTTT GGCCGTGTTG AAACCTGCGT ACGTTTTGTC AAAATGCATA CCCATTTATC ACCGAACTGA CCAGGACATT CACACAACTA GCGGCACTTA CGTCCAAGTT GGTATGGGCG CCAAGATGAC CGTCCCCGTT CCCCTCTTCC ACATCATCTC CAAGCAACTC CACGTTGTCG GTTCCTTCAG ATACGGTTCC GGCGACTACC CTTTGGCCAT TTCACTTGTT GAAAGGGGAT TGATCGACTT GAAGCCGTTG GTCACTCAGA GGTTCAAGTT TGAAAATGCC AAAGAGGCGT TTGAGACCAC AAAGGTTGGA AAAGACAAGA ATGGGAAGGG CGTGATCAAG TGTATCATCG ATGGACCGGA GTAAAATAAT AATAATAACG GTGGATTTCA TAGGGGTTAT AGAAGGGGTT TTATTGTAGA TCGTAAACTA GAAAAAAAGC TTACACAAAT AGTCAGTTGG TTACATGAAT TGTGTTTATA GAT
|
Protein sequence | MSTELNPDNT SFVLHGVEDV RFDQRPIPEV HNDQVLIKVV KTGICGSDVH YLQHGRIGSF VLEEPMCLGH ESAGVVVKLG PNVREDLGVE VGTRVAMEPG VCCRSCANCK AGLYELCPYM SFAATPPTIF GTLCRYYVLP ADLVHPLPES VSFEDGAMME PLSVGVHSVA TLGGCKSDQT VIVFGAGPVG LLCMAVAKAL GARRIIAVDI NKERLEFAKS YAATDVCIPG SKLDGEDGEA YTARIAGELR QELGIPERGK GAIDLAIEAS GAPTCVQIGL AVLKPAGTYV QVGMGAKMTV PVPLFHIISK QLHVVGSFRY GSGDYPLAIS LVERGLIDLK PLVTQRFKFE NAKEAFETTK VGKDKNGKGV IKCIIDGPE
|
| |