Gene CNA01050 details


Gene Information

Locus tag: CNA01050
Symbol:
ID: 3253440
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 287322
End bp: 288844
Gene Length: 1523 bp
Protein Length: 379 aa
Translation table:
GC content: 50%
IMG OID: 638252437
Product: sorbitol dehydrogenase, putative
Protein accession: XP_566539
Protein GI: 58258253
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
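
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive (which agrees with the reported gene length) and that the coding sequence ends in a single stop codon. The function name `gene_length` and the constant names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 287322
END_BP = 288844
PROTEIN_LENGTH_AA = 379

length = gene_length(START_BP, END_BP)

# Assumption: one stop codon follows the last amino acid codon.
coding_nt = (PROTEIN_LENGTH_AA + 1) * 3

print(length)              # 1523, matches the reported gene length
print(coding_nt)           # 1140 nucleotides needed to encode 379 aa plus a stop
print(length - coding_nt)  # 383 nucleotides that are not coding
```

Because the gene is marked as spliced, the remainder is plausibly made up of introns and any untranslated flanking sequence included in the gene coordinates; the record itself does not break this down.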


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.917438
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGCAGGTCT GATACCACGA TGTCTACCGA ACTCAACCCA GACAACACCA GCTTTGTCCT 
CCACGGCGTC GAAGACGTCA GGTTCGACCA GGTACGTCAT CATGGACACT CCTCTCGGAA
CATCCACGTA TTGCTTACGC CCCATTCTCC CGCAGCGTCC CATCCCCGAG GTCCACAATG
ACCAAGTCCT CATCAAGGTC GTCAAGACTG GTATCTGCGG CTCTGATGTG CACTACCTTC
AGCATGGACG TATCGGCTCT TTTGTCCTCG AGGAACCCAT GTGTCTGGGT CACGAGTCAG
CTGGTGTCGT CGTCAAGCTC GGTCCTAACG TGAGAGAGGA TCTAGGTGTC GAAGTTGGCA
CCAGAGTGGC TATGGAGCCT GGTGTTTGCT GTAGGTCTTG TGCCAATTGT AAAGCTGGCT
TGTACGAGGT AAGTTTGAAT TTAAGCAAAG CTTTTAAAAG GCGTCCGTGG TCCGCCCCGC
TGATGAAACG CAGCTCTGTC CTTACATGAG CTTTGCCGCT ACTCCCCCTA CCATCTTTGG
TACACTCTGT CGATACTATG TGCTCCCTGC TGACCTTGTC CACCCTCTTC CCGAATCCGT
TTCCTTTGAG GATGGTGCTA TGATGGAACC CCTCTCCGTC GGTGTCCACT CTGTGGCCAC
CTTGGGAGGG TGCAAGTCTG ACCAGACAGT CATTGTCTTT GGTGCCGGAC CCGTTGGACT
GTTGTGTATG GCTGTTGCCA AGGCCCTGGG AGCGAGGAGG ATTATTGCTG TGGATATCAA
CAAGGAAAGA CTGGAATTCG CCAAGAGTTA CGCTGCCACT GATGTCTGCA TACCTGTAAG
TGCCCTATCG TTTTTAAAGT AGTTGTCATA AGTAATGGGA AGAGGTAGGG TTCTAAATTG
GACGGCGAAG ACGGAGAAGC GTACACCGCC CGAATAGCTG GTGAACTTCG TCAGGAGCTC
GGCATTCCCG AGCGAGGAAA GGGTGCCATC GATCTCGCCA TCGAAGCATC CGGTGCGCCT
ACTTGTGTTC AAATCGGTTT GGCCGTGTTG AAACCTGCGT ACGTTTTGTC AAAATGCATA
CCCATTTATC ACCGAACTGA CCAGGACATT CACACAACTA GCGGCACTTA CGTCCAAGTT
GGTATGGGCG CCAAGATGAC CGTCCCCGTT CCCCTCTTCC ACATCATCTC CAAGCAACTC
CACGTTGTCG GTTCCTTCAG ATACGGTTCC GGCGACTACC CTTTGGCCAT TTCACTTGTT
GAAAGGGGAT TGATCGACTT GAAGCCGTTG GTCACTCAGA GGTTCAAGTT TGAAAATGCC
AAAGAGGCGT TTGAGACCAC AAAGGTTGGA AAAGACAAGA ATGGGAAGGG CGTGATCAAG
TGTATCATCG ATGGACCGGA GTAAAATAAT AATAATAACG GTGGATTTCA TAGGGGTTAT
AGAAGGGGTT TTATTGTAGA TCGTAAACTA GAAAAAAAGC TTACACAAAT AGTCAGTTGG
TTACATGAAT TGTGTTTATA GAT
 
Protein sequence
MSTELNPDNT SFVLHGVEDV RFDQRPIPEV HNDQVLIKVV KTGICGSDVH YLQHGRIGSF 
VLEEPMCLGH ESAGVVVKLG PNVREDLGVE VGTRVAMEPG VCCRSCANCK AGLYELCPYM
SFAATPPTIF GTLCRYYVLP ADLVHPLPES VSFEDGAMME PLSVGVHSVA TLGGCKSDQT
VIVFGAGPVG LLCMAVAKAL GARRIIAVDI NKERLEFAKS YAATDVCIPG SKLDGEDGEA
YTARIAGELR QELGIPERGK GAIDLAIEAS GAPTCVQIGL AVLKPAGTYV QVGMGAKMTV
PVPLFHIISK QLHVVGSFRY GSGDYPLAIS LVERGLIDLK PLVTQRFKFE NAKEAFETTK
VGKDKNGKGV IKCIIDGPE
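
Both sequences above are printed in blocks of ten characters, so the spacing has to be removed before they can be measured or passed to other tools. The sketch below shows one way to do that and to compute the length and GC fraction of a DNA string, which can then be compared with the gene length and GC content reported in the Gene Information table. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("sequence is empty")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied with its original spacing.
first_line = "GTGCAGGTCT GATACCACGA TGTCTACCGA ACTCAACCCA GACAACACCA GCTTTGTCCT "

dna = clean_sequence(first_line)
print(len(dna))          # 60 bases on one full line
print(gc_fraction(dna))  # GC fraction of this line only, not of the whole gene
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` gives the values for the whole gene; the same cleaning step works for the protein sequence, where only the length is meaningful.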