Gene CNK02940 details


Gene Information

Locus tag: CNK02940
Symbol:
ID: 3254501
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 862222
End bp: 863495
Gene Length: 1274 bp
Protein Length: 367 aa
Translation table:
GC content: 52%
IMG OID: 638253785
Product: alcohol dehydrogenase (NADP+), putative
Protein accession: XP_567889
Protein GI: 58260958
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
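
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon; the function names are illustrative and not part of any database API. Under those assumptions, the bases not accounted for by codons would correspond to introns, which is consistent with the gene being listed as spliced.

```python
# Values copied from the Gene Information table.
START_BP = 862222
END_BP = 863495
PROTEIN_LENGTH_AA = 367


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def non_coding_length(gene_length_bp, protein_length_aa, has_stop_codon=True):
    """Bases in the gene span not covered by codons (assumed to be introns)."""
    coding = 3 * protein_length_aa + (3 if has_stop_codon else 0)
    return gene_length_bp - coding


gene_length = span_length(START_BP, END_BP)
print(gene_length)                                        # 1274
print(non_coding_length(gene_length, PROTEIN_LENGTH_AA))  # 170
```

The first value matches the listed gene length of 1274 bp; the second is only an estimate of total intron length under the assumptions stated above.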


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.013941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGCCG ACAACGAATT CAAAGGCTGG GCTGGTTTGG ACGAGAAGGC CTGCGACGGC 
CACATGTCAT TCCAGGAGTT CACTCCCAAG AAGTGGGATG AAGACGACGT CGATGGTGAG
TATCGTCTGC TGGAGTAATG GAAGCCGACG GAGCTGACAT GTCAACAGTC AAAATTTTGT
ACTGCGGTAT CTGTGGTTCA GATGTCTCCT CTTTAACTGG TGAATGGGGA CCTGTAAAGG
ACATCTGCCC CCAGGTTTGC GGCCACGAAA TCGTTGGTGA GGTTGTGAGG GTCGGCACCT
CTCCCGAAAA CGGCCTTAAG ATTGGTGACC TCGTCGGTAT TGGTGCCCAG TCAGACTCTT
GTCGTGAATG CGAATGGTGC AAGGAAGGTC AGTAATCTCT ATTACAATCC AACTTGCAAC
AAGTACTGAC ATGTAGATGC TAGGCAAGGA AAACTACTGT GCTACCCAAA CCATCACTTT
TAACTACCCC TACAACCGTG GTCCCAATGG CAAGGGGTCC ATCGCCCGAG GTGGTTTTGC
CAAGTACTGG CGAGGACCTT CCAAGTTTGC TGTCCCGCTT CCTTCCGGCC TCGAGCCTGA
CGTTGCGGCA CCTATGCTTT GTGGTGGTGT CACCGTCTAC AGCCCCCTCG CCCGTTTTGA
AATCGGTACC AAGCGCAAGC GCGTCGGTGT CATCGGTGTC GGTGGTCTCG GTCACATGGC
TATCCTTTTT GCCAAGGCTA TGGGCGCCGA GGTGACTGCT ATCTCTCGAA CTGATGCGAA
GAAGGAGGAC GCCTTCAAGC TTGGTGCTAC CGATTACTTT GCTACTGGTG GTGACTTGCA
GGAGGCTGTC AAGGCTCGCA CTCGATCTCT CGACTTTATT CTCTGTACTA TCAGTAAGTC
ATCATCGTCA ATGGACTTCT ACCCCAACTT GTATGCTGAT TCTTGTACTA TCAGACCCTG
AAAGCTTCTC CATCAGCGAC TACCTCCCCC TCCTCACCCC CGCCGGTGTC TTCTGCATCG
TCGGCGTCAT CCCCACCCCT TTGCAAGTCC CCGCTTTCCC TCTTATCATG AACAGCGCTT
GCGTCGCCGG TTCCAACATC GGTAGCCCCA AGGAGATTAC TGAAATGTTC GAATTCGCCG
TTAAGCACAA CATTAAGCCT TGGATCCAGA AGTGGAACTT CGACGATATC AACAAGGCGT
TGCCTTCTTT CCAAAAAGGT GATCCTAGGT ATAGGTTCGT CTTGGTCAAC GCCGATAATG
GCGGCAAGCT TTAA
 
Protein sequence
MVADNEFKGW AGLDEKACDG HMSFQEFTPK KWDEDDVDVK ILYCGICGSD VSSLTGEWGP 
VKDICPQVCG HEIVGEVVRV GTSPENGLKI GDLVGIGAQS DSCRECEWCK EGKENYCATQ
TITFNYPYNR GPNGKGSIAR GGFAKYWRGP SKFAVPLPSG LEPDVAAPML CGGVTVYSPL
ARFEIGTKRK RVGVIGVGGL GHMAILFAKA MGAEVTAISR TDAKKEDAFK LGATDYFATG
GDLQEAVKAR TRSLDFILCT INPESFSISD YLPLLTPAGV FCIVGVIPTP LQVPAFPLIM
NSACVAGSNI GSPKEITEMF EFAVKHNIKP WIQKWNFDDI NKALPSFQKG DPRYRFVLVN
ADNGGKL
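
The sequences above are printed in blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming whitespace is the only formatting to strip; the helper names are hypothetical, and the GC fraction is computed over the full gene span (exons plus introns), which may or may not be how the listed GC content was derived.

```python
def clean_sequence(blocked):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, as printed on the page.
first_line = (
    "ATGGTTGCCG ACAACGAATT CAAAGGCTGG "
    "GCTGGTTTGG ACGAGAAGGC CTGCGACGGC"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Pasting the complete gene sequence into `clean_sequence` and comparing `len()` against the listed 1274 bp, or the protein sequence against 367 aa, is a quick way to confirm that nothing was lost in copying.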