Gene CNC05340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC05340 
Symbol 
ID3256278 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1595654 
End bp1597523 
Gene Length1870 bp 
Protein Length355 aa 
Translation table 
GC content50% 
IMG OID638255752 
Productd-arabinitol 2-dehydrogenase, putative 
Protein accessionXP_569729 
Protein GI58265146 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.656485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCCAAGTC ACTTTTGTAC CCTCTTCACA ACTTTTAACC ATGTCCTTCA TCCGCTCTAG 
CCTTTTCAAG GCCACTGCCA ATCCCATCAG GCGATCTGCC TTTGCTACCA CTCCACTTCG
AGCCTTCACC AGGTCCGCTC TTGTCAGCAA CAACAAGAAG GACGATGGTT ACGAGGAGCA
TCGAGTCGAG ATTGAGCCCA AGATCGCTGC TGTCGACGAG AGTTTCACGT TTGAACACCC
TGAGGTGAGC TTTTGAGCTT TTCCATCCAT GCAATCCATG TCGGTCGGTG AGATTCCCAA
CACCTGGTTG TGGAGTCCCG GTCACGCATG TTTCTTTTGT TTCAAAGGAT CTCGTACCCC
AGTGCGCTGT TTCTGCACCT GTGTTGACAC GTGATATGAT GTAGAAATGG GTAGACAAGC
ATCCTGGTCA TGATATGCAG CGAGGTGATT TTGGTCGACA CACCAAGCGA ACTCTTGCAT
CTTTCTCTAT GGACGGCAAG GTCTGCCTTG TCACTGGTGC AGCTCGAGGT CTTGGTAACA
TGATGGCCAG GACTTTTGTT GAATCGTGAG TACCGATGCT TTTTTTCATG GCCCCAATCC
CGGCGCTTTT GGACTTTCCA TACCGATATG TGCGGCAAAG GGACGAATGA GGCGGCGGCC
ATCGCCGATT TCTTAAACTG CCGCGACGTT TGGTCCCGGG CCTTTTTTCG GTCGTTGGCA
CATGCCATTG ACTTGTTCTG TTCCTGATCG GTGCTGATGG CGACTTGAAA GCGGCGCGAA
CGCCATTGTC CTTGTCGATC TCAAGAAGGA GGATGCCGAG CGTGCAGCCA AGGAGCTCGT
TGACTGGTTT GGTGAGTATT GTATTCTCTT ATTCGCCGTG AACTCTGTTA ACGTGTCATC
ATGTAGTCGA GAACGGTGAA GCCGAGAAGG GTGAAATTGA GGCTATTGGT CTCGGTTGCG
ACGTTTCCGA CGAGGCCTCT GTCAAGCAGG TCTTTAGCAC CGTCAAGGAG AGATTCGGCC
GGCTTGACGC TGTCGTCACT GCTGCCGGTA TTGTCGAAAA CTTTGTCGCT CACGAGTACC
CCATCGATAA GATCAAGAAG CTGTTGGACA TCAACATTAT GGGTACTTGG TATTGCGCAC
TTGAGGCTGC CAAGCTTATG CCTGAAGGTG GTTCCATTAC CCTCGTCGCA TCTATGAGCG
GTAGCGTAAG CCTATTCACT TACCGCACTT TGATATCTGC TAACTGGATA ATCACAGATT
GTCAACGTTC CTCAACCTCA AACCCCTTAC AACTTTTCCA GTGGGTCTTT TTTGATCCAT
GAATAACGTG TTTGAATGCT GACTGTGACG CAGAGGCTGC TGTGCGACAC ATGGCTCGAT
CCCTCGCCGT CGAATGGGCT CTCAAGGGTA TCCGGTATGT TAGTTTCTCC TGTGACCGAT
GACCAAAAAT TAACCACCAT GCAGTGTCAA CGCTCTTAGT CCGGGTTACG TCCTCACCAA
CTTGACTAAG GTCATTCTCG ACGCCAACCC CGTTCTCCGT GACGAGTGGC TCAACCGTAT
CCCCATGGGT CGAATGGCCG ACCCTTCTGA TCTCAAGGGT GCCGTCATTT ACCTTGCTTC
TGACAGCTCC AAGTACACCA CTGGTGCTGA GATCATGATT GACGGCGGTT ACACTTGCTT
GTAAGCGGTG ATCTCCAAAG AAGTGAATGC ACGTTGGGAT TTTACAGGAC GAGAGAACTT
CATGGACATT GTTGCTTGGC TCGATAGGAG GATGTGTATG GTTTTAAAAG TAATAGAAAG
GGCGGTACAT AATTGTGAAT CCGTAAATTA GAGAATACAT TGGAATCGGC AGAATGCATA
CATAGAATAA
 
Protein sequence
MSFIRSSLFK ATANPIRRSA FATTPLRAFT RSALVSNNKK DDGYEEHRVE IEPKIAAVDE 
SFTFEHPEKW VDKHPGHDMQ RGDFGRHTKR TLASFSMDGK VCLVTGAARG LGNMMARTFV
ESGANAIVLV DLKKEDAERA AKELVDWFVE NGEAEKGEIE AIGLGCDVSD EASVKQVFST
VKERFGRLDA VVTAAGIVEN FVAHEYPIDK IKKLLDINIM GTWYCALEAA KLMPEGGSIT
LVASMSGSIV NVPQPQTPYN FSKAAVRHMA RSLAVEWALK GIRVNALSPG YVLTNLTKVI
LDANPVLRDE WLNRIPMGRM ADPSDLKGAV IYLASDSSKY TTGAEIMIDG GYTCL