Gene CNA02580 details

Gene Information

Locus tag: CNA02580
Symbol:
ID: 3253605
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006670
Strand:
Start bp: 675737
End bp: 677601
Gene Length: 1865 bp
Protein Length: 416 aa
Translation table:
GC content: 49%
IMG OID: 638252590
Product: sorbitol dehydrogenase, putative
Protein accession: XP_566646
Protein GI: 58258467
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
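
The Start bp, End bp, and Gene Length values above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here, and the function name `gene_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    itself does not state the convention).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(675737, 677601))  # 1865
```

With 0-based, half-open coordinates the same interval would be written as 675736 to 677601, and the length would be a plain difference.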


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACAATCATG CCTACTTCAC TTCCACTCGA ATCCAAGCAA CCTCCCAAAC CCGAGGTATT 
TGAGGCCAAG AGTAATCTTG GTTTCATGCT TCACTCACCT CTCAAAACGA GCTTTGAAGA
GGTATCCCAT CATTTTGCAT TCAGCGTCAG AGCGCTATAT TTGTTCTTTG CTGATGAGCT
CCACAGCAAT CTGTTCCCGA AATTGGACCA GACGAAGTTT TAGTGGAGAT TAAGAAAACG
GTATGTATAT CTCCTGAATG AACGGATGGT CATTGGTAAC ATCTTCTAAG GGTATTTGTG
GCTCTGATGT TCACTGTCGG TACCCCTATC TTTTAGCATG GTCAGAACAA TCTCTGGCTA
ACGACTGATA TTTCTCTCGT AGTTTATAAC ACTGGTAAAA TGGGTCTTGC TGCTCTCACA
GAGTCCATGT GCTTGGGTCA TGAATCATCG GGCATCGTTG TTCAGCTTGG CTCTAACATT
GTCCAGCAAG CGGCTCGCTC CAATGTGATG GCGACCGCGC GAGGAGAGGC CGAAGAGTCC
AACAAAGGCA CTGTGTCTAA TAGGCCGCTT CAAGTAGGAG ACAAAGTCGC TCTTGAACCG
GGAGTTACTT GCCGGATGTG CGTAGACTGC AAGGGCGGCA AATATCAGGT GCATACCCTT
TCCACGTCCC TTGTAGAAGT AGCGAGAAAC TGATGCTCTA TAGATATGCG AGCACATGAT
ATTTGCGGCC TACCCACCAT CCACAGGTGG TACCCTCCAG CGTTATTACG CTTTGTTCGT
CGGTCTCTCG GAATGATGTC AAACCGAGAC TGACATCTCT TCCGCAGACC TGCCGACCTT
GTCTACCCTC TTCCTGACAA TGTTGACCTC TCTTTCGGGG CCATGATGGA ACCTCTTTCT
GTGGCTACTC ATGCAGTTGC TAATATTGGG GGTATGCGTA CCGGCTGGAA CGTCCTCATC
ACTGGTGCGG GTCCAGTAGG GTTGTTGGCT ATGGCTGTTG CCAAAGGCTT AGGGGCTGGG
AAGGTGATAG CCGTAGACAT TAATGAAGAA AGGTTGCATT TCGCGAAGCA GTACGCGGCC
ACAGATACCT ACATCCCTGT ACGTTTCCTT GTGACAATGT GAACGGCGCT GATGACGGTT
CCGATGCTCC TGACAGATCC CGCCAAATGA AGGAGAGTCC AGGGGCGATC ATGCCGTTCG
GGCGGCTGAG GACCTTCTTC GTTCCACTGG TACCCCCGCT CGCGGCCCAG GCTCCATTGA
TCTGGTTGTC GACGCAACAG GTGCCGAGAC CTGCGTGTTA ATGGGTTTGA ATGCCATCAA
GCCAGGGTAG GTTTCTCGCA TGTTACGCGT CATTATTGAG GATGGTCCAT ACTGAGACTG
ATATAGGGGG ATCTATGTGC AAATTGGTTT TGGTCCTCCC AACGTGTCTG TCCCTATGTT
CCGGATTGTC ACGAACGAGA TCACTATCCG AGGTGCATGG CGGTGAGTTC AACCAAGTCA
CGCGTACCAG CATAACCTGA CTCATTCTGT CCTAAAAGTT ATGGCTCTGG CGATTATCCT
CTCGCCATTG ATATGGTTGC TCGAGGTCTT GTCAATCTTA AACCTCTTTT AACGCATACC
TTCAAGTTCG AAGACGCCCT TGAGGCATTT GAGATTACCA AAAACGGGAG GGATAAGAAT
GGAAAAGGTG TTATCAAATG TGTCATTGAC GGACCTGAGT AGTGGATTGT TTTCTCGAGA
CATTTTCATA AGATGGGGGA GGGGTATTAT GTTCATTCAC GGACGTTGTC CGTATGCCGT
TCGGTTTTTT TGAAGTTGAT CCGTAAATTG TATATGCGAC GAATATTGTG AAGATCGCAA
CCGTA
 
Protein sequence
MPTSLPLESK QPPKPEVFEA KSNLGFMLHS PLKTSFEEQS VPEIGPDEVL VEIKKTGICG 
SDVHFYNTGK MGLAALTESM CLGHESSGIV VQLGSNIVQQ AARSNVMATA RGEAEESNKG
TVSNRPLQVG DKVALEPGVT CRMCVDCKGG KYQICEHMIF AAYPPSTGGT LQRYYALPAD
LVYPLPDNVD LSFGAMMEPL SVATHAVANI GGMRTGWNVL ITGAGPVGLL AMAVAKGLGA
GKVIAVDINE ERLHFAKQYA ATDTYIPIPP NEGESRGDHA VRAAEDLLRS TGTPARGPGS
IDLVVDATGA ETCVLMGLNA IKPGGIYVQI GFGPPNVSVP MFRIVTNEIT IRGAWRYGSG
DYPLAIDMVA RGLVNLKPLL THTFKFEDAL EAFEITKNGR DKNGKGVIKC VIDGPE
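
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before they can be compared with the reported Gene Length, Protein Length, or GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation assumes a plain count of G and C over all bases, which may differ from how the database derives its figure.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces between groups and line breaks are both removed.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (simple count)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two groups of the gene sequence, used as a small example.
fragment = clean_sequence("CACAATCATG CCTACTTCAC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Applying `clean_sequence` to the full gene or protein block and taking `len` of the result gives a value that can be checked against the lengths listed under Gene Information.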