Gene CNG00100 details

Gene Information

Locus tag: CNG00100
Symbol:
ID: 3258805
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 21364
End bp: 23073
Gene Length: 1710 bp
Protein Length: 392 aa
Translation table:
GC content: 49%
IMG OID: 638257622
Product: L-arabinitol 4-dehydrogenase, putative
Protein accession: XP_571725
Protein GI: 58269138
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
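
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length.

```python
# Values copied from the Gene Information table.
start_bp = 21364
end_bp = 23073
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus a stop codon.
cds_length_bp = 3 * (protein_length_aa + 1)

# Remaining positions: introns plus any untranslated flanking bases.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # 1710 1179 531
```

The 531 bp difference between the gene span and the implied coding length fits the "Is gene spliced: Yes" entry, although this arithmetic alone cannot separate intron bases from flanking bases.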


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.603371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACAATGAC CATTGCGGCT ACATCTATCC CTACCACCAA GTACGAGGCG CACTACGACC 
CCACAAAGGT CATTCAGCAT CCCGAGTTCC AAACGCTCAG TGAAGATGCT CCTGAGCTTT
CCGACCCCAA GCTCAACATC GCCTGTGCGT ACAACCCGGC TCATGAGATC CACATGGTCA
AGAAGCCTCG ATTCGAGCCC GGTCCTGGTG AAGTGACAAT CCACGTCCGT GCCACAGGAA
TCTGCGGGTA CGTTGTTCCT TCCTCCTCCC ATATCAAATC GGAACTATTT AAGAAGTTTA
ATGTTTGATG TAGTTCTGAC GTTCACTTCT GGAAACACGG CCATATTGGA CCTACCATGA
TTGTCACAGA TGAGTGCGGA GCAGGTCATG AATCGGCAGG CGAAATCGTT GCCGTTGGAG
AAGGCGTCGC ACAATGGCAG GTCGGTGACA GGGTAGCTGT TGAAGCCGGT GTTCCATGCG
GTCTCGCCTC ATGTGACCCT TGTCGTACCG GTCGTTACAA CGCTTGTCAG TATTCTAAGT
ATTCTTTCGA GGTTTGTCCC ACTGGAGGTG TTTATGTTTT TTGTCTAGGT CCTGCTGTCG
TCTTCTTCTC CACCCCTCCT TACCATGGTA CACTCACTCG ATATCACAAT CACCCGGCTG
CTTGGTGCCA CCGTCTCGCC GATAACGTGT CTTATGAAGA AGGATCCCTG TGTGAGCCTC
TTGCAGTGGC GCTGGCCGGT CTTGACAGGG CTGGTGTGAG ATTGGGGGAC CCTATTGCCA
TTTGGTAAGT ATACTCCCAC CTTGATATCT TCGTTTTCAA GTGCAGCGTT GACCATTGAT
AGCGGGGCGG GCCCTATAGG GTTAGTTACT CTTCTTGCTG CGCATGCTGC AGGTTGTACG
CCCATTGTCA TCACCGATCT CTTCCCATCC CGACTTGAGT TCGCCAAGAA GCTTCTTCCA
ACTGTCAAGA CTGTACAGAT TGAGAAGACT GCAAAGCCCG AAGAGGTTGC GAAGCAGATC
AAGGGCGCGG CGGGTATGCA GCTTTCGCTT GCATTTGACT GTACAGGAGT GGAGAGCAGT
ATCAGATCTG CTATCTTCGT AAGTCCTTGC TTGAAAGACA TCTTCACAGT TTCTAACACT
AACTCTATTG CTTTGACAGT CTGTCAAGTT TGGAGGCAAA GTCTTTGTAA TTGGTGTCGG
ACCTTCAGAG CAAAGCGTGA GTCTTTTCCT TACTTCTATT CCCGCCATGA ATAGTCAACA
GCTAATACTA TCTTTGTCAG TACCCATTCG GCTATTGTAG CGCCAACGAG ATCGATCTCC
AATTCCAGTA CAGGTACAAC AATCAAGTCA GTAATTTCCG CATACCTTAT CAAGCTCCCT
AGAAAAGCCT TGCTGATGTA AGCCTAAAAA CTCCTGCAGT ACCCGAAAGC CATTCGACTC
GTCGCTGGCG GGCTTGTCGA CCTGAAACCA CTTGTCACCC ACCGTTTCGC TTTGAAGGAG
GCTGTTAAGG CTTTCCACGT CGCCGCTGAT CCCTCTCAAG GAGCTATCAA GGTTCAGATC
CGTGATTAGT CGGATTAGTC AGTGGATTTT TCGTGATGGT AAAAAGGTGT GATCTTGGCT
TGTTAGATAA TTAAACAGTC TTAGAGTTAG GTATACAGTA TAGATATCAT GGAGTTCTTC
CTGAGATTCC AGATGCCATG ATCATGAAAT
 
Protein sequence
MTIAATSIPT TKYEAHYDPT KVIQHPEFQT LSEDAPELSD PKLNIACAYN PAHEIHMVKK 
PRFEPGPGEV TIHVRATGIC GSDVHFWKHG HIGPTMIVTD ECGAGHESAG EIVAVGEGVA
QWQVGDRVAV EAGVPCGLAS CDPCRTGRYN ACPAVVFFST PPYHGTLTRY HNHPAAWCHR
LADNVSYEEG SLCEPLAVAL AGLDRAGVRL GDPIAICGAG PIGLVTLLAA HAAGCTPIVI
TDLFPSRLEF AKKLLPTVKT VQIEKTAKPE EVAKQIKGAA GMQLSLAFDC TGVESSIRSA
IFSVKFGGKV FVIGVGPSEQ SYPFGYCSAN EIDLQFQYRY NNQYPKAIRL VAGGLVDLKP
LVTHRFALKE AVKAFHVAAD PSQGAIKVQI RD
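
The sequences in this section are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first display line of the gene sequence is used as a sample.

```python
def clean_sequence(block: str) -> str:
    """Join a display-formatted block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, used as a small sample.
sample = "CAACAATGAC CATTGCGGCT ACATCTATCC CTACCACCAA GTACGAGGCG CACTACGACC"
seq = clean_sequence(sample)

# Sample length, 0-based index of the first ATG, and GC fraction of the sample.
print(len(seq), seq.find("ATG"), round(gc_fraction(seq), 3))
```

Running `gc_fraction` on the full joined gene sequence should give a value comparable to the GC content listed under Gene Information, assuming that figure was computed over the same 1710 bp span.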