Gene CNE00140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE00140 
Symbol 
ID3257936 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp26666 
End bp29748 
Gene Length3083 bp 
Protein Length720 aa 
Translation table 
GC content49% 
IMG OID638256596 
Producttransketolase, putative 
Protein accessionXP_570699 
Protein GI58267086 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGTG GCATTGACAT CACTGCTCAA ATCCCGAGTC TTGCCCAAAG CTCCTCGGTC 
CGCTCAAAGC TACTTTCGTG AGCCCATGCT CAGCAACTTC AAATCCGGAA CGCAACGGTA
GTTCGTAATC ATGCGCGCGG TCCTGGACCC GCCTCGGTCC GAGAGAACGC TTGCCGGATT
TGTTGTGTTC GGTGTGCGGG CAAAGTTGTA TACTTTCCTC ATAGATAACA GTACTTAATT
GACTGACAGT CTGCACTGTT GATCTTGAGA CAGTGAACAA TACATGACAG GATGACGGCC
ACCAGAAACA CGAATGGGCA CGGAGAGCTG AACCGGGAGG CTCACACAAC CAACAAAGTA
GTTTCCGTGA GTGCGCTTTT CGACTGCCGC CGAGTTGGAC TGACATATTG GCAGGAAAAA
GAAGAACAAT TGGTGCTGAA CACGATACGA TGCCTTGCAG CAGACTTGTG TCAGCAGGTA
TGTTCAGTTT CCGGCGTCCG CAAGATTTTG ATTGCTAAGT TCTCGCAGTA CAAAGGCGGT
CATCCCGGTA CTGTTATGGG TGCCTCAGCC ATAGGGATTG CCTTGTGGAG ATATGAGATG
AGATACAACC CTCTAAACCC CGACTGGTTC AATCGAGACC GTACGTTGAG CCGTCGAAAA
TTGAGGAAGC TGAGCCCAGA TAGGCTTCGT CCTTTCGGCT GGACATGCTT GTCTTTTCCA
ATATATTTTC CTCCATCTTT CCGGCTACGA AGCTTGGACA TTGGACCAAA TCAAGATGTA
TCACTCCCCA GCGACTTCGG GATCTATGGC CGCAGGACAC CCTGAGATAG AATACCCAGG
CATGTGGAAT CTCAGCAAAG CGATGATGAT ATGATTGACA ATTGCAGGTA TCGAGGTGAC
AACTGGACCC CTCGGACAGG GTATCTCTAA CGCGGTCGGG ATGGCAATTG CATCAAAGCA
GCTTGCGGCG ACATACAACA GGGAAGGGCT GGATATAGTG GACAACAAAA TCTGGTGTTT
CACCGGTGAT GGCTGTTTAC AAGAAGGTGT TGGCCAGGAA GGTATCTCTT TCTGACCGGA
AATAAAGAAG TCTGGGTGCT AACGTTCTGC AGCCATTTCG CTGGCAGGGC ATTTGGGATT
GGATAATTTG ATCCTGGTAT ATGACAACAA TGCTGTCACG GTAGACGGCC GAATCGACAA
CTGTTTTACC GAGAACACTT CGAAAAAACT GCAAGCTCAA GGCTGGAATG TCATCGACGT
CTATGATGGC TCGAACGATG TGAGTGTTGA ATGCTTAAGC CAGTTCACGA CTCATGCGGA
TAACAGCTTG CCGCGATCCT AGAAGGGTTT GACAAAGCCA AGCAGTTCAA AGGGAAACCA
AGTCTGGTTA ATATACGTAC AGTCATTGGT TACTCCTCAC GAAAAGCCAA CACTGGACCG
GCCCATGGTC AAGCCTTGGG AGACGATGAA GTGGCATATG TCAAAACTCA ACTCGGATTC
GATCCTGCTC AGAAATTCGT TATTCCCGAC AAGGCATACG AATATTTCGC TGAATGCAAA
ATCAAGGGTG CAAAGGCAAA CGAAGAATGG AACGAAAAAT TCGAAGCCTA CTCAAAGGCG
TATCCAGACT TGTATAAACA GCTGACTGAT CGAATGAACG GCAGATTTGC TCCAGATGGT
TGGGAAACTC TGTTACCGGC GAAAAAGGAT CTTCCGCAAG GGGACCAGCC TACCAGGAAG
TCAAGTGGAA TCGCAGTTCA AACTTTGGTG CCCAAGAATA ATACCTTCAT CGCTGGTTCA
GCGGACCTCC TCGAATCCAC CTTTGTCAAT TTCGATGGTC AAGTAGAGTT CCAGAACGTG
GGTTTTTTTA AAACTGCAGT TTTGAAATGT TTGATTTGAA GGCTGACACT GTCCATAGCC
TTCTTCGGGT CTGGGAGACT ACACTGGCAG ACAAATTCGC TTCGGCATTA GGGAGTTTGC
AATGGTGGGT TTGGGCAACG GTATGGCAGC ATATCACAAG GGAATGTTCA TCCCGTGAGT
GACCAATTGC CTATTATTTC AACTGCTTAC ATTTCATAAG TATCATGTCT ACGTTCTTCA
TGTTTTGGAT CTACGCTGCC CCGGCCGCCC GAATGGCCGC TTTACAACAT CTGCGATTCA
TCGGAGTTGC CACTCATGAT TCCATCGGTA TAGGGGAAGA TGGTAGGTAT TCCCTTCAGC
CTTGGCATCC ATTTTTGCTG AATAAATATT ACTTGGTAGG TCCCACCCAT CAGCCCATCG
CTCTGGCTGC TTTCTACAGA GCCTTACCGA ACATCAACCT TATCCGACCT GCCGATGCAG
AGGAATGTAT GGGAATGTGG CTCTTGGCTC TATCGGATCA ATCAGCAAAC ACCCCCAGCA
TCTTCGCCTT GTCTCGACAA CCCGTGCCCC TTCTTTCCGG AACCGACCGT ACAAAGGTCG
CACAAGGGGC TTACATTGTT TACGGTGATG ACAGAAGTCC GGATATGACG ATTATTGCAA
CCGGTGCTGA AGTCGCCCGA GCAATCGAGT CTGCCAAGCT GCTGAAGTCG GTTAAGAAGG
TCCGAGTAGT CTCGATGCCT TCCCAGAAAC ATTTCGACAT TCAAACTGCA GAATACAAAG
AATTGGTTCT AAAGAGCTCC ACCGCCCTCG TGGTCGCAAT TGAAGCATGG GCTTCTTACG
GTTGGGCTAG ATACGCACAT GCTTCGTTGT CCATGCAGAC TTTCGTAAGG CTATTTCCCC
TCTGGCATTT GATCCTCAGC TCATAAATCG CGATAGGGTC ACTCGGCACC TCAACAGCAG
CTGTATGAGC ATTTCGGATT CTCTCCGAAA GCCATGGCAG AGAAGATTGA TGAGTGGGCG
GCCAGGTGGC AAACGAAAGG AGGCCGGCCA GGACTTGGTG ATTTCGAAGA GCTGTTGCTG
CGCTATACGC AACATTGAGT CAGAGATTCA AGGTTCCATG GTGATTGCTT GTTAAATGGC
ATTGACACGC AGTGCTTAGG TTGCGCAGAT GCTGGGATTA TGTTATATGA GTACTGAGAT
TGTGCTGCAT GAACACTCGG AAT
 
Protein sequence
MTATRNTNGH GELNREAHTT NKVVSEKEEQ LVLNTIRCLA ADLCQQYKGG HPGTVMGASA 
IGIALWRYEM RYNPLNPDWF NRDRFVLSAG HACLFQYIFL HLSGYEAWTL DQIKMYHSPA
TSGSMAAGHP EIEYPGIEVT TGPLGQGISN AVGMAIASKQ LAATYNREGL DIVDNKIWCF
TGDGCLQEGV GQEAISLAGH LGLDNLILVY DNNAVTVDGR IDNCFTENTS KKLQAQGWNV
IDVYDGSNDL AAILEGFDKA KQFKGKPSLV NIRTVIGYSS RKANTGPAHG QALGDDEVAY
VKTQLGFDPA QKFVIPDKAY EYFAECKIKG AKANEEWNEK FEAYSKAYPD LYKQLTDRMN
GRFAPDGWET LLPAKKDLPQ GDQPTRKSSG IAVQTLVPKN NTFIAGSADL LESTFVNFDG
QVEFQNPSSG LGDYTGRQIR FGIREFAMVG LGNGMAAYHK GMFIPIMSTF FMFWIYAAPA
ARMAALQHLR FIGVATHDSI GIGEDGPTHQ PIALAAFYRA LPNINLIRPA DAEECMGMWL
LALSDQSANT PSIFALSRQP VPLLSGTDRT KVAQGAYIVY GDDRSPDMTI IATGAEVARA
IESAKLLKSV KKVRVVSMPS QKHFDIQTAE YKELVLKSST ALVVAIEAWA SYGWARYAHA
SLSMQTFGHS APQQQLYEHF GFSPKAMAEK IDEWAARWQT KGGRPGLGDF EELLLRYTQH