Gene CND00040 details

Gene Information

Locus tag: CND00040
Symbol:
ID: 3257266
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006686
Strand:
Start bp: 9444
End bp: 12501
Gene Length: 3058 bp
Protein Length: 673 aa
Translation table:
GC content: 45%
IMG OID: 638255943
Product: transketolase, putative
Protein accession: XP_570357
Protein GI: 58266402
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID:
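
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it reproduces the listed Gene Length); `feature_length` and `coding_bases` are hypothetical helper names, not part of any database tool.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Number of coding nucleotides needed for a protein of the given length."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = feature_length(9444, 12501)
cds_len = coding_bases(673)

print(gene_len)            # 3058
print(cds_len)             # 2022
print(gene_len - cds_len)  # 1036
```

The coding portion is shorter than the gene span. The bases left over would be introns and any untranslated sequence, which fits the record marking the gene as spliced.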


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.214798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTATTTTGA CTGCGTATTA TGACTGAATC TCCCACTTTC ACTTACGACT TCTTGCATGA 
TACCCCACAC TACGTATGCT ATCCCGCTTC CTATATAGCC AATTTCCCAC TGTCAGTCGA
CGCAACGATC AGCTTTTCCT CTTTTCTTTC TTATAATCAT CTATCTATCT AGTTCGTCGT
TGTATTCAAA AATAGAAATT ACCATACAGT AGCCAGAATG TCTGTCAATG TGAGTATCCA
TGGTCGGTGG ATATTGAATT TTATACTATG TGTGAATTAT TCCCCCCCGC TGATGTTCCC
CATGCAGAAA GTTCAAGACT GGGATTTCGA AAAGTTCCCC ATCGACCTCA AAAAATACAA
ACCTTTCCCT CTTGACCCTA CCAAGGACAA GAAGCTTTCA CAAGAGCAAA AAGATGGTTT
GGTAAGTGTT CAGTAGGCAA ATCAATAATT TTTTGGTTGA TACACTGATC TCTGTCTGAC
TTTAGATTGC CAACATCTCA TTGTTGCGTG ATGTGATTGT CTTCTTCACC GCGACAGGTG
CTGCTAGGGG TCTGGCCGGT CATACTGGGT GAGTTTCTCA TCTACCGTCT GGTCAGCATC
TCCGCGATCT TTATGTTAGA ACAACCTCTG TTCCGTTATA GAAATTGCTG AGATATTATG
TTTAGAGGAG CCTTTGACAC CATCCCCGAA GTTGTGATCC TTCTTTCCTT CCTCCTTGCC
GACCAGGACA AGTCGAAATA CGTTGATATT CTCTTCGACG AGGCTGGTCA TCGTGTCGCG
ACTCAGTACC TTCTCTCTGC TCTTGACGGT CACATTCCAG TTGAGCATCT TCTCCACTAC
CGAGAAGCTA ACTCCAAGCT CCCTGGTCAT CCTGAGCTCG GTCTCACTCC CGGCGTCAAG
TTCTCTTCTG GACGATTAGG ACACATGTGG CCCTTGGTCA ACGGTGTGGC TTTGGCCGAG
AAGAACAAGG CCGTATTCAT ACTTGGGTCA GATGGTTCTC AACAGGAAGG CGATGACGCC
GAAGCTGCCA GATTGGCTGT CGCTCAAGGA TTGAACGTGA AGCTCTTCGT TGATGACAAT
GATGTGACCA TCGCGTGAGT GGGGTTAATC TTCTGAGCCT TATAAAGAGC CTGTACGCTG
ATAATCATCC AGTGGTCACC CATCTGAGTA CCTCAAAGGA TACAGCGTCG CCAGAACTTT
GGAGGGTCAT GGACTCAAGG TCGTTGAAGC TAACGGTGAA GACCTTGACT CTCTTTACTC
TGCCATCGTC GAGGTCATGA ACCACAAAGG TCCAGCTGCC GTAGTCACCC ACAGGCCTAT
GGCTCCTAAG ATCAAGGGTA TCGAAGGAAG TCCTCACGCT CACGACGCCA TCAAGGTGTA
GGTGATACGT CTGTAAGCCC GAACGTGAAA CCAATGCTGA TAATTATTCC TTTTGAGTGA
ACCTGCCATC GAATACCTTG ACGCTCGACA CCCTAAATGC GCTGCTATCC TTCGAGCTAT
TCAACCTTCC AACTACGCCG AGTTGCTTTC TGGTAGTACC AAGGAGAGGG GAGCTTGTCG
AGTTCAATTC GGTGAAGCTG TCAGCGCCGT TCTTGATAAA ACTAGCAAAG AGCAGAACAA
GGCAAAGGTG TTGGTCATCG ACTCCGACTT GGAAGGTTCT ACAGGTTTGA GTGTGATCCA
CAAGAAACAT CCCGAGTGAG TTCATGACAC CTTAACTGCC TAATGTACCC ACATGTTTAC
GTCCCATTAC AGAGTATTTT TGTCCAGCGG CATCATGGAA CGTGGGAATT TTTCTGCTGC
TGCCGGTTGG GGTGCTTTCA ACGCCGATAG ACAAGGCGTT TTCAGGTATT GTTTTTTTTT
ACCAATTTTA ATTCATCCAG AAGTTAATTG ACAGGGTCAC GCAGTACCTT CTCAGCCTTC
TCTGAAATGA TCATCTCCGA ATTGACCATG GCTCGTCTCA ACTTTGCCAA CGTTCTCACT
CACTTCTCGC ATTCTGGTGT CGACGAGATG GCTGATAACA CCTGGTGAGT ATTGGGTCAT
CTGTTGATAT ATGTGCTAAT AGGCCATCCT AGTCACTTCG GTATCAACCA ATTCTTCCTC
GACAACGGTC TTGAAGATGG GTACGAGACC AGGTTGTATT TCGCCGCTGA TTGCGTGAGT
TACCACTCAA CGATGTTACT TGATTGAGCA AAAGCTGATT GTTAAACTTT AGTCTCAAAT
GGATGCGTAA GTCACTGAAC TTTGCTGCCC TATGCTTTCT TTTCACATCC TCTCATACCT
TACGCTATTG TTTCTTGTGT CAGAACTCAT CGCTGACATT TATGTAGAAT CGTCGACCGA
GTCTTCTACG ACAAAGGTCT TCGATTCGTC TTTTCCACTC GATCCAAGGT CCCATGGATT
TTGAAGGAAG ACGGATCCAG ATTCTTCGAC TCTGATTACA AATTCGTTCC CGGTAAAGAT
GAAGTTATTC GAAAAGGCAC CAAAGGCTAT GTTGTAGCGT ATGGAGAGAT CTTATACAGA
GCACTCGACG CCGTTGATCG TTTGAGAAAA GAGGGTTTGG ATGTCGGCTT GATCAACAAA
TCGACACTCA ACGTAGTGGA CGAAGATATG ATCAAGGAAA TTGGTTCAAC TGAGTTCGTC
TTTGTCGCCG AAAGTTTGAA CAGGAAGACC GGGTTGGGAA GCAAGGTAAG TTACATTATT
GTCTATTGAT AAAATATGCT AACGGCAAGT ACAGTTCGGT ACTTGGCTTC TCGAACGAGA
TTTAAGACCA AGGTACAATT ACATGTGAGT TAAGAGCGTC GAATTCAAGT GAACTCCTAC
TGATCGCTTT AAACCACTAC AGCGGAACCA GCAAGGAAGG ATGCGGTGGG CTTGGTGAAC
AAATTGGTCA CCAAAACCTC GGTAGCTCTG ATATCGCTCT CAAGGTGAAG CAAATGATCA
AGTGAACCGA TCTTGGACAT CCATGTCACG ATGATGACAC TGTTGAAGGA ATAGATCAAG
TGAGAAGCTT TTTGATCAAG TCAATTTTAT AAGTACAATT ATGCAAGACA TATGAATG
 
Protein sequence
MLSRFLYSQF PTFVVVFKNR NYHTVARMSV NKVQDWDFEK FPIDLKKYKP FPLDPTKDKK 
LSQEQKDGLI ANISLLRDVI VFFTATGAAR GLAGHTGGAF DTIPEVVILL SFLLADQDKS
KYVDILFDEA GHRVATQYLL SALDGHIPVE HLLHYREANS KLPGHPELGL TPGVKFSSGR
LGHMWPLVNG VALAEKNKAV FILGSDGSQQ EGDDAEAARL AVAQGLNVKL FVDDNDVTIA
GHPSEYLKGY SVARTLEGHG LKVVEANGED LDSLYSAIVE VMNHKGPAAV VTHRPMAPKI
KGIEGSPHAH DAIKVEPAIE YLDARHPKCA AILRAIQPSN YAELLSGSTK ERGACRVQFG
EAVSAVLDKT SKEQNKAKVL VIDSDLEGST GLSVIHKKHP EVFLSSGIME RGNFSAAAGW
GAFNADRQGV FSTFSAFSEM IISELTMARL NFANVLTHFS HSGVDEMADN TCHFGINQFF
LDNGLEDGYE TRLYFAADCS QMDAIVDRVF YDKGLRFVFS TRSKVPWILK EDGSRFFDSD
YKFVPGKDEV IRKGTKGYVV AYGEILYRAL DAVDRLRKEG LDVGLINKST LNVVDEDMIK
EIGSTEFVFV AESLNRKTGL GSKFGTWLLE RDLRPRYNYI GTSKEGCGGL GEQIGHQNLG
SSDIALKVKQ MIK
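
The sequence blocks above are printed in space-separated groups of ten characters. Below is a minimal sketch of how such a block could be joined into one continuous string and its GC percentage computed. It is plain Python, the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as input, so the result will not match the 45% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CGTATTTTGA CTGCGTATTA TGACTGAATC TCCCACTTTC ACTTACGACT TCTTGCATGA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence block through `clean_sequence` in the same way would give a string whose length can be compared against the listed Gene Length.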