Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND00040 |
Symbol | |
ID | 3257266 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 9444 |
End bp | 12501 |
Gene Length | 3058 bp |
Protein Length | 673 aa |
Translation table | |
GC content | 45% |
IMG OID | 638255943 |
Product | transketolase, putative |
Protein accession | XP_570357 |
Protein GI | 58266402 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.214798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTATTTTGA CTGCGTATTA TGACTGAATC TCCCACTTTC ACTTACGACT TCTTGCATGA TACCCCACAC TACGTATGCT ATCCCGCTTC CTATATAGCC AATTTCCCAC TGTCAGTCGA CGCAACGATC AGCTTTTCCT CTTTTCTTTC TTATAATCAT CTATCTATCT AGTTCGTCGT TGTATTCAAA AATAGAAATT ACCATACAGT AGCCAGAATG TCTGTCAATG TGAGTATCCA TGGTCGGTGG ATATTGAATT TTATACTATG TGTGAATTAT TCCCCCCCGC TGATGTTCCC CATGCAGAAA GTTCAAGACT GGGATTTCGA AAAGTTCCCC ATCGACCTCA AAAAATACAA ACCTTTCCCT CTTGACCCTA CCAAGGACAA GAAGCTTTCA CAAGAGCAAA AAGATGGTTT GGTAAGTGTT CAGTAGGCAA ATCAATAATT TTTTGGTTGA TACACTGATC TCTGTCTGAC TTTAGATTGC CAACATCTCA TTGTTGCGTG ATGTGATTGT CTTCTTCACC GCGACAGGTG CTGCTAGGGG TCTGGCCGGT CATACTGGGT GAGTTTCTCA TCTACCGTCT GGTCAGCATC TCCGCGATCT TTATGTTAGA ACAACCTCTG TTCCGTTATA GAAATTGCTG AGATATTATG TTTAGAGGAG CCTTTGACAC CATCCCCGAA GTTGTGATCC TTCTTTCCTT CCTCCTTGCC GACCAGGACA AGTCGAAATA CGTTGATATT CTCTTCGACG AGGCTGGTCA TCGTGTCGCG ACTCAGTACC TTCTCTCTGC TCTTGACGGT CACATTCCAG TTGAGCATCT TCTCCACTAC CGAGAAGCTA ACTCCAAGCT CCCTGGTCAT CCTGAGCTCG GTCTCACTCC CGGCGTCAAG TTCTCTTCTG GACGATTAGG ACACATGTGG CCCTTGGTCA ACGGTGTGGC TTTGGCCGAG AAGAACAAGG CCGTATTCAT ACTTGGGTCA GATGGTTCTC AACAGGAAGG CGATGACGCC GAAGCTGCCA GATTGGCTGT CGCTCAAGGA TTGAACGTGA AGCTCTTCGT TGATGACAAT GATGTGACCA TCGCGTGAGT GGGGTTAATC TTCTGAGCCT TATAAAGAGC CTGTACGCTG ATAATCATCC AGTGGTCACC CATCTGAGTA CCTCAAAGGA TACAGCGTCG CCAGAACTTT GGAGGGTCAT GGACTCAAGG TCGTTGAAGC TAACGGTGAA GACCTTGACT CTCTTTACTC TGCCATCGTC GAGGTCATGA ACCACAAAGG TCCAGCTGCC GTAGTCACCC ACAGGCCTAT GGCTCCTAAG ATCAAGGGTA TCGAAGGAAG TCCTCACGCT CACGACGCCA TCAAGGTGTA GGTGATACGT CTGTAAGCCC GAACGTGAAA CCAATGCTGA TAATTATTCC TTTTGAGTGA ACCTGCCATC GAATACCTTG ACGCTCGACA CCCTAAATGC GCTGCTATCC TTCGAGCTAT TCAACCTTCC AACTACGCCG AGTTGCTTTC TGGTAGTACC AAGGAGAGGG GAGCTTGTCG AGTTCAATTC GGTGAAGCTG TCAGCGCCGT TCTTGATAAA ACTAGCAAAG AGCAGAACAA GGCAAAGGTG TTGGTCATCG ACTCCGACTT GGAAGGTTCT ACAGGTTTGA GTGTGATCCA CAAGAAACAT CCCGAGTGAG TTCATGACAC CTTAACTGCC TAATGTACCC ACATGTTTAC GTCCCATTAC AGAGTATTTT TGTCCAGCGG CATCATGGAA CGTGGGAATT TTTCTGCTGC TGCCGGTTGG GGTGCTTTCA ACGCCGATAG ACAAGGCGTT TTCAGGTATT GTTTTTTTTT ACCAATTTTA ATTCATCCAG AAGTTAATTG ACAGGGTCAC GCAGTACCTT CTCAGCCTTC TCTGAAATGA TCATCTCCGA ATTGACCATG GCTCGTCTCA ACTTTGCCAA CGTTCTCACT CACTTCTCGC ATTCTGGTGT CGACGAGATG GCTGATAACA CCTGGTGAGT ATTGGGTCAT CTGTTGATAT ATGTGCTAAT AGGCCATCCT AGTCACTTCG GTATCAACCA ATTCTTCCTC GACAACGGTC TTGAAGATGG GTACGAGACC AGGTTGTATT TCGCCGCTGA TTGCGTGAGT TACCACTCAA CGATGTTACT TGATTGAGCA AAAGCTGATT GTTAAACTTT AGTCTCAAAT GGATGCGTAA GTCACTGAAC TTTGCTGCCC TATGCTTTCT TTTCACATCC TCTCATACCT TACGCTATTG TTTCTTGTGT CAGAACTCAT CGCTGACATT TATGTAGAAT CGTCGACCGA GTCTTCTACG ACAAAGGTCT TCGATTCGTC TTTTCCACTC GATCCAAGGT CCCATGGATT TTGAAGGAAG ACGGATCCAG ATTCTTCGAC TCTGATTACA AATTCGTTCC CGGTAAAGAT GAAGTTATTC GAAAAGGCAC CAAAGGCTAT GTTGTAGCGT ATGGAGAGAT CTTATACAGA GCACTCGACG CCGTTGATCG TTTGAGAAAA GAGGGTTTGG ATGTCGGCTT GATCAACAAA TCGACACTCA ACGTAGTGGA CGAAGATATG ATCAAGGAAA TTGGTTCAAC TGAGTTCGTC TTTGTCGCCG AAAGTTTGAA CAGGAAGACC GGGTTGGGAA GCAAGGTAAG TTACATTATT GTCTATTGAT AAAATATGCT AACGGCAAGT ACAGTTCGGT ACTTGGCTTC TCGAACGAGA TTTAAGACCA AGGTACAATT ACATGTGAGT TAAGAGCGTC GAATTCAAGT GAACTCCTAC TGATCGCTTT AAACCACTAC AGCGGAACCA GCAAGGAAGG ATGCGGTGGG CTTGGTGAAC AAATTGGTCA CCAAAACCTC GGTAGCTCTG ATATCGCTCT CAAGGTGAAG CAAATGATCA AGTGAACCGA TCTTGGACAT CCATGTCACG ATGATGACAC TGTTGAAGGA ATAGATCAAG TGAGAAGCTT TTTGATCAAG TCAATTTTAT AAGTACAATT ATGCAAGACA TATGAATG
|
Protein sequence | MLSRFLYSQF PTFVVVFKNR NYHTVARMSV NKVQDWDFEK FPIDLKKYKP FPLDPTKDKK LSQEQKDGLI ANISLLRDVI VFFTATGAAR GLAGHTGGAF DTIPEVVILL SFLLADQDKS KYVDILFDEA GHRVATQYLL SALDGHIPVE HLLHYREANS KLPGHPELGL TPGVKFSSGR LGHMWPLVNG VALAEKNKAV FILGSDGSQQ EGDDAEAARL AVAQGLNVKL FVDDNDVTIA GHPSEYLKGY SVARTLEGHG LKVVEANGED LDSLYSAIVE VMNHKGPAAV VTHRPMAPKI KGIEGSPHAH DAIKVEPAIE YLDARHPKCA AILRAIQPSN YAELLSGSTK ERGACRVQFG EAVSAVLDKT SKEQNKAKVL VIDSDLEGST GLSVIHKKHP EVFLSSGIME RGNFSAAAGW GAFNADRQGV FSTFSAFSEM IISELTMARL NFANVLTHFS HSGVDEMADN TCHFGINQFF LDNGLEDGYE TRLYFAADCS QMDAIVDRVF YDKGLRFVFS TRSKVPWILK EDGSRFFDSD YKFVPGKDEV IRKGTKGYVV AYGEILYRAL DAVDRLRKEG LDVGLINKST LNVVDEDMIK EIGSTEFVFV AESLNRKTGL GSKFGTWLLE RDLRPRYNYI GTSKEGCGGL GEQIGHQNLG SSDIALKVKQ MIK
|
| |