Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND00940 |
Symbol | |
ID | 3257365 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 289133 |
End bp | 292035 |
Gene Length | 2903 bp |
Protein Length | 687 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256030 |
Product | conserved hypothetical protein |
Protein accession | XP_570402 |
Protein GI | 58266492 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.292242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTATCTCTTT TTCTTTCTCG TTTTCTTCTC TCTTCTCCAC CAAAAATCCC TTCTATTGAC TTCAAAATGG CCAACTTTTC CAGCAACGAC ATCACCGCTG TCAAGTTAGT CTTTTCTTTT CCCTAATAGC GTGGCCTTTT TGCCCTCTTT CGCACTGGTG CTGACTTCTC TTTTTTTTCG TAGCACCATC CGAACCCTTG CCGCTGATGT TGTGGCCAAG GCATGTCATT TTTCTACGAA GGAGCATACA TTGAATACAA GACTAACGGA CATGTGCAGG CTAACTCTGG TCACCCCGGT GCCCCCATGG TGAGTCCGCG CTTCTGCTCT TCTAATGAAG TGGTTGCTCA TCAATCTGTT GCATATAGGG TATGGCCCCT GCTGCCCACG TGCTCTTCAC CCGATTTATG AGGTTCAACT CCAAGAACCC CAAGTGGATT AACAGGGACC GATTCGTCCT TTCTAACGGT CATGCGTAAG TAATCGCTAG CATACACCAG ATATGACGTA CAGTGGCTAA TCGTTGATGT TTTCCAGTTG TGCTCTCCAG TACATTCTTC TTCACCTTGC TGGCTACGAA GTCTCTATGG AGGACCTCAA GCAATTCCGT CAGATTGACT CTATCACCCC CGGTCACCCC GAAGTTGGTG TTACCCCCGG TATTGAGGTT ACCACTGGTC CCCTTGGTCA GGGTAAGGCT CTCCAATCTA TGAAACCATG TCTGCGACTG ATAGGCATAC TTCTCCATAG GTATTGCCAA CGCTGTCGGT CTTGCCATTT CCCAGGCCCA CATGGGTGCC GTCTTCAACA AGGAGAACTT CTCTTTGATT GACAACTACA CCTACTGTTT CCTTGGTGAC GGTTGTCTCC AGGAGGGTGT TGCCTCTGAG GCCTGTTCTC TTGCCGGTCA TTTGAAGTTG GGTAACTTGG TTGCCATCTA CGATGACAAC AGTAAGTATG ACGGTATTCA GTTGTTAATG TATTCTGACG ATTTCCACAG AGATCACCAT TGATGGTGAC ACTGCTGTGT CTTTCACTGA GGACGTCGAG ATGCGATTCA AGTCTTACGG CTGGAATGTC CTCCACGTTG AGAAGGGTGA CGAGTGAGTG TCCTTTTAGT TCTCATAATG ACACAGTCTA AACCTTTTTA GTGACCTTGC TGCCATCGAG AAGGCCATCA CCGAGGCTAA GAAGAGCAAG GATGCCCCTA CCATCATTAA CCTCAAGACT ACCATTGGTT TCGGTTCTCT TCACGCCGGT GGCCACGACG TCCACGGTTC TCGTAAGTTC TACCAATTCA CACTCAGCAA TGCATATCTG ATTTTGCTTC CCAAGCTCTC AAGAAGGATG ACATTGTTCA GCTCAAGAAG AAGTTCGGCT TTAACCCCGA GGAGACCTTT GTCGTTCCCA AAGAGACTTC CGACCTCTAC CACAAGGTTG CCGAGAGCGG TGCCAAGGCC GAGGCTGAGT GGCAGGCTCT TTTCAAGTCT TATAGCGAGA AGTACCCCAA GGAGGCTGCT GAGCTTCAGC GACGAGTTGA AGGCCGTCTT CCCGACGGTT GGGAGAAGGC TCTCCCCACC TACACCACCT CTGACGCTGC TGTCGGTTCC AGGAAGTTGT CCGAGACTAC TATCACCAAA CTCGTTGAGG TGTTGCCCGA GTTGGTCGGT GGTTCTGCCG ACTTGACCGG TTCCAACTTG ACCAGGTGGA AGGGCGCTGA GGACTTCCAG CACCCCTCTA CCGGTCTCGG AAGCTACGCT GGCCGATATT TCCGATTTGG TGTTAGGGAG CACGGTATGA CTGCCGTCTG TAACGGTATC GCTGCCTATG GTGGTATCAT TCCTTTCACT GCCACTTTCC TTAACTTCGT CTCTTACGCT GCCGGTGCCG TCCGACTTTC CGCCTTGTCT CACCTTCGAG TCCTCAACGT TGCCACCCAC GACTCTATTG GTCTCGGTGA AGATGGGCCT ACCCACCAGC CCGTCGAGAC TGCTGCTTGG CTCCGAGCTA TCCCTAACCT TGCTTTCTGG AGGCCCGCTG ACGGTAACGA GACTTCTGCC GCCTATCTTG TCGCCATCTT GTCTCAGCAC ACTCCTTCCG TCTTCGCCTT TTCCCGACAG AATGTAGGTC GAATTTGAAA GTAAAAACAT TGGCTGGATC TGACGTTTCC TTTTGTATAG TTGCCTCAAC TCGCCAACTC TTCCATTGAG AAGGCTACTA AGGGTGGTTA TGTTGTTGAG GAGGTTGAGA ATGGTTAGTC AGATTTCGGT TGATTAAACA CGGATATGCT AACGCGTTTA TAGCCGATGT GACCCTCGTC TCCACTGGTT CTGAGGTTAT CCTTTGTCTT CAAGCTCTTG AGCAGCTCAA ATCCAAGGGC ATCAAGGCCC GATTGGTTTC CTTGCCTTGT TTCGAGGTCT TCGTATGTGC TCTGCTTCGG AATAAAAAAG GATACGTAGG AACTAATTTT CTGTACAGAA CAACCAGCCC AAGGACTACA AGCTCAGCGT TCTTCCTTCC GGTGCTCCCA TCCTCTCAGT CGAGGCTTAC TCTACCTTCG GCTGGGGAAC TTACTCTCAC GACCACTTCG GTCTCAAGGC TTGGGGTGCC TCCGGTCCTT ACAACAAGGT CTATGAGAAG GTACGTACAA TTGTTTTTTT TAAGTTAGGG ACGCTTGTTG ACAGCATGCC TTTCCAGTTC GACATCACTC CCGAGGGTAT TGCCAGGAGG GCTGAGAAGG TTGTTGACTT CTACAAGAAG CGTGGCCAGC CCGTATTCTC TCCTTTGATC TCTGCTTTGG ATGACATCTC TGAGTAAAGT TCAAAAGAGT TGAGTGAAGG TCGTAAGCAG GAGTTAGTAG TATACACACA TCATAGAACA ATGTAGTGTT TCCGGATTTG TCA
|
Protein sequence | MANFSSNDIT AVNTIRTLAA DVVAKANSGH PGAPMGMAPA AHVLFTRFMR FNSKNPKWIN RDRFVLSNGH ACALQYILLH LAGYEVSMED LKQFRQIDSI TPGHPEVGVT PGIEVTTGPL GQGIANAVGL AISQAHMGAV FNKENFSLID NYTYCFLGDG CLQEGVASEA CSLAGHLKLG NLVAIYDDNK ITIDGDTAVS FTEDVEMRFK SYGWNVLHVE KGDDDLAAIE KAITEAKKSK DAPTIINLKT TIGFGSLHAG GHDVHGSPLK KDDIVQLKKK FGFNPEETFV VPKETSDLYH KVAESGAKAE AEWQALFKSY SEKYPKEAAE LQRRVEGRLP DGWEKALPTY TTSDAAVGSR KLSETTITKL VEVLPELVGG SADLTGSNLT RWKGAEDFQH PSTGLGSYAG RYFRFGVREH GMTAVCNGIA AYGGIIPFTA TFLNFVSYAA GAVRLSALSH LRVLNVATHD SIGLGEDGPT HQPVETAAWL RAIPNLAFWR PADGNETSAA YLVAILSQHT PSVFAFSRQN LPQLANSSIE KATKGGYVVE EVENADVTLV STGSEVILCL QALEQLKSKG IKARLVSLPC FEVFNNQPKD YKLSVLPSGA PILSVEAYST FGWGTYSHDH FGLKAWGASG PYNKVYEKFD ITPEGIARRA EKVVDFYKKR GQPVFSPLIS ALDDISE
|
| |