Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG00690 |
Symbol | |
ID | 3258788 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 189265 |
End bp | 192157 |
Gene Length | 2893 bp |
Protein Length | 691 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257686 |
Product | trehalase precursor, putative |
Protein accession | XP_571789 |
Protein GI | 58269266 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.689154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTTGA GCGGAAATGG ACTGACCAGT AAGACAGACA TTTGTAGACA AGGTATTGTT CATCTTGTGA TACTGGGAGG TATATCTTGC AGCTGATACC CATTGAACAT AGCCAACAGC GAAGACTCTT AACGAAACTT TGTCCGCATG GGAGGCTCTA GGTGACAATG TCACAGTCGG AGATGTGGAA ACATTTGTCG AGCAATACTT CGTCCGTGTC TTCCCCTTAA TTTGTTATCC GAAGTTGGAA TTGATTTATC TCTTTGATGG CAGAAAGGAG AAGGTCTTGA GCTTAGCCAA GTTGAGCTCG AAAACTTTGT TGAAGACCCT GCTATACTTG ACAACATTAC CGATCCCGTC TTCAGAGCTT GGGTAAAGAT TGTGAATGGA TACTGGACTC TCCTCGCCAG GTGAGACACG GCTTCGAAAG GTGCAAAGCG CTAGCTAATG ATTGTGATAC CTTCATCAGG GAGACCAACC AATCGGCCCT TTGCAATGGA GACTGCGAGT CAAGTTTGAT TCCTCTGAAC CATACTGTTA TCGTTCCTGG CGGGCGATAC AGGGAAATAT ATTATTGGGA TTCTTTCGTG AGCGGCGCCT TTCACACATA TCGCATGATT TTGGCTCATC TTTTGTATAG TGGGTGCTGG AAGGTCTTCT CAAGTCTGAG CTGTACGACT ATGCCTGGGA TTTACTACAG AACTTTATGG ATCTCATTGA TGTCAGTTCT TTTTGTCTGA ATGAGTGAAG AGACTGTTGT TGATAATGTC CGCAGATCTA TGGGTATCTT CCCAACGGCG GGAGAAAGTA CTATCTCAAT CGTTCTCAGC CTCCAGTATT TGTCCAGGTA TGTCGCAATA TATAATACTT TCATTATTAT ACGGATGCTG ATGAAGCGCA GATGATCGAT GCTTACATCA AGGCCACTAA CAGCATTACT CTTCTTGAGC GAGCTCTCCC TGTAGCTTCA GTTCGTATGT CCCTTCGTTG ATACGACCGT TCAGCTGACT ATAAGACATC AGTCCGAGTT AGAATGGTGG GCAAATAACA GAACCTCAAA TTTCACGTCA CCCTTCACCA ATCAATCCCG CACTATTGCT CAATATTCGG TCACTAACAG CGCTCCTCGA CCAGAAGTAT GTCCAAAGTC CTGTCCCTTC CAAGCAATCT CTCTCTGATA AGCTACCTAG GGTTATGTTG AGGACTTCGA AACGGTGATG GGAGCTTCCC CAGCCCTCAA CGAAACTGAA CAAGCCGAGT TGTACTCGGA GCTCGCTACT GGCGCCGAGT CAGGCTGGGA CTACTCCTCA CGATGGTGCG AGCAACCACT CCTTAACACA ACAGATAACA ACCCTTCCTT AAGGACCTTG AAAGTCAAGT CAATCATCCC TGTTGATCTA CTGAGTCTGA TGGCCGGAGA CCATGCCCTC GTGAGTTCAT CACTGATATA GAGGATGGGA TTATGGCGCT GACATAGGTT TAGCTGGCCA ATTTGTATGA GCTTTATGCA AACAGTACTG GGGGTGGAGA AGGGACAGGC AATGAGGAAA TGTCAAAGAG GGATGGGGAA TCTGATGATG CAGCGAGCAA AATTGCATAC CATCGTCAGA TGGCCCAAGA GTTCAGCGAC TCGATCCTCG ATCTCTGCTG GGACCCAGAA AAGGTGAGCC TTGGAACCCA TTTGGTACCG AAAAAGTGCT TACAGAGGGG ACTCGCATGC AGTCATGGTT CTACGACTTT AACGTGACTT CAAACTCTCG CTCCAACATC TTCCACGCGG GTGGCACCTG GCCACTTTGG CAAAACATTA CTCCATCCGA AATTATGGGC AACGAAAGTG CGGCTCTTTC TTTAGTTTCA GGATTTAGGT TCCTTTTGGG TCACTACTCA GGGGTCCCAA GTGTGGCTAC TCTGCTGTTT ACTGGACTGA ACTGGGTACG TTCCGAGGAT TTTTCGCGCG GAAAGAGGGT TGCTGATTGA TGGGCAGGAT TTCCCTAACG CCTGGCCGCC CCATGCGTGT AAGTCAAGAG AGATGGCCAC TAGCTGTAAA AGACATCCAT TAACATCGGA ATAAAAACCA CAGATACCGC CATCAAAGCT TTTGAGACAC TTGGTCGTGT ATTGCCCAAT GCCACTGTCC TTTCCAACTT GACGATCCCC TTCGATTCAG TGACCGAGAA CCAACTCGGT CTCTCAGAAT CCGAGCTCCA ACCACAACCC CAATCCACCA TTGGTAACGT CTCTCTGAAC ACCGAGACCT CCCAAGACAA GCCCTGGCCT CTTGCTCTCT CAATTGAATT TGCGAACAGG TATTTGGGAG CCGCATTCTG TTCATGGTAC TCCACCGGAG GGCAAATTAG CGGATTATTG ACACAGTTGC CGTTGAGCGA CTTGAATGCT ACTGGAACCT ATACTTCTGA GCAATCAGGT AAGGGGTTTT CGGAGCTGTG ATTCATATCA GCTGACAATA AATTGCCAGG CGTGATGTTC GAAAAGGTGG GCTTACCTTC GCATATAGGG TTATATAGGT TTGGAGCAAA ACTAACACGT CGGTAGTTCA ATGTTACTGA CACAGATGCC GCTGGAGGAG GTGGTGAGTA TACAGTCCAA GTCGGATTCG GTTGGACAAA CGGAGTAGCC CTTTGGGCCG CTGGCGAGTA CGGACAGTAC ATCCCTGCAC CCACATGTCC CCTTATTCCG ATCATCGAAG TCAATGGGAC GGCTGGTTCC AATACCTCTG ATAGCTCGGT ATACAAGTCG ACGGATAAGG ATGGCGGTCC GACAGCGAGT GACACCACTA CGTCCAAGAG CTTGTTTGTC GGATACCGAA TCCCGAGAGA GTAGGCTCGC CCGCGAAAGG ACTCTTATGA ATCAATCCTT TTATAATTTG CTT
|
Protein sequence | MILSGNGLTS KTDICRQGIV HLVILGAKTL NETLSAWEAL GDNVTVGDVE TFVEQYFKGE GLELSQVELE NFVEDPAILD NITDPVFRAW VKIVNGYWTL LARETNQSAL CNGDCESSLI PLNHTVIVPG GRYREIYYWD SFWVLEGLLK SELYDYAWDL LQNFMDLIDI YGYLPNGGRK YYLNRSQPPV FVQMIDAYIK ATNSITLLER ALPVASSELE WWANNRTSNF TSPFTNQSRT IAQYSVTNSA PRPEGYVEDF ETVMGASPAL NETEQAELYS ELATGAESGW DYSSRWCEQP LLNTTDNNPS LRTLKVKSII PVDLLSLMAG DHALLANLYE LYANSTGGGE GTGNEEMSKR DGESDDAASK IAYHRQMAQE FSDSILDLCW DPEKSWFYDF NVTSNSRSNI FHAGGTWPLW QNITPSEIMG NESAALSLVS GFRFLLGHYS GVPSVATLLF TGLNWDFPNA WPPHAYTAIK AFETLGRVLP NATVLSNLTI PFDSVTENQL GLSESELQPQ PQSTIGNVSL NTETSQDKPW PLALSIEFAN RYLGAAFCSW YSTGGQISGL LTQLPLSDLN ATGTYTSEQS GVMFEKFNVT DTDAAGGGGE YTVQVGFGWT NGVALWAAGE YGQYIPAPTC PLIPIIEVNG TAGSNTSDSS VYKSTDKDGG PTASDTTTSK SLFVGYRIPR E
|
| |