Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64968 |
Symbol | NTH2 |
ID | 4851578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2195250 |
End bp | 2198411 |
Gene Length | 3162 bp |
Protein Length | 811 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393286 |
Product | Neutral trehalase (Alpha,alpha-trehalase) (Alpha,alpha-trehalose glucohydrolase) |
Protein accession | XP_001386768 |
Protein GI | 126274916 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.332926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0654828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATAGGTTGAA GTCTGGCTCT TCCCAGTGAA AGTAGTGCTG GTTTATTCTC TCCCGAAGTA AGAGCAGGTA TCTGGTGCGA GCTTCTGTAC CAATTTCTTT AAAATCAAGA TCATTCATAA ATATACCAGA CGGAAAGTGT CTTAAATACT TGAGATAAAT TGGAACTATA AAATCGTAAC GATATACGCA GTTCCGTCGA GGTCCGTATC TACAACTCTA AGCAATCCGA TTGTCACAGT CTATTGATTT TTGATCACAT CAATATCGCC AGCTTGTGTT ATCGCACTTA TTTCATCTAA ATTGCATCCC CGCGAAAACC GCTGCAGGCG CAAAAAAGAT TATTTATACA TACAGAAGAC CAGCTCTCCC TTTTTCTTAT TTACTTTCTC CCCCATAGAG AAAAGTATAG TTGAAGTTTT GCATTGCTAA AACACTCCTG ACTAGATCCG TTACACTTAT CGAACACCAA TCGTTGTTGC ATTTCAGTTA CTTCTTAGTT GTCTATAATC ATTATCTATC TCATCGAGTC CTATACATAC TTTATCTTCA TCCTGATAAT TGCCTGATAT AGTGATCTCT TCGTTTCGAG ATAGTTTCAC TTTTCACGTC ATATGATCAC CATTCTGCTC ATTTTATCTA ATATTCACAA TGTTTAACCC TTTCAGGCAC CGCAGGTCTT CTTCCAGCTC CGGCGACGAT GATCCCTTTG ACCTCGCAGA AAACTACTAT GGCCCCAAAA ACGCTCCCAT CTCCAAGATG GGCAGAGTGA GAACCTTCTC AGTTTTTGAA ACCAAACCAC GCAACCAGAT CTTTGACTTT GCCGAGGATG CCATCAAGGA ACAGTCTTCT TCGGCTTCTA CATCCCCATC GCCTACGCCC TTTGCAGAGA GCTTAAGCGA GGATAACGAG CCACGTCCCA GCGAAGCGGT GTTCGGCCGG AAGCCTTCCA TCGTGCCTAT TTATGCTGAC GACTCGTCTA CAGAGTCAAT CAACAGCATC GGACATCCTG TAAAGCCTAA AGCAATTCAT CACTTCAGAA GGTCTTCTGT AGACGACTCT TATCTCCGGC CCAAGAAGTT CTACATCAGC AACGTAGAAG CTACGCTTGA TGAGTTGTTG CATAATGAAG ATACCGACCG TAACTGCCAG ATCACCATTG AAGATACCGG GCCGAAGGTA TTGCGCTTGG GAACTGCCAA CTCTAATGGC TTCAACCAGG CGGCTATCAG GGGTACCTAT ATGTTGAGTA ACTTGCTCCA AGAGTTAACT ATTGCCCAAC GTTTTGGCAG AAAAGAGATG ATCCTAGATG AACAACGTCT TAACGAAAAC CCCGTTGCCA GAATGAAGCG TCTCATATCG ACCCAGTTCT GGAAGTCGTT GACAAGGCAG ATCACGGCAG AAAACATCGT AGACATGGCT AGAGACACCA AGATCAAAGA AACTTACGTA GACCAACACG GAGTCCTCCA TGAGGATGCG GAATCGCACC GTATATACGT CCCATACAAC CGTAAGGACC AATATGACTA CTATATGTCA ATCAAAGAAA GCAGACAGGA TGTCTTCTTT GATGTACAAT ACTTGCCGGA AGTCATAGAT TCGGTTTACA TAAGATCCAT CAACAAAAAG CCTGGTTTAC TTGCTTTGGC TTCTTCTCCA GACCCTAAGG ACCCCAAGAA AATGATTAAC CAGCCTTATG TTGTCCCCGG CGGTAGATTC AACGAGTTGT ACGGCTGGGA TTCGTACATG GAGACTATTG GATTATTAAC CGATGTCACT CCCAAGGACC AAGAACATCT CAGATTAGCC CGGGGCATGA CGGAAAACTT CATCTACGAG ATCCACCACT ACGGCAAGAT TCTTAACGCC AACAGGTCCT ACTATTTGGG CAGATCCCAG CCTCCCTTCT TGACGGATAT GGCTCTTAGA GTATTCGACA AGACTGTAGA AGTAGACCCG GGTAGGAAAG TAGAAGCGTT GGATTTTCTC AAGAGAGCGA CCGTAGCAGC CGTTAAAGAA TACAAGACCA TCTGGTGTGC CAAACCTCGT TTAGATGAGA AGTCTGGTTT ATCATGCTAC CACCCTGATG GTTTGGGAGT TCCACCAGAG ACTGAATCTT CTCACTTCAA CGCTGTGTTG AAGCCGTACG CAGAAAAGTA CAAGATTTCA CAGGACGAGT TCATCGAGAA GTACAATGCA GAGATCATCA AGGAACCTGA ATTGGACGAG TATTTCCTTC ATGACAGAGC TGTGAGAGAA TCTGGTCACG ACACCACGTA TAGATTGGAG GGTAGATGTG CTTATTTAGC TACTGTAGAC TTGAATGCTT TGTTGTACAA GTACGAGAAT GATATCGCAT TTATCATCCA GAACCACTTT GGGGGGAAGT TGGAATGCAA CGGAGAAATC GAGGAGCCGG AAGTTTGGGT AGCCAGGGCA ATCAAGAGAG TGGAACGAGT CAACAAGTAT TTATGGAACG AAGAAGATTC CATGTACTAC GACTATGACA TCAAGAAAGA AACGCAGTGC AAGTACGAAT CTGCCACTGC CTTCTGGCCA TTATGGTCCA AGTTGGCTAT TCAGGAACAA GCGGACAAAC TTGTCAAAAA TTCATTGGAC AAGTTCGAAG AATTTGGTGG TTTGGTCGCT GGTACGCTTA AATCTCGTGG AGACGTCAAT CTCTCGAGAC CTTCTCGTCA ATGGGACTAT CCTTATGGTT GGGCCCCTCA ACAAATCTTG GCCTGGATTG GGTTGTGCAA CTATGGGTAT GATGGTATCG CTCGCCGTTT GGCGTACCGC TGGCTCTACA TGATGACCAG AGCTTTTGTA GACTACAACG GGGTTGTTGT CGAGAAGTAC AACGTCACCG AGGGAGCTGT TCCACACAAG GTTGATGCCG AGTATGGTAA CCAAGGTTTG GACTTTAAGG GTGTAGCAAC TGAAGGATTT GGCTGGGTCA ATGCCAGCTA CTTGTTTGGG CTCACGTTGA TGAACTTGCA TGCAAAGAGA TGTTTGGGGA CGTTGACGCC GCCAAACGTG TTCTTGATGA ATATGCATCC CGTGCAAAGA GAGTACTTTC TGTAATTTCT CTGGTTGGTA TAATGAGATA TAACATATAA GAAATATATA AGAATTTAGC AAATGTAATG CTATATAATG AT
|
Protein sequence | MFNPFRHRRS SSSSGDDDPF DLAENYYGPK NAPISKMGRV RTFSVFETKP RNQIFDFAED AIKEQSSSAS TSPSPTPFAE SLSEDNEPRP SEAVFGRKPS IVPIYADDSS TESINSIGHP VKPKAIHHFR RSSVDDSYLR PKKFYISNVE ATLDELLHNE DTDRNCQITI EDTGPKVLRL GTANSNGFNQ AAIRGTYMLS NLLQELTIAQ RFGRKEMILD EQRLNENPVA RMKRLISTQF WKSLTRQITA ENIVDMARDT KIKETYVDQH GVLHEDAESH RIYVPYNRKD QYDYYMSIKE SRQDVFFDVQ YLPEVIDSVY IRSINKKPGL LALASSPDPK DPKKMINQPY VVPGGRFNEL YGWDSYMETI GLLTDVTPKD QEHLRLARGM TENFIYEIHH YGKILNANRS YYLGRSQPPF LTDMALRVFD KTVEVDPGRK VEALDFLKRA TVAAVKEYKT IWCAKPRLDE KSGLSCYHPD GLGVPPETES SHFNAVLKPY AEKYKISQDE FIEKYNAEII KEPELDEYFL HDRAVRESGH DTTYRLEGRC AYLATVDLNA LLYKYENDIA FIIQNHFGGK LECNGEIEEP EVWVARAIKR VERVNKYLWN EEDSMYYDYD IKKETQCKYE SATAFWPLWS KLAIQEQADK LVKNSLDKFE EFGGLVAGTL KSRGDVNLSR PSRQWDYPYG WAPQQILAWI GLCNYGYDGI ARRLAYRWLY MMTRAFVDYN GVVVEKYNVT EGAVPHKVDA EYGNQGLDFK GVATEGFGWV NASYLFGLTL MNLHAKRCLG TLTPPNVFLM NMHPVQREYF L
|
| |