Gene PICST_64968 details

Gene Information

Locus tag: PICST_64968
Symbol: NTH2
ID: 4851578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2195250
End bp: 2198411
Gene Length: 3162 bp
Protein Length: 811 aa
Translation table:
GC content: 44%
IMG OID: 640393286
Product: Neutral trehalase (Alpha,alpha-trehalase) (Alpha,alpha-trehalose glucohydrolase)
Protein accession: XP_001386768
Protein GI: 126274916
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1626] Neutral trehalase
TIGRFAM ID:
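
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 3162 bp). The helper name `span_length` is hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 2195250
END_BP = 2198411
PROTEIN_LENGTH_AA = 811


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end].

    Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
    """
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)  # 3162

# Nucleotides needed to encode the protein plus one stop codon.
cds_nt = PROTEIN_LENGTH_AA * 3 + 3
print(cds_nt)  # 2436
```

The gene span (3162 bp) is longer than the 2436 nt needed for an 811 aa protein plus a stop codon, which suggests that the listed gene sequence includes bases outside the coding region.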


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.332926
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0654828
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATAGGTTGAA GTCTGGCTCT TCCCAGTGAA AGTAGTGCTG GTTTATTCTC TCCCGAAGTA 
AGAGCAGGTA TCTGGTGCGA GCTTCTGTAC CAATTTCTTT AAAATCAAGA TCATTCATAA
ATATACCAGA CGGAAAGTGT CTTAAATACT TGAGATAAAT TGGAACTATA AAATCGTAAC
GATATACGCA GTTCCGTCGA GGTCCGTATC TACAACTCTA AGCAATCCGA TTGTCACAGT
CTATTGATTT TTGATCACAT CAATATCGCC AGCTTGTGTT ATCGCACTTA TTTCATCTAA
ATTGCATCCC CGCGAAAACC GCTGCAGGCG CAAAAAAGAT TATTTATACA TACAGAAGAC
CAGCTCTCCC TTTTTCTTAT TTACTTTCTC CCCCATAGAG AAAAGTATAG TTGAAGTTTT
GCATTGCTAA AACACTCCTG ACTAGATCCG TTACACTTAT CGAACACCAA TCGTTGTTGC
ATTTCAGTTA CTTCTTAGTT GTCTATAATC ATTATCTATC TCATCGAGTC CTATACATAC
TTTATCTTCA TCCTGATAAT TGCCTGATAT AGTGATCTCT TCGTTTCGAG ATAGTTTCAC
TTTTCACGTC ATATGATCAC CATTCTGCTC ATTTTATCTA ATATTCACAA TGTTTAACCC
TTTCAGGCAC CGCAGGTCTT CTTCCAGCTC CGGCGACGAT GATCCCTTTG ACCTCGCAGA
AAACTACTAT GGCCCCAAAA ACGCTCCCAT CTCCAAGATG GGCAGAGTGA GAACCTTCTC
AGTTTTTGAA ACCAAACCAC GCAACCAGAT CTTTGACTTT GCCGAGGATG CCATCAAGGA
ACAGTCTTCT TCGGCTTCTA CATCCCCATC GCCTACGCCC TTTGCAGAGA GCTTAAGCGA
GGATAACGAG CCACGTCCCA GCGAAGCGGT GTTCGGCCGG AAGCCTTCCA TCGTGCCTAT
TTATGCTGAC GACTCGTCTA CAGAGTCAAT CAACAGCATC GGACATCCTG TAAAGCCTAA
AGCAATTCAT CACTTCAGAA GGTCTTCTGT AGACGACTCT TATCTCCGGC CCAAGAAGTT
CTACATCAGC AACGTAGAAG CTACGCTTGA TGAGTTGTTG CATAATGAAG ATACCGACCG
TAACTGCCAG ATCACCATTG AAGATACCGG GCCGAAGGTA TTGCGCTTGG GAACTGCCAA
CTCTAATGGC TTCAACCAGG CGGCTATCAG GGGTACCTAT ATGTTGAGTA ACTTGCTCCA
AGAGTTAACT ATTGCCCAAC GTTTTGGCAG AAAAGAGATG ATCCTAGATG AACAACGTCT
TAACGAAAAC CCCGTTGCCA GAATGAAGCG TCTCATATCG ACCCAGTTCT GGAAGTCGTT
GACAAGGCAG ATCACGGCAG AAAACATCGT AGACATGGCT AGAGACACCA AGATCAAAGA
AACTTACGTA GACCAACACG GAGTCCTCCA TGAGGATGCG GAATCGCACC GTATATACGT
CCCATACAAC CGTAAGGACC AATATGACTA CTATATGTCA ATCAAAGAAA GCAGACAGGA
TGTCTTCTTT GATGTACAAT ACTTGCCGGA AGTCATAGAT TCGGTTTACA TAAGATCCAT
CAACAAAAAG CCTGGTTTAC TTGCTTTGGC TTCTTCTCCA GACCCTAAGG ACCCCAAGAA
AATGATTAAC CAGCCTTATG TTGTCCCCGG CGGTAGATTC AACGAGTTGT ACGGCTGGGA
TTCGTACATG GAGACTATTG GATTATTAAC CGATGTCACT CCCAAGGACC AAGAACATCT
CAGATTAGCC CGGGGCATGA CGGAAAACTT CATCTACGAG ATCCACCACT ACGGCAAGAT
TCTTAACGCC AACAGGTCCT ACTATTTGGG CAGATCCCAG CCTCCCTTCT TGACGGATAT
GGCTCTTAGA GTATTCGACA AGACTGTAGA AGTAGACCCG GGTAGGAAAG TAGAAGCGTT
GGATTTTCTC AAGAGAGCGA CCGTAGCAGC CGTTAAAGAA TACAAGACCA TCTGGTGTGC
CAAACCTCGT TTAGATGAGA AGTCTGGTTT ATCATGCTAC CACCCTGATG GTTTGGGAGT
TCCACCAGAG ACTGAATCTT CTCACTTCAA CGCTGTGTTG AAGCCGTACG CAGAAAAGTA
CAAGATTTCA CAGGACGAGT TCATCGAGAA GTACAATGCA GAGATCATCA AGGAACCTGA
ATTGGACGAG TATTTCCTTC ATGACAGAGC TGTGAGAGAA TCTGGTCACG ACACCACGTA
TAGATTGGAG GGTAGATGTG CTTATTTAGC TACTGTAGAC TTGAATGCTT TGTTGTACAA
GTACGAGAAT GATATCGCAT TTATCATCCA GAACCACTTT GGGGGGAAGT TGGAATGCAA
CGGAGAAATC GAGGAGCCGG AAGTTTGGGT AGCCAGGGCA ATCAAGAGAG TGGAACGAGT
CAACAAGTAT TTATGGAACG AAGAAGATTC CATGTACTAC GACTATGACA TCAAGAAAGA
AACGCAGTGC AAGTACGAAT CTGCCACTGC CTTCTGGCCA TTATGGTCCA AGTTGGCTAT
TCAGGAACAA GCGGACAAAC TTGTCAAAAA TTCATTGGAC AAGTTCGAAG AATTTGGTGG
TTTGGTCGCT GGTACGCTTA AATCTCGTGG AGACGTCAAT CTCTCGAGAC CTTCTCGTCA
ATGGGACTAT CCTTATGGTT GGGCCCCTCA ACAAATCTTG GCCTGGATTG GGTTGTGCAA
CTATGGGTAT GATGGTATCG CTCGCCGTTT GGCGTACCGC TGGCTCTACA TGATGACCAG
AGCTTTTGTA GACTACAACG GGGTTGTTGT CGAGAAGTAC AACGTCACCG AGGGAGCTGT
TCCACACAAG GTTGATGCCG AGTATGGTAA CCAAGGTTTG GACTTTAAGG GTGTAGCAAC
TGAAGGATTT GGCTGGGTCA ATGCCAGCTA CTTGTTTGGG CTCACGTTGA TGAACTTGCA
TGCAAAGAGA TGTTTGGGGA CGTTGACGCC GCCAAACGTG TTCTTGATGA ATATGCATCC
CGTGCAAAGA GAGTACTTTC TGTAATTTCT CTGGTTGGTA TAATGAGATA TAACATATAA
GAAATATATA AGAATTTAGC AAATGTAATG CTATATAATG AT
 
Protein sequence
MFNPFRHRRS SSSSGDDDPF DLAENYYGPK NAPISKMGRV RTFSVFETKP RNQIFDFAED 
AIKEQSSSAS TSPSPTPFAE SLSEDNEPRP SEAVFGRKPS IVPIYADDSS TESINSIGHP
VKPKAIHHFR RSSVDDSYLR PKKFYISNVE ATLDELLHNE DTDRNCQITI EDTGPKVLRL
GTANSNGFNQ AAIRGTYMLS NLLQELTIAQ RFGRKEMILD EQRLNENPVA RMKRLISTQF
WKSLTRQITA ENIVDMARDT KIKETYVDQH GVLHEDAESH RIYVPYNRKD QYDYYMSIKE
SRQDVFFDVQ YLPEVIDSVY IRSINKKPGL LALASSPDPK DPKKMINQPY VVPGGRFNEL
YGWDSYMETI GLLTDVTPKD QEHLRLARGM TENFIYEIHH YGKILNANRS YYLGRSQPPF
LTDMALRVFD KTVEVDPGRK VEALDFLKRA TVAAVKEYKT IWCAKPRLDE KSGLSCYHPD
GLGVPPETES SHFNAVLKPY AEKYKISQDE FIEKYNAEII KEPELDEYFL HDRAVRESGH
DTTYRLEGRC AYLATVDLNA LLYKYENDIA FIIQNHFGGK LECNGEIEEP EVWVARAIKR
VERVNKYLWN EEDSMYYDYD IKKETQCKYE SATAFWPLWS KLAIQEQADK LVKNSLDKFE
EFGGLVAGTL KSRGDVNLSR PSRQWDYPYG WAPQQILAWI GLCNYGYDGI ARRLAYRWLY
MMTRAFVDYN GVVVEKYNVT EGAVPHKVDA EYGNQGLDFK GVATEGFGWV NASYLFGLTL
MNLHAKRCLG TLTPPNVFLM NMHPVQREYF L
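
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The function names `clean_sequence` and `gc_percent` are hypothetical, and how the record's own "GC content" value was computed and rounded is not stated, so the result may not match it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing above, used as a small example.
first_line = (
    "ATAGGTTGAA GTCTGGCTCT TCCCAGTGAA AGTAGTGCTG GTTTATTCTC TCCCGAAGTA"
)
print(len(clean_sequence(first_line)))  # 60
```

Applying `gc_percent` to the full gene sequence (all lines joined) gives a value that can be compared with the GC content field in the record.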