Gene PICST_39360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39360 
Symbol 
ID4851662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2478983 
End bp2480185 
Gene Length1203 bp 
Protein Length400 aa 
Translation table 
GC content47% 
IMG OID640393370 
Productpredicted protein 
Protein accessionXP_001387047 
Protein GI126275193 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4833] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCT CCAACCCTAA CCTTGAAGGG TATAACGTTG TGCGCAACTT TTGGCTCTTG 
TACTACAACA GGAATAAGGG CACTTTTGTA GCACCCAAGA AATGTAGCGG ATGCTACAAT
GAGGACAGAT TTGTGGTTTG GGCTGTTGCT GTTGCTGTTC AGGCAGTAGT CGATGGTGCC
CGAATCTATC CAGAATTGCG TCCCTTGATT GCGCCTGGCA TAGATATCTT CAAGAAATAT
AAGAATCCTC ACCTCAGAGG ATATTCTGCT GCTGAAAATG GTGGAAATGA CAAGGACATC
TACTATGATG ATGATGCCCA AGTAGCCAGT GCCATGCTTA CTGCGTATGA GGTTACTGGT
GAAAAGCGTT ACTTGGACCT CGGTAGGGAA TTAGTGCGTT TCCTCATGGG AGGATGGAAC
ACCAATCCCA ATGCTAAGAC CAAGGGTGGC ATGTGCTGGC ACATCACTAA CCACTACTTA
AACGCTTGTA CTACCGCTGA AACCGCTAAA GCCTGTTTGC AGATTCTGAG ATTCATTCCC
AACGAAGCAA AGATCTACAT CGATTTCGCA GCCAAATGTA TCGATTGGCA GATCAGAGTC
TTGCAAGACC CATCGGACAA GTTGATTAAG GATGGTGTCC AGGACACTTC TACAGACTTT
AACGATACGA AGTGGACATA TAACGTAGGT ACTACGTTAT CTGCTGCCGC TCACTTGTAC
CATATCACCA AGGATCCTAA GTGGAAGCAG ATAGCTGATG ACTTGGCTGC TGCTGGGATT
AACAGAGGTG TTTTCTTCTA TGACCGTGAC TACGACGACG CTCACAGATA CTGGAGAGAT
GCATCCTACT TTGTCCAGTT GCTTATAGAG GGATTGGCAG ACTACTTGTT GTACGTGGGG
AACGAAGCTC CAGAGGGCTT GCCAGAAAAA ATCCAGGAAG AAGTCAGAAG ACACTTGGTT
ATGTTCTACG AATACATGAG AGACCCAAGA GACGGCTTAT ATATCCAAAG TTTTGAACCC
CACAGAACAT ACAAGGAAGT CTACGACTCC AAATACGTCA AGGAGTTTGG TGGCCACAAA
GGTTGGGGGT TGAAGGACGA AGACAAGTCA GGAGACGAAC CTATGAAGTG TTTGATGGGT
GGAGGTGCTG CTGCGAGAGT GTTTTTCCAG GGTGCCAGAG TTGTTCCCGA GGTCAAGTAT
TAG
 
Protein sequence
MSLSNPNLEG YNVVRNFWLL YYNRNKGTFV APKKCSGCYN EDRFVVWAVA VAVQAVVDGA 
RIYPELRPLI APGIDIFKKY KNPHLRGYSA AENGGNDKDI YYDDDAQVAS AMLTAYEVTG
EKRYLDLGRE LVRFLMGGWN TNPNAKTKGG MCWHITNHYL NACTTAETAK ACLQILRFIP
NEAKIYIDFA AKCIDWQIRV LQDPSDKLIK DGVQDTSTDF NDTKWTYNVG TTLSAAAHLY
HITKDPKWKQ IADDLAAAGI NRGVFFYDRD YDDAHRYWRD ASYFVQLLIE GLADYLLYVG
NEAPEGLPEK IQEEVRRHLV MFYEYMRDPR DGLYIQSFEP HRTYKEVYDS KYVKEFGGHK
GWGLKDEDKS GDEPMKCLMG GGAAARVFFQ GARVVPEVKY