Gene PICST_57399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_57399 
SymbolEXG1 
ID4837972 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp537868 
End bp539313 
Gene Length1446 bp 
Protein Length458 aa 
Translation table12 
GC content43% 
IMG OID640389287 
Productexo-1,3-beta-glucanase 
Protein accessionXP_001383379 
Protein GI150864529 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.514091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACATCACCA ACTCCACCAT CCAATCTCAG TACAATTCCA ACAATTTCCA GTACCAGGGC 
GTAGCCATCG GAGGCTGGCT TGTGCTTGAG CCATATATAA CGCCGTCCTT ATTTCTTGCC
TTTAACCAAA CGGCGAACAC CACAGAGAAG GATATTCCCG TGGACGAATA TCACTATTGT
AAGAAATTGG GCTACGAGGA AGCTGAGAAA AGACTAACTC AGCATTGGGA CACGTTCTAC
ACCGAGAACG ACTTTGCCGA CATCAAGAAT GCCGGCTTGA ACATGGTGCG GATCCCCATT
GGATATTGGG CTTTCCAGAA ACTCGATGGC GACCCATATG TTCTGGGAGC CCAGGACTAC
TTGGACAAGG CCCTAGAATG GGCCAAAAAC AACGATCTTA AAGTATGGAT CGATTTGCAT
GGAGTTCCGG GGTCACAAAA TGGATTTGAC AATTCTGGAT TTAGAGATAT CGGATACCCT
GGCTGGTTCA ACAGTACAGA AAACGTGAAC TTGACAAAAC AAGTTCTTCA CCAAATTTAC
CACAAATATG GTACTGGAGA GAACGCCATA AATTATAGGG ACACCATTCT CGGTATTGAA
GTCGTTAACG AACCCTTCAC TCCAAAGTTG TCGATGTCAA GGTTGCAAAG TTTCTATATA
GACACCTACA TTGACTCCAG AAAAACACAG ACTCTCAACA ATACAATCGT CATTCATGAT
GGGTTTGAGG GGATTGGCTA TTGGAACGAT TTCCTTGCTG GTGGAAAAGT TTATTCCAAC
AGTAGCTATC TCAACAGTAC GAATAGCACA TCCAAATTCC TGAAAAGGGA CAATTACTTC
CCATTTAATG CTACAAATAT TACAGCTGTT GCTTCAGCAG AAGTTTTTAA CGTGTTGATT
GATCACCACC ATTATGAGGT GTTTGCCTCT TCAGTTGCTT CGAACATCAC GACACATTTG
CTGAATATTC GAGGATACAG TCAATCTATC AAGGACGAGT TAAAGTATCA TCCGGCTGTA
GTAGGCGAGT GGTCAGCAGC CTTGACTGAC TGTACACCTT GGCTCAATGG AGTCGGCCTA
GGAGCTCGAT ACGCAGGAGA AGCACCCTAT GATAACACTA CCAAGGTGGG AAAATGTGAC
AACATCAACA ATTTCGACAA ATGGACCAAA CAGCAGAAGA AAAATACCCG GAAATTTATT
GAGATACAAT TAGATCAGTA CTCTCGTTAT TCGAACGGGT GGATTTTCTG GTGTTGGAAG
ACTGAAACCA CCATCGAATG GGATTTCAAA AAGCTTGTAG AGTTGGACTT GATGCCCCAA
CCGCTTAACA ATTTTACGTA TATCGTAAAT GGAACAGATA CAGATCCTGA TAGTGGAGCT
TCTCTTGCAA GGCTCAGTTT GTTTATCCCA GTGGCACTGG TCTTGATGTT CGTTGCTTTT
TTTTAA
 
Protein sequence
NITNSTIQSQ YNSNNFQYQG VAIGGWLVLE PYITPSLFLA FNQTANTTEK DIPVDEYHYC 
KKLGYEEAEK RLTQHWDTFY TENDFADIKN AGLNMVRIPI GYWAFQKLDG DPYVSGAQDY
LDKALEWAKN NDLKVWIDLH GVPGSQNGFD NSGFRDIGYP GWFNSTENVN LTKQVLHQIY
HKYGTGENAI NYRDTILGIE VVNEPFTPKL SMSRLQSFYI DTYIDSRKTQ TLNNTIVIHD
GFEGIGYWND FLAGGKVYSN SSYLNTVASA EVFNVLIDHH HYEVFASSVA SNITTHLSNI
RGYSQSIKDE LKYHPAVVGE WSAALTDCTP WLNGVGLGAR YAGEAPYDNT TKVGKCDNIN
NFDKWTKQQK KNTRKFIEIQ LDQYSRYSNG WIFWCWKTET TIEWDFKKLV ELDLMPQPLN
NFTYIVNGTD TDPDSGASLA RLSLFIPVAS VLMFVAFF