Gene PICST_33450 details

Gene Information

Locus tag: PICST_33450
Symbol: EXO1
ID: 4840691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 673373
End bp: 675403
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 12
GC content: 43%
IMG OID: 640392006
Product: 5'-3' exonuclease
Protein accession: XP_001386143
Protein GI: 150866513
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
TIGRFAM ID:
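
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `check_cds_lengths` is hypothetical and is not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree with each other.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, the gene is unspliced, and the last codon is a stop codon.
    """
    # Inclusive span covered by the gene on the replicon.
    span = end_bp - start_bp + 1
    # Number of complete codons, including the terminal stop codon.
    codons = gene_length_bp // 3
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields of this record.
consistent = check_cds_lengths(673373, 675403, 2031, 676)
print(consistent)
```

For this record the span is 675403 - 673373 + 1 = 2031 bp, which is 677 codons, so 676 amino acids plus one stop codon.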


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.167856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGTTA CTGGTCTTCT TCCTTGCTTG AAAGAGATCC AGAATCCAGG CACTCTTGAA 
CAGTACCGAG GCAAGACTCT AGCCATTGAT ACCTATGGCT GGCTCCATCG AGCTCTAATC
TCGTGTGCCG AGGAATTGTG TCTTGAAAGG CCCACTCGCA AGTATATAAC GTCGATTTTG
AAGCGGGTAG ATATGTTGCG TCATTTTGGC GTCGAGCCCT ACTTCGTTTT TGATGGGGCT
GCCTTGCCTA CCAAGGCTGA AACGGCCAAT GAAAGACGCG TGAAACGTCA GGAAGCCCGC
AAAAAAGCCG AAGAATACCT GAAAGCAGGC AAAAGACTGT TGGCATGGAA AGAATATATG
AAGGCAGCCA GTGTGACATC TCAGATGGCC AAATCGGTCA TGGTAGAACT AGATGCCCGG
GGCATTAAGT ATATAGTAGC TCCGTATGAG GCAGATCCCC AGATGGTATA CTTGGAAAAG
ATTGGACTTG TAGACGGTAT TCTTTCTGAA GATTCCGATT TGTTGATTTT CGGTTGTAAC
CGTTTGATTA CGAAGCTCAA AGACGATGGA ACACTAGTGG AAATATGTCG TCAGGACTTC
CACAAGGTCA AGCTGATTCC GTACTTGAGC AAATTCTCCC AAGAACAGCT CCGGTTGATA
GCAATGCTTT CTGGCTGTGA TTATACCAAC GGCATCCAAG GTATTGGCAT CAAGACAGCT
TTCAATTTGG TCCAGAAACA CGCCAAGCTT GAAAGAATAG TAGCAGTCTT GAGAGCTGAA
GGAAAACCTA TAGATGAAGG GTTCCACGAT GAGCTTCATA GAGCAAACTT GGCATTCCAG
TTCCAGAAAG TATTCAATCC AAAGCTTCAA CAGCTCAAGA CTTTGCATGA TTACCCTGAG
GATTTGGAAC TAGACTACGA GGTTCTTGAA TCATGTTGTG GAAGAACTTT CAGCAACGAA
CTCTACATCC AGATTTGCAA TGGTTCTATA CACCCAAACA CTCACGAAGC CTTGGTTTCA
AGAGAGCAGA GTTTGTCCAG TCTCAAGAGC AATTCAGTCA ATATTAGCTC AGTGAAATCT
GCTCCAGCTC AATTGTCTCA GCGGTCAAAG TCTATTACTG CTACTCTAAC TCCAAAAGGA
CCACAGAAGA CAGTCTTTGA TTTCTTTCAA GTTAGAAAAC AAGTTGTTTC TGTGTCACTG
ATTACCGGTA ATTCAGTATC TTCAGTAACT TCGTTATCGA CTGGTAGTAC AACCGTCAAG
ACTCTGAAAA CGCCCGAAAA GAGATTATAT CCAGCATCTG ACCGTAGCAA AATGTCGCCT
ACCTCAAGAA AGATGCAAAG AATTGCAGAT GATCCAATAC CACCTGCAAG TTCTGTTGGA
AAGATCAGTA AATTCTTTTC GTCATCTCTG GACAAGACGC AAGAAAGTCA GGTTAAGACA
GTAGAGCCGT CTTCACTTTC TTGGGACTCA AGCATGATTG GAGATTCGTA TTTTTCTGAT
GAACCAGGCA GTCCTGTCAA GTCTGTCAAC ACCAATGATA TCTTGGAAGA TCTAACTGAT
ACAGACGATC CAATTTTTGA TCCTCCGGAG AATGACACGG AAAACTCTAC TTCTGAACAG
TCTACTTCTA AAGTAGAAAG TACAATATCT GATAATTTTG GTATTGACGA TGACGACGAT
GAGATCGAAG AGTCTCCTGT AAAGAACAAG GAAGCACCTC GTTTGCAGCA AGTTCAACCT
ACCAAGATGG AAGTGCTTAG GGAGAATTTA AGGGAAGAAT TCTCGTTCAG CATGAATCCT
CTTCCCACTA GAAGCAACTC CTCATTGACA TATTCAAGTA ATAGATCTGC TAGGCTTCCT
TTACAGGCCA AGGATGTAAA CATTAGTTCT CGATCTTTGA AAGCTAGAGC AACTGAACCA
AAATGTGTAT CTACATTCAA GCCACAGACT GAAAAACAAC AGTCTCAACA GCATATTAAA
CCACCGAAAA AGCATATAGA TCTCAAACAA TTTGCATTTG GAAGAAATTA G
 
Protein sequence
MGVTGLLPCL KEIQNPGTLE QYRGKTLAID TYGWLHRALI SCAEELCLER PTRKYITSIL 
KRVDMLRHFG VEPYFVFDGA ALPTKAETAN ERRVKRQEAR KKAEEYSKAG KRSLAWKEYM
KAASVTSQMA KSVMVELDAR GIKYIVAPYE ADPQMVYLEK IGLVDGILSE DSDLLIFGCN
RLITKLKDDG TLVEICRQDF HKVKSIPYLS KFSQEQLRLI AMLSGCDYTN GIQGIGIKTA
FNLVQKHAKL ERIVAVLRAE GKPIDEGFHD ELHRANLAFQ FQKVFNPKLQ QLKTLHDYPE
DLELDYEVLE SCCGRTFSNE LYIQICNGSI HPNTHEALVS REQSLSSLKS NSVNISSVKS
APAQLSQRSK SITATLTPKG PQKTVFDFFQ VRKQVVSVSS ITGNSVSSVT SLSTGSTTVK
TSKTPEKRLY PASDRSKMSP TSRKMQRIAD DPIPPASSVG KISKFFSSSS DKTQESQVKT
VEPSSLSWDS SMIGDSYFSD EPGSPVKSVN TNDILEDLTD TDDPIFDPPE NDTENSTSEQ
STSKVESTIS DNFGIDDDDD EIEESPVKNK EAPRLQQVQP TKMEVLRENL REEFSFSMNP
LPTRSNSSLT YSSNRSARLP LQAKDVNISS RSLKARATEP KCVSTFKPQT EKQQSQQHIK
PPKKHIDLKQ FAFGRN
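
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on one in-frame 60-nucleotide stretch copied from the sixth line of the gene sequence listing (nucleotides 301 to 360). The helper names `codon_table` and `translate` are hypothetical and written only for this illustration; the standard table is built from the usual TCAG codon ordering, and table 12 is modeled here as the standard table with CTG reassigned.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code (table 1), listed in TCAG codon order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(aas):
    """Map each of the 64 codons (TCAG order) to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = codon_table(STANDARD_AAS)
# Table 12 differs from the standard table in the CTG codon (Ser, not Leu).
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Sixth line of the gene sequence listing; it contains two CTG codons.
segment = "AAAAAAGCCG AAGAATACCT GAAAGCAGGC AAAAGACTGT TGGCATGGAA AGAATATATG"

standard = translate(segment, TABLE_1)
yeast_alt = translate(segment, TABLE_12)
print(standard)   # KKAEEYLKAGKRLLAWKEYM
print(yeast_alt)  # KKAEEYSKAGKRSLAWKEYM
```

The table 12 result matches residues 101 to 120 of the protein sequence listed above (KKAEEYSKAG KRSLAWKEYM), while the standard-code translation differs at the two CTG positions.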