Gene PICST_31695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31695 
SymbolPOL5.7 
ID4838556 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1455674 
End bp1457535 
Gene Length1862 bp 
Protein Length476 aa 
Translation table12 
GC content41% 
IMG OID640389871 
ProductPutative RNA polymerase 
Protein accessionXP_001384578 
Protein GI150865385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.265783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAATC ATATGGCCCT TACCAACCTA TTGAACCTAG AAGTGCAAGG TCACATCACG 
GTTGACGACA CCGAGTCAAG TAAGAATCTT GTGAAGAATT GTATAACCTG CAACAGTATT
ACTGTCAGAC AATCATCACA CAATCATCAC ACTCAACGCG CCGCTGTGCG CAGACTTGAA
CGTGTGAGTT GCGATACTAT CGGACCTTTC CAATTTAGGA GATACAATAA GTCTACTGCA
AAAATATTTA TCACTTCCGT TATTGATCAT TACACCGGTT ACACCAAACT ATTATATACT
GATCACAAGT CTCTTGCAGA CACAGTATTA AATACTCTTA ATTTATGGAA TCATAAATTT
CCTGGTGAAT CGATTTCATA TTTCCGGTCG GATAATGCAA TAGAACTTCC TTCTGACGAA
CAACTACTTA AACTTGGAAT CGAACGCGAT CAAATTCCAC CGTATTCTCC AGAGCTAAAT
GGATTGGCAG AATCACATAA TCGCATTATT CTTGCCAATA TACGGAAAGT TGTACTAAGT
TTCCCCGATC GCCACGATGA AGTATTGACT CTCTTCAAAG AAATCGTGGA ATACTCGGCA
TTCGTCAAGA ACAATACTCC CCGAAAGCTG CTCCAATATC GCACACCTGC CGCCGTCTTT
TTTAACTATA AAGACGCATA TCACCGTCCT ATTGTCCCAT TCGGTATGGA CGTTGTGATA
AAAGCATCTT CGAAAGAAGA ATACGAGAAG TATGGCCGTC CCTTACTTAA GACTGAACCA
CACGCTTTCT TCGGTTCTAT CGTCGGCTTC GCCACAGATA ACTATAGCTA CCGAATACTC
GTCCAAGCAG AACATTTCCC AATTATAACG AATTGTAATG CAAAACTCCT CAATTCACGC
CAATTTATCG AGAATTATTT TCAATCACTT GATTTGTTAC AGAGAGACAG CGCTCAATAT
AATGCCACAG TACTCGATGC ACTTGAAGAC AAACTTGCTG ATCATATCGA TATCGCTGAT
AAAGAGGTTA TCTTTGATGC TACTTTACTT AAAAACGGGG ACACATCTGT CCAGACAGCC
AACATTGGAG ACACCAATTT ATCGACGCCA TCTCCTCAAA ATCCTGAGGC TTTGAGCAAT
ACACCTTCTT CGCAGATGAC GATCGAAAGT CTTTCTACAA GACATCCCGA AGTAGCCACT
CAACACAAAC GGCGTATAGA AGAACTCACT TCTGACACGT TACCATTTCC TACCGATCAA
GAAGGTAATT CGGTAGACGG AAAGCACATT AAGCGCTCAC CACGATTAGG GGGAGTAAAA
ACAACTCCAC TTTCAGCCAC TAGAACGATC AGGGACCATA AGACTAAAAG TAAAATTGTT
GATAACAATC GTGATTACGC TGCTGACCCC ATTAAAACTG CTATAGAAGA AACCAGAACG
AAAATGAGTG AAGATCATAA CTTAAACTCA ACTACGACAA ATTCTGAAGA GTTAACTTCA
CAGTTAGCCT CCCAGGAGGA TAGAATCTCA GGGGAGTTAT CAGGGGAAAC AAATGTGGAC
TTATCAACCA CACCAGCACA AAATACAAGA TCACATACCA AGTCAAGACT AGACAAACAA
ATTGAGCTGT CTAAAAATTG GTCTAACCTA GATACGCGAA AGACTCAATC ATGGAATAAA
GTCCCACCTG AAATACACGG AAAGGGCAAA ATTAGTAAGA GGAAAGGTAC CAAAGCATCA
TTATTGGACA AGGATGCACA AAGCTCACAA ATAGCAAAAA CGATCACCTC CGAAATAATA
AAAAGAAAGA AGAAGACACC AGATTTCGAT TCCAGTCCAC AACAGATAAA TGCATTGCTT
AG
 
Protein sequence
MYNHMALTNL LNLEVQGHIT VDDTESILSS DNHHTIITLN APSCADLNVF PDRHDEVLTL 
FKEIVEYSAF VKNNTPRKSL QYRTPAAVFF NYKDAYHRPI VPFGMDVVIK ASSKEEYEKY
GRPLLKTEPH AFFGSIVGFA TDNYSYRILV QAEHFPIITN CNAKLLNSRQ FIENYFQSLD
LLQRDSAQYN ATVLDALEDK LADHIDIADK EVIFDATLLK NGDTSVQTAN IGDTNLSTPS
PQNPEALSNT PSSQMTIESL STRHPEVATQ HKRRIEELTS DTLPFPTDQE GNSVDGKHIK
RSPRLGGVKT TPLSATRTIR DHKTKSKIVD NNRDYAADPI KTAIEETRTK MSEDHNLNST
TTNSEELTSQ LASQEDRISG ELSGETNVDL STTPAQNTRS HTKSRLDKQI ESSKNWSNLD
TRKTQSWNKV PPEIHGKGKI RCTKLTNSKN DHLRNNKKKE EDTRFRFQST TDKCIA