Gene PICST_30704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30704 
Symbol 
ID4838157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp727829 
End bp729559 
Gene Length1731 bp 
Protein Length576 aa 
Translation table12 
GC content44% 
IMG OID640389472 
Productpredicted protein 
Protein accessionXP_001383772 
Protein GI150864797 
COG category[K] Transcription 
COG ID[COG3343] DNA-directed RNA polymerase, delta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTA TGTCCACAGC TACGACTCCC ACGATGAGTC CGAACCATTT GATCACGTCT 
CCCAATATCA CACTTAATAG TTCCAATAGC AATAATATAT CATATTCGTC ATGTTTAGCC
ACCAATACCA TTTCACACGC TGGCTCTTCC AATATCGGTG GCCATTCACA AAATCTGTCT
GGCTCCCAGT CGCAGAGCCA ACAACCCTCT TCAAGCCTGG CATCTTCTGG ACGTACACCA
TCTGATCGTT CAGACCGTAA ACTTGCCGTT CCCAAAAGTT TGACATCACA TTCATCTCCT
TCTTCATCGT CTTGTTCATC ATCCTCATAT TCATCTTCTT CTACTAAGAG TAGTCCCTTG
ACTCAACACA TTGATCCATT TCTTGCCCAG TACCTCAAGT CACCTACCAT TAATTCTACT
TTCACCAACC ACCGCCTTTT TTTCCAGGCG CTCAATAGAG AACGGAAAAT CAAGCTGATG
GACATCTACC TGAAGTACTT GACGTTCAAA CACACTAAGA ACCCATACAA GAACGCCCAG
ACATACCAAC AGTACATCCA GAGAAATGAC CCACCCAAAA TCAGGATCAA GTCGTCGCCC
TTGATATCTC AAAAATATCA AAATTATCAT CACCATCACC ATGAGCATAA CCAGAAAACC
CAGACTTTGA AGAAGTTACC TTCCCCTATG ATTACTCTTG AGAAAACGTT GCTGCCTTCT
CCCAAGAGAA AATTTCCAGA ACCAGTGCCT CAAGTCCAAG TAGTCAAGAA ATTCCTTCCC
CCACAGTTCA TGGACTGTCC TACGGACGAC TTAATAAACT TGATTTCGCG AATGTTGCTG
TCCCTCATAT CGTTGAATGA TAAGTCTGTG CCCGAGTCGA TATCTCACCC AAAGCCATCT
TCAGCAGCTT CGACGAACAG CTTATTGACA AGATACCATT CGCGTACACC TCCTAGCATT
TCTACCCATA CATACTTGAC AAGATTGAGT CAGTATAACA ACTTCAACCC AGCAACGTTG
CTCACAACCA TCTATTACAT CGACTTGTTG AGTCACCAAT ACCAGCCATT TTTCACGTTG
AACTCGTGGA CGGTACATCG TTTTCTTCTT GTAGCTACTA TGTTGTCGCA AAAGTCCATG
GAAGACTTCT TCTACACAAA TGACCACTAT GCCAAAGTTG GAGGTGTGGC TGTAGGCGAA
TTAAATTGTT TAGAGTTAGA TTTCTTGAAC CGCGTGGACT GGAGGTGTAT TCCAGGAAAA
CAGCATTTGC AGGGTCAAGG TCAAGAAAAA GAACACCAAT ACTGCAGTAT CAGATACGCG
AAGGACGTCT TAGATCTCTA CTACATCCAG TTGATCGAAT TGATGGGTAG ACACACCGTA
AACAGCGATC CGTTATCGAA GCATATTCAT TACTTGCCTC AGAGTAAAAA CGAAAATTCC
TCTAAAAATG CCGACGGTAT TGAAATTGAA CAGGAGGAAG AACAGAACGA AGAAGATATG
GACGAGGAAG AAGATGATGA AGAAGATGAT GACGACGACG ACGACGATGA CGATGACGAT
GACGATGATG AGGACGAAGT CAATTCAAGC AGCCAGGCTG TAGAAGATGA AGACGAAGAA
GAAGAGGAAA GCAGAAACGG ACTCAGGAAG AGTCCACTAT TTGATTCAGA CGGCTATAGT
GTCGACGGAA CTTCGTCACC ACACTTGAAG AGGAAGTATT CCAACGAATA G
 
Protein sequence
MTTMSTATTP TMSPNHLITS PNITLNSSNS NNISYSSCLA TNTISHAGSS NIGGHSQNSS 
GSQSQSQQPS SSSASSGRTP SDRSDRKLAV PKSLTSHSSP SSSSCSSSSY SSSSTKSSPL
TQHIDPFLAQ YLKSPTINST FTNHRLFFQA LNRERKIKSM DIYSKYLTFK HTKNPYKNAQ
TYQQYIQRND PPKIRIKSSP LISQKYQNYH HHHHEHNQKT QTLKKLPSPM ITLEKTLSPS
PKRKFPEPVP QVQVVKKFLP PQFMDCPTDD LINLISRMLS SLISLNDKSV PESISHPKPS
SAASTNSLLT RYHSRTPPSI STHTYLTRLS QYNNFNPATL LTTIYYIDLL SHQYQPFFTL
NSWTVHRFLL VATMLSQKSM EDFFYTNDHY AKVGGVAVGE LNCLELDFLN RVDWRCIPGK
QHLQGQGQEK EHQYCSIRYA KDVLDLYYIQ LIELMGRHTV NSDPLSKHIH YLPQSKNENS
SKNADGIEIE QEEEQNEEDM DEEEDDEEDD DDDDDDDDDD DDDEDEVNSS SQAVEDEDEE
EEESRNGLRK SPLFDSDGYS VDGTSSPHLK RKYSNE