Gene PICST_83789 details

Gene Information

Locus tag: PICST_83789
Symbol:
ID: 4839608
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 267188
End bp: 268348
Gene Length: 1161 bp
Protein Length: 376 aa
Translation table: 12
GC content: 44%
IMG OID: 640390923
Product: predicted protein
Protein accession: XP_001384710
Protein GI: 150865476
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG5242] RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4
TIGRFAM ID: [TIGR00627] transcription factor tfb4


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.362151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGA TATCTGACCG AGTCTTCACG GAGACTACTC TGACGGAGAC CTCTAATGAT 
GACCCGTCAC TTTTGACGGT CATTCTCGAT GTCTCACCAG CTGGTTGGTA CAGAATCCGA
GATCAAACAT CAATCGACGA ATTGGCCAAG TCACTTTTAG TTTTCATGAA TGCCCATTTG
TCACTCAACA ATTCCAATCA GGTGGCATTT ATAGCGAGTA CACCACAGAA ATCGAAGTTC
TTGTTTCCTA ATCCAGAAAT AGACTACGAC GAGATTCGAA CCAGTTCTTC TAGCTCTGGT
TCTGCCTCAA ACCAGCATCA GAGTCAAGAT ATCGACTCCA ATGCTAGAGA AGAAACACCT
ACTTTAGTAT CTAAGGACAT GTATAGACAA TTTCGAGTTG TTGATGAAGC TGTTCTCGAG
GAGTTGAACG TGGTTTTCGA CGAAATTGCT AATGGGATAC AAGATATAAA TAATAACTCT
ACTCTATCCG GAGCTCTCAG CATGGCTCTA ACATATACAA ACCGAATGTT GACTCTTGAC
CAACTGATTT CTACAACTAC GGCTTCAGCC ATCAACTCTA CTACTAGTAT GGGAGCAGGT
TCTGGGTCTG GAAACACAGC TACCAATTCT TCTACCAGCA ATCCTTCCAA CAGCATTACT
TCTATGAAAT CGCGTATTCT TATTGTCACA GCCAACGACG AAGACGATGT CAAGTATATT
CCCGTGATGA ACTCGATCTT TGCGGCTCAG AAAATGAGGA CTTCCATTGA TATAGCCAAG
TTGGGCTTTG AGGACTCGTC ATACTTGCAA CAAGCGGCGG ATGCTACTAA TGGGATTTAC
TTCCACGTTC ATGATCCTCG TGGAATTGTG CAGACTTTGA CTTCTGCTTT TTTCATAGAA
CCTTCTATCA GACCGTTCAT CATACTCCCA ACCAACTCTA ATGTCAACTA CAGAGCCAGT
TGCTTTGTCA CCGGCAAATC CGTGGATATA GGCTTTGTTT GTTCAGTGTG TCTCTGCATC
ATGAGCAAGA TTCCACCGTC TGGCAAATGC CCGGCCTGCG AATCGGTGTT TGACGAAAAG
ATCATAGCCC AGTTGCTGAA AGGTCCTTCT GTTCTTTCCA AGAAGAAGAG AAAGATAGAT
ACGAATGGTG CAGCCAAATA G
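
The gene sequence above is printed in blocks of 10 bases, so it needs light cleaning before any calculation. The following is a minimal Python sketch of how the Gene Length and GC content fields could be rechecked from the record. The helper names (clean_sequence, gc_fraction, span_length) are hypothetical, and the sketch assumes that Start bp and End bp are 1-based, inclusive coordinates; under that assumption the span 267188 to 268348 gives 1161, which agrees with the listed Gene Length.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGGACGCGA TATCTGACCG AGTCTTCACG GAGACTACTC TGACGGAGAC CTCTAATGAT"
seq = clean_sequence(first_line)

print(len(seq))                      # number of bases on that line
print(round(gc_fraction(seq), 2))    # GC fraction of that line only
print(span_length(267188, 268348))   # compare with the Gene Length field
```

Passing the full gene sequence to clean_sequence and gc_fraction would give a value to compare against the listed GC content of 44%.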
 
Protein sequence
MDAISDRVFT ETTSTETSND DPSLLTVILD VSPAGWYRIR DQTSIDELAK SLLVFMNAHL 
SLNNSNQVAF IASTPQKSKF LFPNPEIDYD EIRTSSSSSG SASNQHQKTP TLVSKDMYRQ
FRVVDEAVLE ELNVVFDEIA NGIQDINNNS TLSGALSMAL TYTNRMLTLD QSISTTTASA
INSTTSMGAG SGSGNTATNS STSNPSNSIT SMKSRILIVT ANDEDDVKYI PVMNSIFAAQ
KMRTSIDIAK LGFEDSSYLQ QAADATNGIY FHVHDPRGIV QTLTSAFFIE PSIRPFIILP
TNSNVNYRAS CFVTGKSVDI GFVCSVCLCI MSKIPPSGKC PACESVFDEK IIAQLSKGPS
VLSKKKRKID TNGAAK
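
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 48 bases of the gene sequence. The function names (build_table, translate) are hypothetical. Because the gene is marked as spliced, the full gene sequence cannot be translated directly; the sketch assumes only that the opening 48 bases lie before any intron, which is consistent with the result matching the start of the protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(ctg_as_serine: bool) -> dict:
    """Build a codon table; optionally apply the table 12 change (CTG -> Ser)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AMINO_ACIDS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 48 bases of the gene sequence in this record.
first_48 = "ATGGACGCGA TATCTGACCG AGTCTTCACG GAGACTACTC TGACGGAG"

print(translate(first_48, build_table(ctg_as_serine=True)))   # MDAISDRVFTETTSTE
print(translate(first_48, build_table(ctg_as_serine=False)))  # MDAISDRVFTETTLTE
```

Only the table 12 result reproduces the opening residues of the protein sequence (MDAISDRVFT ETTSTE); the standard code would place a leucine at position 14.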