Gene PICST_70134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_70134 
Symbol 
ID4836929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1835074 
End bp1836742 
Gene Length1669 bp 
Protein Length526 aa 
Translation table12 
GC content45% 
IMG OID640388244 
Productpredicted protein 
Protein accessionXP_001383124 
Protein GI150864349 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.927836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTAG CAGGAAAAAT GGGATCGCAC CCCCGACCCG GAACGGAGAA GTACTCCCGT 
GGTTACAACA AGGAGCAGAG AAGAGCTCAC TACTCCAAAG ACAAGAAGCT CAATGCTGGT
CTCAAGAAGT TGGATCAACA GCATAAAGAA GCCATGAGGT CTGCAGCAGG AACCGAGTTG
CTTTTACAAG AAGAGGTCGG GTTTTTAGAA GCAGATGGAC CAATGGAAAA AACCTTCAAA
TTCAAACAAG ATGAAATCAC AGAGGTATTA GATGAGAGTA CAGTCAACAA GAAGTTCGAG
CTCAAGCTTC CACAACTAGG CCCTTACACC GTAGACTACA CGAGGAATGG CAGAGACTTG
TTGATAGGTG GAAAGAAAGG TCATATAGCC TCTTTTGACT GGAGAAAAGG AACTTTGGAT
TGTGAGTTAC ACTTGAATGA GACCGTACAT GCTGTAAAGT ATTTGCACAA CGACCAGTAC
TTTGCCGTGG CCCAGAAGAA GTATACCTTT ATCTACGACA AACAAGGCAC AGAGTTGCAC
CGTTTAAAAC AGCATATAGA TTCGACATTA TTGGACTTCT TGCCGTACCA CTTCCTCTTG
GCTACTGCAG GGAACACTGG CTTCTTGAAG TTCCACGATG TGTCTACTGG AGACTTGGTC
TCTGAATTCC GAACCAAGCT TGGACCTACC CAGGCTATGA AACAAAACCC ATGGAACGCT
GTGATGCACT TGGGCCATGG AAATGGTACT GTTTCCTTAT GGGCTCCCAA CATGGCTTCG
CCTCTAGCCA AGATGCTTTC GTGTAGAGGG CCAGTCAGAG ACGTGGCTAT AGACAGAGAA
GGCAAGTATA TGGCTGTGAG TGGAGCCGAC AAGACGTTGA AGATCTGGGA CTTGCGTAAG
TTCAAAGAAC TCGACCACTA CTTCACACCT ACACCAGCTT CTTCGTTGGA CATATCCGAT
ACTGGCTTGC TTTCCATTGG CTGGGGACCA CATGTAACCG TGTGGAAGGA CGTTTTCAAA
TCAAAGCAAT CTGATCCATA TATGACGCAT TTGATTCCGG GTTCCAAGAC GGAGAAGGTG
AAGTTCGTGC CCTTTGAAGA TATCTTGGGT GTGGGTCATC AGAACGGATT TAGTTCGTTG
ATCATTCCAG GCTCGGGTGA AGCCAACTTC GATGCCTTGG AGTTGAACCC ATACGAGACT
GCTAAACAGA GGCAAGAGCT GGAAGTCAGA TCACTTATCA ACAAGTTGTC ACCTGACACC
ATCTCACTTG ATCCTAATGT TATTGGAACA GTAGACAAGA GAGCCAACAG CATACGGTTG
AAGCCTGGTC AAATCGAAGA GCTAGATGCC GACAAAGAGA AATCGGAAAA AATGGAAATC
AGACCAGATG TCAGAGGTAA GAACTCGGCC TTGAGAAGAC ACTTGAGAAA GAAGGCACAG
AATGTTATTG ACCAGAGAAA GTTGAGAATC GAAAAGAACT TGAAGACGGA GAAGGAAGCT
AGACAGAGAC GACACAGAGA GTTCAAGGGT ATTCCTGAAG AGAAGGACCT TTTGGGCCCA
GCTTTGGCAA GATTCAAGTA GATTTTTGTT ATCATTAGCA TATATGCATA TATATAAGTA
ATTATATATA TACGAGTGTA CATTATTCAA ATTCCGTGGT TCAATATAT
 
Protein sequence
MAVAGKMGSH PRPGTEKYSR GYNKEQRRAH YSKDKKLNAG LKKLDQQHKE AMRSAAGTEL 
LLQEEVGFLE ADGPMEKTFK FKQDEITEVL DESTVNKKFE LKLPQLGPYT VDYTRNGRDL
LIGGKKGHIA SFDWRKGTLD CELHLNETVH AVKYLHNDQY FAVAQKKYTF IYDKQGTELH
RLKQHIDSTL LDFLPYHFLL ATAGNTGFLK FHDVSTGDLV SEFRTKLGPT QAMKQNPWNA
VMHLGHGNGT VSLWAPNMAS PLAKMLSCRG PVRDVAIDRE GKYMAVSGAD KTLKIWDLRK
FKELDHYFTP TPASSLDISD TGLLSIGWGP HVTVWKDVFK SKQSDPYMTH LIPGSKTEKV
KFVPFEDILG VGHQNGFSSL IIPGSGEANF DALELNPYET AKQRQESEVR SLINKLSPDT
ISLDPNVIGT VDKRANSIRL KPGQIEELDA DKEKSEKMEI RPDVRGKNSA LRRHLRKKAQ
NVIDQRKLRI EKNLKTEKEA RQRRHREFKG IPEEKDLLGP ALARFK