Gene PICST_30020 details


Gene Information

Locus tag: PICST_30020
Symbol:
ID: 4836831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1748662
End bp: 1749876
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 12
GC content: 38%
IMG OID: 640388146
Product: predicted protein
Protein accession: XP_001383104
Protein GI: 150864332
COG category:
COG ID:
TIGRFAM ID:
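
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1748662
end_bp = 1749876

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.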


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.201529
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.620116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAG AAACATGCTA CGGCCAACTT TCATCGATGA TGTCAAAACA GCTCAAAGTA 
GTAGACAGTC GCATCAAATC AAGTACCAAG GAGTTGGATT GGAATTATCT TAAAAGAACT
CCCCAATTTC TCAAACAGAG AATCAAATTT GGAAATGGCT CATATACATC TGATGAGGAA
TTCAATTATT TAGAGAAGCA ATTTAAGGAA ACGGAAAAGA CACTTCAATC GGTAGAAAAG
TATGTCAAGA TGTATTCCAA AAGTACCTTA ACATTGTTGG ATAGAAGTAC AGCGGTTGGC
AAAGGTTATT CTCTTCTCTA TGATCCATAC GAAGATCTAG CGAAGAAGAC TGGTGAACAG
TCCAATTCCC AAGTTTTCGA AGACCAATAC AGGAAATGGC AAAACGTGAA CAACTACATT
GAAACAATAA ATATGTGCAA ATCTGAAATT GAGAATGAAA CCAAAACCCT TGCAGCTATG
GTTGAGCTGA AAATCCAAGA AATATACTCA AATATAAACA ATATTCACAA GAAGATAAGA
GTCAGGTCTT ATGCCTTGGT AGACTACGAC AAAGTCTACA ATAGTCATGA CAACTTGCTT
TTAAAGCAAA AGTCTGGAGA GCTTACTGTT AAGCAGTCTC AGCAATTGTT CAGTTCAGAG
AGAAAATTGG AAGAGAATAA GGTCAAGTTT GATGAGATTA ATGATCTCTT AAAAAAAGAA
TTACCCTATT TCTTGAAGTT GGTGGAATTG ATTTTGACCC CTTTGCAGGA GTACGTCTAC
TATGTCCAGC TCATGAACTA CTTTCAATTT TCAAGCAGAT GCAAATCGTA TGCCAATTTC
ATCAATTTGG ATGTCAGAAT CATTTCGTCT CCCAATTTCG CAGATGAGCT TATGACTCAA
AATTCCATTG ACAGTTTGGG AGCTTATGAC TCCATTAACC AACTCACTCT AATCAACTTT
AGAGACCGGT ATCTTTCTGA TATTACTTTG GCTTTAGAAC CAGAGAACAG GAACCCCATT
CTCAAGGAGA CCAACTCCTA CTACTACATG GCAAAATTTA ACTTTGAGGG ACAGCAGGAA
GGTGATCTTT CATTCAAACA TGGTGATATT ATCACGGTGT TAACTAGGAA TGGAAACTGG
TGGAAAGGGG AGTTGGATGG AGTAGTTGGA ATCTTTCCAA GGAACTACGT AGAAGAATAT
TCGCCCAGGG ATTAG
 
Protein sequence
MSEETCYGQL SSMMSKQLKV VDSRIKSSTK ELDWNYLKRT PQFLKQRIKF GNGSYTSDEE 
FNYLEKQFKE TEKTLQSVEK YVKMYSKSTL TLLDRSTAVG KGYSLLYDPY EDLAKKTGEQ
SNSQVFEDQY RKWQNVNNYI ETINMCKSEI ENETKTLAAM VESKIQEIYS NINNIHKKIR
VRSYALVDYD KVYNSHDNLL LKQKSGELTV KQSQQLFSSE RKLEENKVKF DEINDLLKKE
LPYFLKLVEL ILTPLQEYVY YVQLMNYFQF SSRCKSYANF INLDVRIISS PNFADELMTQ
NSIDSLGAYD SINQLTLINF RDRYLSDITL ALEPENRNPI LKETNSYYYM AKFNFEGQQE
GDLSFKHGDI ITVLTRNGNW WKGELDGVVG IFPRNYVEEY SPRD
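
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on two fragments copied from the gene sequence above. It is a hedged illustration, not the database's own pipeline: the helper `translate` and the table-building code are hypothetical names, and the codon table is built from the standard genetic code with the single CTG reassignment.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest,
# matching the usual compact layout of the standard genetic code.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

STANDARD_TABLE = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD_TABLE)
TABLE_12["CTG"] = "S"


def translate(dna, table):
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (no CTG codon in frame).
first_row = "ATGAGTGAAG AAACATGCTA CGGCCAACTT TCATCGATGA TGTCAAAACA GCTCAAAGTA"
# Start of the ninth row, which contains an in-frame CTG codon.
ctg_fragment = "GTTGAGCTGA AAATCCAAGA"

print(translate(first_row, TABLE_12))        # MSEETCYGQLSSMMSKQLKV
print(translate(ctg_fragment, TABLE_12))     # VESKIQ
print(translate(ctg_fragment, STANDARD_TABLE))  # VELKIQ
```

The table 12 output for the second fragment matches the `VESKIQ` stretch in the protein sequence above, whereas the standard code would give a leucine at that position.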