Gene PICST_30446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30446 
Symbol 
ID4837900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp76216 
End bp77760 
Gene Length1545 bp 
Protein Length514 aa 
Translation table12 
GC content46% 
IMG OID640389215 
Productpredicted protein 
Protein accessionXP_001383296 
Protein GI150864470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.474759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGTG AGGACGCGTC TGTTCCAAGA TCGCACGATT CGGCTGATGC CAGCCCCAAT 
TCCACAGCAA AAGAGATACC TGTGCCGCAA CCCCCTCTAT TGAACTTGAA CTTGAATTTG
AACATGAATA GCGGCTTCAA TCTATCCAAC TGGTGGCACC AGATCACACT GATTGAAGCT
GACAGTAGTG ACGACCGCAA CAACAATAAT AGTAACAATG GCATCAACGC TGCAGAGTCC
GATTCAGCCC AGTCTCAACC TATTGTAGTC TTGGGCCATT CATACCAAAC GACCGAAGAA
GCTCACGAGG ATATCATCAA GAAGTTGTGT CTCACATATC GATATGGCTT CGAGCGGATA
CCCCGGGCTG TTAATGGTCC CAGTCCGTTG TCGTTTATGC AATCGGTGAT CTTCAGTAAG
AGTCTTCTCT ATAATCTCCA GAACTTCAAC AACTTCATCG AAAAGGAAAA CTTCACTACA
GATGTAGGAT GGGGGTGTAT GATACGCACT TCACAAAGTT TGCTCGCCAA TACCTTCGTG
CGTTTGCTAG ACAAACAAAG CGACATTATC GCTCTCTTCA ACGATACCTA CTTAGCACCG
TTTTCATTGC ACAACTTCAT TCGTGTCGCC TCGTCACTGC CATTGAAGGT CAAGCCTGGC
GAATGGTTTG GTCCCAATGC TGCATCTCTC TCGATAAAAC GTCTCTGCGA TGGCTATTAT
GATAATTCGA CGTCAGAGAC GATCTTACCA CGAATCAATG TGCTTATCAG CGAAAGCACT
GATTTGTACG ATAGTCAGAT TGCCCAGTTG CTTGAGCCAA GCACCGAGAC CAAGGGCTTG
TTGGTACTCT TGCCCGTCAG ATTGGGTATC GACAGCATTA ATTCTTATTA TTTTTCAAGT
TTGCTCCATC TTCTTTCGCT TGAGCAATCT GTAGGAATCG CCGGAGGCAA GCCGTCCTCG
AGTTTCTACT TCTTTGGCTA TCAGGACAAT AGTCTCATCT ACATGGATCC ACATTCAGCT
CAGATATTCA GCAGTGACAT TGATATGAGC ACCTACTACG CCACACGATA CCAGAGGGTT
GACATTGGCA AGTTGGATCC GTCTATGTTG ATTGGAGTGT TCATTCGTGA CTTGACACTG
TACGAAAATT TCAAAAAGAG CTGCCTTGAT GCCGCGAACA AAATTGTCCA CTTCCATGCG
ACGGAGCGTC TGACGGTACC TGAGTCCAGA CGAAAGAACT CCGAGTTCGT CAACATCAAC
AGAAGCGATT TGAAGGACGA AGACTATATC AATATCGACA GAGTCAACCG CTTGGACAGC
ACTGACGACT TCATTGACTT AGGCGATGAC TATGTAGAAA CCAACACGAA CTTGGAAGAA
GCTACTCCGC TGGCGGAAGA TACAGTTCCT GTTTCTACAT TAAGTGCCAG CGAGCTGGAG
ATAACAACAT CTTCATACGA AACACCTACT TCAAAGGATG ACAACAGTTC CAGAGCAAGC
TTGGACGTGG TAGTGCTCGA CACGACAGGT GAACAACAGG AATAG
 
Protein sequence
MAREDASVPR SHDSADASPN STAKEIPVPQ PPLLNLNLNL NMNSGFNLSN WWHQITSIEA 
DSSDDRNNNN SNNGINAAES DSAQSQPIVV LGHSYQTTEE AHEDIIKKLC LTYRYGFERI
PRAVNGPSPL SFMQSVIFSK SLLYNLQNFN NFIEKENFTT DVGWGCMIRT SQSLLANTFV
RLLDKQSDII ALFNDTYLAP FSLHNFIRVA SSSPLKVKPG EWFGPNAASL SIKRLCDGYY
DNSTSETILP RINVLISEST DLYDSQIAQL LEPSTETKGL LVLLPVRLGI DSINSYYFSS
LLHLLSLEQS VGIAGGKPSS SFYFFGYQDN SLIYMDPHSA QIFSSDIDMS TYYATRYQRV
DIGKLDPSML IGVFIRDLTS YENFKKSCLD AANKIVHFHA TERSTVPESR RKNSEFVNIN
RSDLKDEDYI NIDRVNRLDS TDDFIDLGDD YVETNTNLEE ATPSAEDTVP VSTLSASESE
ITTSSYETPT SKDDNSSRAS LDVVVLDTTG EQQE