Gene PICST_29686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29686 
Symbol 
ID4836869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp872114 
End bp873478 
Gene Length1365 bp 
Protein Length454 aa 
Translation table12 
GC content41% 
IMG OID640388184 
Productpredicted protein 
Protein accessionXP_001382939 
Protein GI150864207 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCG AAAAAGCGAC TTGGGAAGCA CTACTCAAGC ATCCCTTATA TACTTGTTTG 
TCTGACATAC CTCCAGACCT CGACCAGTTT ACCTGGGACG AGATTGATTT TGTGCTAGCG
ACTTTGGACC GTGTCATCGG CCCAATTGCA CGTCTTCCTG TGCCCAGCAA GCATCTTCCC
GTGTACCTCC AGTTCTGTGA TGGCTGGAAC GAGATCTGTG AACAGTCCCG GGAACTCTAC
TTGGTTCATG ACATGCATAC CGAGAATGAA GAACTCAAGC TGCTTAACAC AGGTGACTTT
GATGAAAGCG AAAAGAGAAT ATACGGCACT ATTAACAAGA ACGTGACTTT ATGGTTGAGA
TTTCATTTGA ATTTGGATGA ATCCTACAAG AACGTCTACG AGTGTATTAA GTTCGCTACC
CAGAAAGCAG AAATCGAAGA TAGAAGAATC AAAAATAACA TGAGTCTAGA AGATGATCGC
AACGGTTTTA GGCTCGAAGA GTTGGCACAG TTCTTGCAGG TCTCAGTGAA GATGGAAGCT
TGGATATACC TGCTTCCAAA GATTACTTAT GAAGACGAGT ACACCACTAA CAACTATGAT
ATCGATTACT CTCTAAGCTA CATGGCATGG AAACAATTGT GTGAACAATG TCCGATTCTC
GACAAGTTAT GTTTAATAGA TTGCCACAAT GTAGAAATCA GAACAAAGAC CCGAGAAAGT
GTAACGGAAA GAGAGTTTTA CATGATGAAG AGAATTTCAG AAAAAGTCAT TCATGTGTTG
AAGATGCGGT TGGGCCTTGA CAAATTTTGC GACACCATTT ATGAGTGCAT TAAGTTCATA
AAAGACAGAG TAGACGGATC TTGTAAGTAC CAGCAGATTG AACTTCCAGA GGAGCTTTCC
GACCAGATAT CAGCTCAACA ACGTCATTAC ATTAGGCATA TGGTTATCTA CACCAATGAA
TTCTTAGACA AGCTTCCAGA TATTACTCCA GAGAGCCAGG AGTATGTACC ATTGCTTCAG
GGCTGGACAT ACCTCTGTGA AAGGAACCCT ACCTTGTACA CCATGGTTGG CATGAATAGT
GAAGACGACA GATTGAAAAA AATAGCCAAC GGCATCTTGT CTGGTAAAGA CCTTGAGATA
CTCAAGACTA TAGACGCCAT CTTGGCCAGA GACTTCCAGA AAAAGCTTGG TTTAGATTTG
AAGTTTAAAA ATCTATTTGA ATGTCAGCAG TACTTGGAAG AGAAATGTCT GGAAAAACTC
ATAGAACAGA AGAAGGAACT TAATACAGGG ACTCAGAAAC AGATTAAAGA AGCACGACCT
ATGATTAAAG AAATATTTGA AAGCAAAGAC GTATCAACAT TATAA
 
Protein sequence
MRREKATWEA LLKHPLYTCL SDIPPDLDQF TWDEIDFVLA TLDRVIGPIA RLPVPSKHLP 
VYLQFCDGWN EICEQSRELY LVHDMHTENE ELKSLNTGDF DESEKRIYGT INKNVTLWLR
FHLNLDESYK NVYECIKFAT QKAEIEDRRI KNNMSLEDDR NGFRLEELAQ FLQVSVKMEA
WIYSLPKITY EDEYTTNNYD IDYSLSYMAW KQLCEQCPIL DKLCLIDCHN VEIRTKTRES
VTEREFYMMK RISEKVIHVL KMRLGLDKFC DTIYECIKFI KDRVDGSCKY QQIELPEELS
DQISAQQRHY IRHMVIYTNE FLDKLPDITP ESQEYVPLLQ GWTYLCERNP TLYTMVGMNS
EDDRLKKIAN GILSGKDLEI LKTIDAILAR DFQKKLGLDL KFKNLFECQQ YLEEKCSEKL
IEQKKELNTG TQKQIKEARP MIKEIFESKD VSTL