Gene PICST_31744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31744 
Symbol 
ID4838703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1578458 
End bp1580476 
Gene Length2019 bp 
Protein Length672 aa 
Translation table12 
GC content36% 
IMG OID640390018 
Productpredicted protein 
Protein accessionXP_001384257 
Protein GI150865155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.743293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.568196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTGT TATCAATCGA GGATCTACTT CGAACTCTTC CCGAATACTA TGTAGAAGAG 
ATAGTGAATG GTATTCCTTT TTCCACTGTA TGTGCTCTCG TTTCCAACGG AACATCGCCG
TACCAGAAGT TCTTTCTCAA TAGAGTCTTC CGAGATGTTA TGGTTTTAGA GGCGATGCCT
GACGCTTCTC CAGTTTGTGA TATAGAATCT CTATATTTCG ACTTTTTCTT CAAAGACTTT
GACGATAGAA GACAAAATAC AGTTTCAATT GGTGGGTACG ATACCTTGGT TCAATTTGTT
GCCAGTTACC CAAATATTCG CCTTGGAAAG GTTGAAATTC AAACTGACCA CAACACGGAA
GACATTTTAA ATCTATTGGA AGCAAATAAT ATAACCATAA ATAAATCACC TCGACACGCT
GGAATCGATA TTAGTTTCCC GGAAACCCTA TATGGACTAC ATCATTTTGA AATAATACAA
AACATTCCAG AGAACCTACA GGAGCTTGTA CTCCAACATG TTAACTTTCA AAATATTGAT
ATATTGGAAT ATATTAAATC ATTACCTTCA AGACTAAAAA TTCTAGAAAT TGACTGTGTG
TCCACATCGA TAAATCCAAG GCACCTAGAA TTGTTACCAA AGTCTTTACG AAAGCTTTGT
TTGAGAAGTA CTAGTTTAGA AGGGGATGGA ATAATCAAGA CACACATGCC ACCGTTGTTA
GAATCTATTG AACTACATTT GCAGAATGTT GGTGATGTAT GTCTAGATAT ATCTCATTTG
AAGAAATTAA TAGAAGTCAA ACTCCTGACT TGCCTTTTAT TCAAGTTACC GGAGCAGTTA
CAGAGGCTCT TCTTGGAGAA TTTAGACGGT TTCAAAGATT TAAATCGGCT ATGTGAACTA
AAGAAATTAC GCTATTTATC AGTCACTGCG TTGTCATCCA ATATCCTTGA TGAAATTGTT
CTTCCCAAAT CCTTGCAAGA ATTAGAAGTC CAGAATCCTT TTCAAGAATA CGAATTACCA
ATGAGTACTC TAAATCTAAA ATTTGACAAC CTCCATGAAT CTGTAAGTTT CAAAGACCTT
TCGCATATCA CAATGTCTGC AGCTGACTAT TCTAAATTTG TACTGGGATC ATTGGCCGAA
AAATTGACCT CTTTAACTAT ACTCAATCAG CATAGTTTAC CGAAAAAGTT TTGGACAGAT
ATTGAAAAAT TAAAAGAGTT GAGGAAATTA TCAATAACTA AGTGTGAAGT TGAATCAACT
CCAAAGTACT TGCCACCCAA ATTAATAATT TTAGACCTTT CTCAAAATAG GATTTCCAGT
ATTGCCATTT CTGGAACGTT GAAGAAACTA GTTCTAGACA GAAACGAGTT CACCACTATA
TCCAATGCTA CCCTAGCGTT ACCTTCTACT ATTTGCGAGT TGTCGCTACA GTCCAATCGT
ATATCATCAC TAGAGGAAGG GTATGCATTT CCTAAGTGTC TTCAATTATT TGATTTACTT
GATAATGAAC ATTGTCCAAT TGAAGACATT CTTACAAACT TACCTCCCCG AATAGTGCAA
TTGAGGTTAT CCACTAATAA GAAGAAAAGC TTTCCAAATA AAACAAGCCC CAGTACTGCT
GAATTACAAT CTGCCACTAA AAACAAATAC TATAGAAGAG AAAAGCCTTT ACTTAATGTA
ACAAGTAGTA CTTTATGGAA AGTCTATCTA GGTGGAGTAA GAGATCATTA TTTGGACTCA
GAGTTGGTGT GGACTGGTTG CCCGAATTTG CAGTGCCTTG AAATCAGAAG TATTGATTTG
GGAAGTATTC TGCTTAAGAA TTATCCATCT TCTTTGAAAA AATTAGTGAT GCTCAACACT
AACATAAGTC AAGTAGAAGG GGATTTCCTC ACTTTACCGA GCTTGATTGT TGCCAGCTTA
GTGGATAACC CATTGGAGGA ATGGTTAGAG AAAAACAAAG ACCACATTCC CCTGAATGTG
AAGTTTAGTT ATTTCCAAGA TATATCATCA TATTGGTAA
 
Protein sequence
MNLLSIEDLL RTLPEYYVEE IVNGIPFSTV CALVSNGTSP YQKFFLNRVF RDVMVLEAMP 
DASPVCDIES LYFDFFFKDF DDRRQNTVSI GGYDTLVQFV ASYPNIRLGK VEIQTDHNTE
DILNLLEANN ITINKSPRHA GIDISFPETL YGLHHFEIIQ NIPENLQELV LQHVNFQNID
ILEYIKSLPS RLKILEIDCV STSINPRHLE LLPKSLRKLC LRSTSLEGDG IIKTHMPPLL
ESIELHLQNV GDVCLDISHL KKLIEVKLST CLLFKLPEQL QRLFLENLDG FKDLNRLCEL
KKLRYLSVTA LSSNILDEIV LPKSLQELEV QNPFQEYELP MSTLNLKFDN LHESVSFKDL
SHITMSAADY SKFVSGSLAE KLTSLTILNQ HSLPKKFWTD IEKLKELRKL SITKCEVEST
PKYLPPKLII LDLSQNRISS IAISGTLKKL VLDRNEFTTI SNATLALPST ICELSLQSNR
ISSLEEGYAF PKCLQLFDLL DNEHCPIEDI LTNLPPRIVQ LRLSTNKKKS FPNKTSPSTA
ELQSATKNKY YRREKPLLNV TSSTLWKVYL GGVRDHYLDS ELVWTGCPNL QCLEIRSIDL
GSISLKNYPS SLKKLVMLNT NISQVEGDFL TLPSLIVASL VDNPLEEWLE KNKDHIPSNV
KFSYFQDISS YW