Gene PICST_80114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80114 
Symbol 
ID4851434 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1805749 
End bp1806836 
Gene Length1088 bp 
Protein Length297 aa 
Translation table 
GC content42% 
IMG OID640393142 
Productpredicted protein 
Protein accessionXP_001387982 
Protein GI126274562 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.301364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGTGT GGGCGTTGTT TCGAGTCATC CTCTTGATCC GGGCGGTGCT TGCAGTCGCC 
AGCACTGGAT CAAAAGGCCT AAATCAGCAT ATTCGTGAAT TCGGCGATTC AGCCATTGAA
GATAATCTAA AGCGAGCCAG ATATTCTCTA ATTTACTTCT ATAGAGATGC GTGTCAGTAC
TGTGATAAGT TCAATCCGGA CTTTGAGAAC TTGAGCGTAC TATTCAATAA CGCTAGTGAC
TCTGGGGAAG GCGAGAACAG TATTATTCAG GTTATCAAGA CGAATGGGAA GGTCAACCCC
AGATTGAACC AGCTTTTCAA GGTTCAACTG TATCCGACCT TGAAGCTTTT GGACTTCAAG
ACTATGGAAA TATTCACATA CACGAAAAGA AAAAGAGATA TTCTCTCATT ACTTGAGTTT
GTCAAGGAAA AAGTGCCAGA CGCAAAGCCC AACTATAAGA ACTTTGTCTC CAAAGTCAAA
TACTTGGATA ATGCTAGCTT TGATGACCAT GTCAAACAGC TGAAGAAGGA TACGTTGGTG
GTTTTCACTA TGCCATATAT GGACGACTGG ATCAACTACC AATATCCTGC TCATTTCTAT
CAGCAATTGG CCGATAGAAT GTCTAGCGAT GAACGTAACA TTCAATTCTC TCTTGTAGAT
GCTGGATCCC AAGCAGCCAG TGATGTAATA GCTGGACTAA AGATCAGCAA CTTCCCATCT
ATAGTCTATT TCAAGGGAGA CGGTAGAGTC AAAGCTTATG GAGTTTATGA CCAGAACCAA
GTAATGCATG GGATATTGAG TGAAAAAACC TTGGACAGCT TCATAGACAA TATAGATTCT
GAAGAACATG GAAAATGGTT TGAGTCTGTT GAGAAGATGG TAGAGTCCAG GGAAGAGTCT
ACGGAGTACG ACGGAAACTT GCACTACAAG CCAGGATTCA ACGTGAGACA GGATAATCGA
AATGGAGAGG ATGAAGAGGA GCAGTATAGA CAGCTCTTGA GAGAGGTAGA GTTGTAATGC
TACAATGTAT ATAGATATCT ATACCTGGTA TATAGCAGAA GAACTTCAGG GCATATTTGA
CATTTTCC
 
Protein sequence
MQVWALFRVI LLIRAVLAVA STGSKGLNQH IREFGDSAIE DNLKRARYSL IYFYRDACQY 
CDKFNPDFEN LSVLFNNASD SGEGENSIIQ VIKTNGKVNP RLNQLFKVQL YPTLKLLDFK
TMEIFTYTKR KRDILSLLEF VKEKVPDAKP NYKNFVSKVK YLDNASFDDH VKQLKKDTLV
VFTMPYMDDW INYQYPAHFY QQLADRMSSD ERNIQFSLVD AGSQAASDVI AGLKISNFPS
IVYFKGDGRV KAYGVYDQNQ VMHGILSEKT LDSFIDNIDS EEHGKWFEMK RSSIDSS