Gene PICST_32802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32802 
Symbol 
ID4840357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp666771 
End bp668906 
Gene Length2136 bp 
Protein Length711 aa 
Translation table12 
GC content41% 
IMG OID640391672 
Productpredicted protein 
Protein accessionXP_001385479 
Protein GI126137912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.955784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.581511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA CGAAGTCCAG AGGTAGAAGA GTCGAAAAGA GTGCTGATAA GCTTGAAAAG 
AAGGCTCAGT CTCTTTTCGA AGATACCAAG TCCACTGGCG AAGAAGGCGG AGATGACAGT
AACGAAATCG ATGCCTCATT GAAAAGTCCA TTTTTCGGTT TAGTAGATTC CAACGAATTG
GACTACTTCA AACAAGCAGA GTCAACATTG AACGTAAATG CGTTTGATAG TGACGAAGAT
CGTGAAGGGT TCATTAGATC TGTTTTGGAA GAAGCCAGAG GAAAGGAATT GAAATTGGTC
ACCAACCAGA TCTGTTCCAA GTTGATGGAA AGATTGATTT TGTTTGCTAG CGATAGACAA
TTGAAAAACA TATTTGGCCA GTTTTCGGGA CATTTTGTAG CATTGGCCCA TCATAAATAC
TCTTCCCATG TATTGGAAAC TTTGTTGGTG AGATCTGCTG CCTTGATCGA AAAGGAGTTG
ATCCATGATG ACAGCAGTCA AAATGAAGAG GAACGGGAAG AACAGGAAGA AGGAGAAGTG
ACAGATCCTA TGGAAGGTTT GTTCATCAAG ATGGTTGACG AATTCAAGCC TCATTTGCAA
GGAATGTTGG AACACCAATA CTCATCGCAT GTTCTCCGTT TGCTTATCTT GATTTTGGCA
GGTAAGGAAT TACCTTCTAC AACTACTTCC AACTCTACCT TGAGATCGAA AAAGTCCAAG
ATCGCCAGAA AAATGATTGA AATAAAAGAT AACCAAGACT TCAACAAGTC ATTCCAGACA
CCATCCTCGT TCAAGATTCA ACTAAGAGAA CTCTGTAATT CCGTAAGCAA CAACCAAAAT
AGCAAACGTA TGAGAGAACT TGCTATACAC AAGATCGCAT CTCCAGTTTT GCAATTACTT
ATTCAAGTTG AAGGCTTGGT TGATAGAGAT AGAACCTTCT GGCACTTGAT ATTCTTAAAG
GATTCGGAAG ACAAGAACTC TCAAGAAGAA GCCTTCGTGG AATACTTGTT GTCTGACTCT
GTTGGTTCTC ATTTCTTGGA AGCAACTATC AAGAATGACG GTGCCAGAAT CAAATACATT
GAAAGATTAT ACAAGTTATA CATGGAGGAT AGAATCTTAA AGTTAGCAAA GAGATCGACT
ACCGGTGTTT ATATCATCCA AGCCTTGTTG TTCAAGTTGA AACCAGTGGA CGTTGAACAC
ATTCTTGATG AAATCATTCC CGAGTTGTCC AATTTGATTT CCATTTCCGA GAACCAAAAC
TTAGACTTAG GTCAGAGATT AATAGATGCG TCCATCTCCA GAGGTAACTA CAGAAGAGAT
GAGATCATCG AGCAATTGTT CTTGAAGTTT GCTCCTAACT ACAATGTCCA AGATCCACAA
CTCAAAACCA CCTCCGAGTT CATCGAAAAC GTCTTGCAAT TGACAGGCTC AACTTTGGGG
AATACTCGTG ACGACTGGCC AACGGCAGAA GAAAGAAGAA GATCATTTTT CTTGGAAAAG
TTGATGGAAT ACGACTACAA GTTCGTGATA TGTGTGTGGT ATAACTTCTT GGCTTTGCCA
GTAGAAAGAT TCATCCAGAT GTGTTTCCAC GGCGTTTTTT CTCATATTGT AGAACGTGCT
TTAGTGGTTA TACCATCTTC TGAAGGTGAA CCAAAGCCCG TTTTGATTCT CAGAAAGAGG
GTTTTGAATC TTTTTAAAGA TCAAATTGTC AACATGTCGT GCAACTCTTA CGGATCCCAC
ATCGTTGATG CATTGTGGAA CTTTTCTGTG TTGTTACCTA TGTATAAGGA TAGAATTGGC
ACGGAATTGC AGGGAGACTC GCATAAGGTC AAGGAAAGTA CCTACGGTAG ATTGGTGTGG
AAGAACTGGT CCATGGAATT GTTTGTTAGA AAGAAGTACG ACTGGAAGTC GTTGATCAAG
CAACAAGAGC AGGCATACTA TGGTGTGAAT GACGAAAATG GAACCACTTC AAGAGTCAAA
AAACCAATTG AATTGAAGAT GGAAAAATTG GCTGAAGAAA GGAGGTTGCG TGAAGAAGCC
GCAGCTAAGT CTGAAAGTGG CTACAAGAGA CGACACGAAG ATGATAACGA GGATGACTAC
GCTAAAAAAC AGAAGCTTAG AGGTCGTAGA AGATAG
 
Protein sequence
MAKTKSRGRR VEKSADKLEK KAQSLFEDTK STGEEGGDDS NEIDASLKSP FFGLVDSNEL 
DYFKQAESTL NVNAFDSDED REGFIRSVLE EARGKELKLV TNQICSKLME RLILFASDRQ
LKNIFGQFSG HFVALAHHKY SSHVLETLLV RSAALIEKEL IHDDSSQNEE EREEQEEGEV
TDPMEGLFIK MVDEFKPHLQ GMLEHQYSSH VLRLLILILA GKELPSTTTS NSTLRSKKSK
IARKMIEIKD NQDFNKSFQT PSSFKIQLRE LCNSVSNNQN SKRMRELAIH KIASPVLQLL
IQVEGLVDRD RTFWHLIFLK DSEDKNSQEE AFVEYLLSDS VGSHFLEATI KNDGARIKYI
ERLYKLYMED RILKLAKRST TGVYIIQALL FKLKPVDVEH ILDEIIPELS NLISISENQN
LDLGQRLIDA SISRGNYRRD EIIEQLFLKF APNYNVQDPQ LKTTSEFIEN VLQLTGSTLG
NTRDDWPTAE ERRRSFFLEK LMEYDYKFVI CVWYNFLALP VERFIQMCFH GVFSHIVERA
LVVIPSSEGE PKPVLILRKR VLNLFKDQIV NMSCNSYGSH IVDALWNFSV LLPMYKDRIG
TELQGDSHKV KESTYGRLVW KNWSMELFVR KKYDWKSLIK QQEQAYYGVN DENGTTSRVK
KPIELKMEKL AEERRLREEA AAKSESGYKR RHEDDNEDDY AKKQKLRGRR R