Gene PICST_51784 details


Gene Information

Locus tag: PICST_51784
Symbol:
ID: 4851046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 803824
End bp: 805956
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table:
GC content: 39%
IMG OID: 640392754
Product: hypothetical protein
Protein accession: XP_001387382
Protein GI: 126274032
COG category:
COG ID:
TIGRFAM ID:
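
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 803824
END_BP = 805956
GENE_LENGTH_BP = 2133
PROTEIN_LENGTH_AA = 710


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, the reported gene length (2133 bp) and protein length (710 aa) agree with the listed coordinates.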


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.672165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.613282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAAG TCAAGAGCCA GGCTCCAAGG GCCTCTACCA GAAGAAGGCA CACAAACTCA 
AAGCTTGGAT GTCTAAACTG TAAACGGAAG AAAATTAGAT GTAATGAAAG CTTGCCCATA
TGTGGAAACT GCGCTAAGGG TAAGAAGGAA ACATGCTCGT ATTTGAGCTT GGACCAGCAG
GAAATCGATC GAATTCGCTT AACCCATTCA CTCAGAGATA GTCAGAACAA GTTGCTTAAC
CTGAATTACA GATTGCCTAC CTCTTCCAAT AAAGACTCAC ACAAGTTCAA AAAAGTATCA
CCAGTTACTT CTGCCACAGC ATTGGAGTTC AAGTTTGAGC TTATGAAGTT GCCGTTGAGA
ATCCCTACCT TTGCATACCC TCCACTCCAG TTCAACAATC TATCGATGAA CAACTTTTCC
AACGAGTTCA GGGTCATCAA CGACTTCGAC ACCAGTGACG ATTCAGGTTC TTCTCCTCAA
AGTATCGTGG AGAACAGAAT GTTACTGGGA GGCTTTGTGC AGCCAACGTC TTTCACGAAG
TTGGACTTCA AAAGCAAAGT TGTTTGGGGT TCTAGTCCGT ATAAGAACAA ATTGAGAGCT
CCATTACAGT TAGATATAAA CATATTCACC GGGATGGAAA GCGTCGCCGA CCACCTCCGC
AGCTTCATAA TGTCCATAAA CAGCAATAGC CCTCATACTG ATGTGTTATT TAATACGTTT
GTGTGTTTAG GACGCACGAT TATTCTGACC TATTTCCAGC ACATTAAACA GTATGATTCT
CATTACCAGT GGTTGCAAAA CTATATTCCT TCATTTGAAT TACGTTGTCT TGAATCGCAC
GCATTGGCTC TTGCAAAGTT GAGAAGGGCC ATTAGCCATT TTCATTCCAT TGAGGCAACG
AAGGAAAATG CTGAAGAAAT AGAGTACTAC ACTACAATGC TCGGCTACTG TTACAGCTAC
TTGACAGCTG TTACTTTCAT GTTCAAATTT GAAAGCGATA GATACTTTAA TTCTTCAAGG
GGAGCGTTCA CAGTCTTTCA GATATATTTC AAATTTACAA ATGACCATAA CTTAGAACCC
AGTCCATTAT TCAAATTCCT TCTCAAAAAT ATCCGTATTA ACATATTGAG TGTGAACATC
CCGTCATACA ACCCGCATTT CTTTTACGAA CTTAACGACA ATTTCAAACT GTTGGGATTC
ATTTTCCAGA ATCAGAACAT TAAATTCGAG GATCCTGAAT TATCCAAATC TTATGATCGA
GTGTGTTATC GTTACAAGAC ACTCTTAACC TATTTGGAGG AGGAGATTTT GCCCCATATG
ATTCTGAAAA GAAACGAAAA GTTTGTCTCT ACTTATCCAC CGGAGGTAAT TTTTGGAGGG
TTAAAAAAAT GGCTCAGCAA TTTTCCTTCA GAAACTATAA CTTTCAGACC CATTTGCAAT
TCTGAATACC CTGAAGAGTG TGCATTTATT AACGATTTGA GCACAACGTT ATACCTATAC
TATTACTCTA TCGCCTCGGC ATTAGATTCT GTGTTCCCTG CAAGCAAGTA TTTATTTGGT
ATTTCTTTTC AAGTGCCAGT GAATTGTCAC TTTTTTGATC GCAATATTAC AACAATTCAG
AAAGAGAATA CATACCATCA GAAATTGTTT GATCGTCGGA TTGATAATTT ATTACAGAGG
CATATATATT TTTCGTTGAG ATTATTTTCT TTCTTCAGAA GGAGACATAA TTTCTACCAA
GATCATTTGA CGTGGTCCAA TCCTTTCACA GAACAGTTGA GAGATGACCG TTTTGCATCG
AGATCTGTCA AGGATTCATT TGAGACTCCG ATCAAGAGTT TCAACACGAC ATTAATTAGA
CCAGAGCATT ACCCCACTAA AAGAAGCGAG CTGGAAAACT CCACATTCAC CCGAACTGAC
GACACAATGG TACAGAAAAT GTACACGAGA AATATTGAAA CTCTTGACTT CTTCAATGAA
GGCTCCATAC TACACTTTGA CTACGAAACA CTACTTTTGT TACAGGATTA TAGACCAATT
AGTGAAAATT CCGAATCGAG TGTCCAGATC GATGGGAAAA TCTTGAGAGA GTATTTTGAA
GATAAAACGA TTATTCTCAG CAGTCTACAT TAG
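
The record reports a GC content of 39% without stating how it was computed. The sketch below shows one common way to derive such a figure, assuming it is simply the share of G and C bases in the gene sequence. The function name `gc_percent` is hypothetical, and the 10-base space-separated blocks are handled by stripping whitespace first.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; pass the full
# sequence to compare against the GC content field of the record.
first_line = "ATGACTAAAG TCAAGAGCCA GGCTCCAAGG GCCTCTACCA GAAGAAGGCA CACAAACTCA"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```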
 
Protein sequence
MTKVKSQAPR ASTRRRHTNS KLGCLNCKRK KIRCNESLPI CGNCAKGKKE TCSYLSLDQQ 
EIDRIRLTHS LRDSQNKLLN LNYRLPTSSN KDSHKFKKVS PVTSATALEF KFELMKLPLR
IPTFAYPPLQ FNNLSMNNFS NEFRVINDFD TSDDSGSSPQ SIVENRMLLG GFVQPTSFTK
LDFKSKVVWG SSPYKNKLRA PLQLDINIFT GMESVADHLR SFIMSINSNS PHTDVLFNTF
VCLGRTIILT YFQHIKQYDS HYQWLQNYIP SFELRCLESH ALALAKLRRA ISHFHSIEAT
KENAEEIEYY TTMLGYCYSY LTAVTFMFKF ESDRYFNSSR GAFTVFQIYF KFTNDHNLEP
SPLFKFLLKN IRINILSVNI PSYNPHFFYE LNDNFKLLGF IFQNQNIKFE DPELSKSYDR
VCYRYKTLLT YLEEEILPHM ILKRNEKFVS TYPPEVIFGG LKKWLSNFPS ETITFRPICN
SEYPEECAFI NDLSTTLYLY YYSIASALDS VFPASKYLFG ISFQVPVNCH FFDRNITTIQ
KENTYHQKLF DRRIDNLLQR HIYFSLRLFS FFRRRHNFYQ DHLTWSNPFT EQLRDDRFAS
RSVKDSFETP IKSFNTTLIR PEHYPTKRSE LENSTFTRTD DTMVQKMYTR NIETLDFFNE
GSILHFDYET LLLLQDYRPI SENSESSVQI DGKILREYFE DKTIILSSLH