Gene PICST_31732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31732 
Symbol 
ID4838827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1542725 
End bp1545934 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table12 
GC content36% 
IMG OID640390142 
Productpredicted protein 
Protein accessionXP_001384601 
Protein GI150865400 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACA TTCTACCTAT CTCTTGCATC TCCTGCAGAA GGAAGAAGAT AAAGTGCAAC 
AAGTTGAAAC CTTGCAACCA ATGCATTAAG AGATCCATAC CGTGCGAATT TCCTCCTACA
TTTAGAAATA TCAAAATCAA TGAAGAAGAA TTAGATTTCG CAGCCACTTC GAGCAACATG
AGTAGGGCCC TAGAAAATTT TCCTCAACAG GGATTTGCAG CTCCACAAAC TGGAACGATC
GAACATAATC TGGATGCTAC CGAAGCTAGT TTTGTCAGCC TAAGAAGTGA ATTGGACTTG
TTGAAAAACG AAAATTTGCA ATTATTGCAG GAGAATTTAC GGTTGAACCA ACAGATTTCC
CGAACAGGCA TCCCTAATAA TCCAAGTATG TTAAAATATC AACAGCCCGT GCCTATCGTT
CAAGAGGGAA GGTCAGAAGA CTTGCATGCT GCTCCGATCT CGATCCCACC AGATCAATCT
GAGACAGATG AAAAATTCTT CTTTCCGCAA TCTGACATTT ACGGGATTGA GGCTCAATTT
GCAAAACAAC GAAGAGCATC CCAAGCAAAA GAAAATAACA CAGATGGAAT TTCAAAGCTA
CAAAATGTTT CTTCAAGCAC TGGAAGTGAT ATTACTTCTC ACCAAGCTTT GAAAAAGGCA
AGGATGATGT TTGCAAATAC AGACGTATTA GATAATTCTT CTACTTCTTC CGAATCGAAA
ATTTCAGCTC AATGGGAAAA ATTAAATGAC GACTCAAGTA AAGAATCAAA GGGAAACAAT
TTAATCCAAC AAAGTTTGCT CAAGAAAAAA TTACCAACTT TAATTCTTGC TTTACTAAAA
TATGATGATC CAGATTTTAG CTCTGATTCG GACACATTTC AAGATTTGCA GAGATGGAAT
TACAATGTTA TAATTAGGCT AGTTGAACTA TTCTTCGAAA AGAATAATTA TTACGGCACA
TTTATTTCCC AACTGAAAGT TTTTGAATTT TTAAAGGCAT ACCCAAATTT AAAGGATAAG
GAATGGGAGT ACGATGATGA TTTGGTTTTG CTTTATTTGA TTTTGATACT TTCTGTTCAA
AAATTGACAC CCAAAGAATT CTTGGATCTT GAATTACTTC CAGCATCTTC TTCTAAAAAC
ATCCGAAAAT TCAGAAACTA CTTATCTAAG AACATATTGT ACAACAGTTT TGAAAAGCTT
CGGCATAATT TGATTAATGA GTCTCTGCTC ACTATTCAGG CTTACATATT ATGTACCGAA
TGGTTATTTC TTGAACATAA GTTTGAAGAA TGTTGGTCGA TGATGTTTCA TACTTGTTCA
ATTGCTTATT CTATCGGATT GCATGTAATT GGACAAGTGA AGAGAACTCC TTCTTTGAGT
ATTCCTGAGA GAATGTCCGC TGATAAAATA AATGAATCAG ATGAAACAGA CAATGAAGAA
GACGATGACA GAAGAAAAAT TTGGTTTGCT TTGAAGAATC TAACAGCTCA AATATGTTCT
GTCCTAGGAA GGCCAAATCC AATATCAATT CAGGTGAATG GTCAAATTTC TGAACTTGGA
AATCAAAAAC TTTCGGATAA AATAAACTAC ACAACATTAA AAATTGGATT AAGTGAGTGT
TTAAGGCTAT CAAATTTGAT GTTGATTGAA AACTTCATGA TTGACTTCAC TATTGAGGAA
GTTCTACCTT TAGAAGAAAA ATTTAGGGAA GAGATAGCAA AATTGGAACT GGTGCTTAAT
GATGATATAT TGAGGACAAA TAAAACACAA GGAGAGTCAG AGTGGAATAA AGTAGACCAA
ACCAATCTTT TAATGGATTT GATAACTCTC TACATAAACA GAGCCAAGTT ACTTGAACCA
TTTCTTAAGA AATTTGAAGA CCAAGAAAAG CACAACATTA TTATTGAAAG ATTTGTCAAT
TCCATACTTC AAAGTTTTGA ATTGTTAAAT TATTTTGTCG AGATTTTTTT GAACCAGTTT
TTTGAAAAGA ATACAATCCC TCCAAGGAAG AGATCAAGTG GTAATATACT CGCTCCAAAA
GATGAGAAGA TAGAGAATCA AAGTTCCCAA CAAGATGTGA ATTCTCTAAT AAAATTTGAA
AGGGTATTCC GTGTGTACTT TCCGTTTCTT ACGTCCTTCA TCTACCAAGG AATCATAGTG
ATTTTCACAT TTCTTCATTG CAAATTTAAA TTGTTTGTTA ATAATAATCC ATCTTCTTTA
TTAAACAACG AATTGCTCAA GCATATTGAG ATTAATTTGA ATACTTTGAC GAACTTTGAT
AGCAGGATCT CAACCAGATT GAACTGTATT TCTAAACTTT GGTCTGCCAA CATCAGATAT
TTGATTGATC GAGTTCTAAT TTACATCAAA ATGATTTACG AAAGACAAGA GGACAAGTTT
CTGCAGTTAT CAGAAAAGAA GAGAAGAAGG GTGCTGCAAA ATAATCTACT TTCAGGGGAC
CAAGATACAA ATCTTAAAGT ATTTGATTTC AATACTGGGA GACAAAGTGA GCAACCATTA
GAAAATACGG AGAATCCTTT GGAAAGTCCA GAGCTTGAAT ATTTGTATGG ATTCCAATTC
AACGATCCTT TTTGGTTAAC AAATCCAGAG AATTTGCCCT ATTACCTTAG TTCTCCAAGT
GATGACGATA AATACAACAA CAAGCTTACT CCTACAAAGA GATCACAACC ATCCACAGAC
TCAGCTATAT CTTATGGCGT TGGGTCTATG ACAGTTGAAC CATCGTTAGC CAAGCCAATC
TCCAGTCAGA TGCCGGTCCC AATACATGGA GATGGTATGT ATAGCGCACC TAATCTTTCT
ACTCAGCAGC AAAACTATGG AAACCTCTGG CCGAATAGTA GTGCCCCTCT GATTTCACAG
AATGTCATTC AAATTCTGCA ATCACAAGGA CAATCTTTCA CAGTTGGTTT CAACCAACAA
CTCTCGTTGT TGTTCAATCA GCCTACATTT GGTAGTCATA ATTCGGTATA CAGTGGCTCT
ACAGCTGGGC ATATTCCTTC CCAACAAGTC TCAGGTCAAA ACCCCCAAGT ACAACTTGCT
CATCAACAAG TTATCCACCC ACCAATTGCC GATCCCCAGA TTCAGGAATA CAGAATTCAG
GATCAGCACC AAGAACCAGT CCAGTCCAGA AATCCATTCC GGAACACTCT GCGAAACATG
GGCTCCTCTG GATCCGACGA TTCTAGTTGA
 
Protein sequence
MTDILPISCI SCRRKKIKCN KLKPCNQCIK RSIPCEFPPT FRNIKINEEE LDFAATSSNM 
SRALENFPQQ GFAAPQTGTI EHNSDATEAS FVSLRSELDL LKNENLQLLQ ENLRLNQQIS
RTGIPNNPSM LKYQQPVPIV QEGRSEDLHA APISIPPDQS ETDEKFFFPQ SDIYGIEAQF
AKQRRASQAK ENNTDGISKL QNVSSSTGSD ITSHQALKKA RMMFANTDVL DNSSTSSESK
ISAQWEKLND DSSKESKGNN LIQQSLLKKK LPTLILALLK YDDPDFSSDS DTFQDLQRWN
YNVIIRLVEL FFEKNNYYGT FISQSKVFEF LKAYPNLKDK EWEYDDDLVL LYLILILSVQ
KLTPKEFLDL ELLPASSSKN IRKFRNYLSK NILYNSFEKL RHNLINESSL TIQAYILCTE
WLFLEHKFEE CWSMMFHTCS IAYSIGLHVI GQVKRTPSLS IPERMSADKI NESDETDNEE
DDDRRKIWFA LKNLTAQICS VLGRPNPISI QVNGQISELG NQKLSDKINY TTLKIGLSEC
LRLSNLMLIE NFMIDFTIEE VLPLEEKFRE EIAKLESVLN DDILRTNKTQ GESEWNKVDQ
TNLLMDLITL YINRAKLLEP FLKKFEDQEK HNIIIERFVN SILQSFELLN YFVEIFLNQF
FEKNTIPPRK RSSGNILAPK DEKIENQSSQ QDVNSLIKFE RVFRVYFPFL TSFIYQGIIV
IFTFLHCKFK LFVNNNPSSL LNNELLKHIE INLNTLTNFD SRISTRLNCI SKLWSANIRY
LIDRVLIYIK MIYERQEDKF SQLSEKKRRR VSQNNLLSGD QDTNLKVFDF NTGRQSEQPL
ENTENPLESP ELEYLYGFQF NDPFWLTNPE NLPYYLSSPS DDDKYNNKLT PTKRSQPSTD
SAISYGVGSM TVEPSLAKPI SSQMPVPIHG DGMYSAPNLS TQQQNYGNLW PNSSAPSISQ
NVIQISQSQG QSFTVGFNQQ LSLLFNQPTF GSHNSVYSGS TAGHIPSQQV SGQNPQVQLA
HQQVIHPPIA DPQIQEYRIQ DQHQEPVQSR NPFRNTSRNM GSSGSDDSS