Gene PICST_18070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_18070 
Symbol 
ID4840720 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp656897 
End bp658375 
Gene Length1479 bp 
Protein Length486 aa 
Translation table12 
GC content42% 
IMG OID640392035 
Productpredicted protein 
Protein accessionXP_001386319 
Protein GI150866652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.235503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.132963 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCA TAGGTGAGAT TGTTGAGCAT GAAATAGAAG CTCCATCTCT GACTATAGAC 
GTTGAAGTTG GCGGATTTCC AGATCCATCT AAGTCAAGAG AGAAGAAAGT CTCTAGATGG
AAGAAGAGAG TACAGAAGAA AGGAACGGAG GATGCTGCCA AACCTCTTCG AGAATCTGCA
AATAAGGTGA AAAATAAAAA TGTAGTAAAT GATCTTACGG AAGCTGAAAA GATTCACCAG
GAGAATATGG ATAAAATTGC GAGTATGACA GAAGAAGAAA TTACTCACGA AAGAGAAGAA
CTCTTACAAG GGTTGGATCC TAAGTTGATC CAGAGTTTGC TTAAAAGAAC TGAGTCTAGA
ATCCAAAAGC AAGCCGATCA TGTTCACGGA GACAGTTCCA ATCATGGTGA CCATGGACAT
GAACATTCAG AAGGGTATAA TGGGTGGATA GGAGGAATGA GAACCAAAGA AGGCATGTCT
GATTTGTCGC AGTTGGATCC AGAGGATGTA GACAAAGCTT TGGGAATCAA ACAGCTAAAC
ATAAAAGATG ATTTTGGTGA CACTGTGGTA GAAAAAGATA TAGAAAAAGA CAATAGCCTG
AAAGCAAGCG AAAAGCCTAA AAAGTCAGTG ACATTCAACG AAGTAGCTAC GGTAAGTTAC
GAGGACTTGG ATGGTGGCAT AGAGCTTGAT CCCAATGGCT GGGAAGATGT AGAGGACCTT
CATGAGATGA TCCCCAATAT CCCGCACACA AACAATGAGA ACGAGATAGC TCCAGAGGAA
TACCAATTAC TCACAGAAGA AGAGGAAAAC AGGATGGATG TTCATTTTCC AAAACCTAAA
CCTGTAGATG ATTTGGATTT GAACGACCCA GATTTCTATG ACAAGCTTCA CGAGAAGTAT
TATCCGGATC TCCCCAAAGA AACGGATAAA TTGTCGTGGA TGACGAAGCC TTTGCCCAAA
CAAACATCTA CCACCTACGA GTCTGTGGCA GATATGAGAT TTAACTTTCA TGGTGACTTG
ATTGAACTAC ACGACGAAAC GAGCCCACAA GCGAATCCCA AAAACGAGAT TCCAACGTAT
ATGGGGTTGC ACCACCATGC AGAAAATCCT CATTTGGCAG GCTATACCTT GGCAGAACTT
GCGCATTTGG CTAGATCTGT AGTTCCGAGC CAGAGATGTG TTAGCATCCA AACTCTCGGC
AGAATTCTTC ACAAGTTAGG CTTGCACAAG TACGCTGGTG TATCCGTAGA AGAAAACGAC
GAAGAGGATA AACATTTTAA TGAAAACGTC AGAGAGATGG TAGCAAGTTT CGAGAAGATG
TTATGGGACT TAATCGATCA ACTTCGTGTT ATAGAAACTA TTACAGACGC TGCAGACGAA
GCCAAGACGA AAAATTTATC TGTGAGGAAC TATGCTATCG AAGCATTATG GTTGTGGAAG
AAGGGCGGAG GTAGACCTGA AGAGTATGAA CAAACTGAA
 
Protein sequence
MDFIGEIVEH EIEAPSSTID VEVGGFPDPS KSREKKVSRW KKRVQKKGTE DAAKPLRESA 
NKVKNKNVVN DLTEAEKIHQ ENMDKIASMT EEEITHEREE LLQGLDPKLI QSLLKRTESR
IQKQADHVHG DSSNHGDHGH EHSEGYNGWI GGMRTKEGMS DLSQLDPEDV DKALGIKQLN
IKDDFDIEKD NSSKASEKPK KSVTFNEVAT VSYEDLDGGI ELDPNGWEDV EDLHEMIPNI
PHTNNENEIA PEEYQLLTEE EENRMDVHFP KPKPVDDLDL NDPDFYDKLH EKYYPDLPKE
TDKLSWMTKP LPKQTSTTYE SVADMRFNFH GDLIELHDET SPQANPKNEI PTYMGLHHHA
ENPHLAGYTL AELAHLARSV VPSQRCVSIQ TLGRILHKLG LHKYAGVSVE ENDEEDKHFN
ENVREMVASF EKMLWDLIDQ LRVIETITDA ADEAKTKNLS VRNYAIEALW LWKKGGGRPE
EYEQTE