Gene PICST_82902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82902 
Symbol 
ID4837739 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp632265 
End bp633749 
Gene Length1485 bp 
Protein Length481 aa 
Translation table12 
GC content43% 
IMG OID640389054 
Productpredicted protein 
Protein accessionXP_001383755 
Protein GI150864783 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.453642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.3745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAG CTGCTGAGGA GCCAGCGCCC TCTGCCTCGA CAAATGAAAC CATTCTCAAA 
GAAATCGACC AATCCTTTCT GTTGTTGTCC AAAGCTGGCT CATCCTACGA CAACAGATAC
ATATCCAAAG TGTTCCGTGA CTTGGGACCG TTGAGACGGA AGATTCGTGC CGCAGACGAC
GTGTTGCCTT CTGTTGTCGC CAAATTGTAC CCATCGGCTC ATACCAATAA GGTGTATTTG
CTTGAAGCAC TTGGAGCAGA AGAAAATGTC AAGTCTGAAA AGGCCGACGA AATGGAGGTC
GATTCGACTT CTTCCAACTC AGTCAATTCT ACCAATGCTG AGACTCCAGA AGCGGAACTC
TATGTGCATT TGCTAGTACA AGTGTATCTC TTGGATACCA ACCAATTGGC CAAAGCCGAT
ACGCTCAATG CTAATATCAT CCAGTTGATG AAGCTCTACA ATAGAAGATC GCTTGACTTT
ATCCAGGCCA AGATATGGTT CTTTATTGCC AGAACCAAGG AGCTTCTTGG TGACTTGGTT
ACGATACGTC CCGAGCTCTT ATCGTCTTTG AGAACTGCCA CTTTGAGACA CGACACCGAA
ACAACCGCAT CCATAATAAC CCTCTTGTTA CGGAACTATT TGTTGTCTCA CGACATTTCT
CAAGCTGCTA ATTTAGTAGA AAAGACAGAG TTTCCCGAAA ATGCTGGAAA TGCTCTTGTA
GCTAGATACT ACTACTATTT AGCTCGAATT AACTCCATCC AATTGGACTA TTCCACAGCC
CACGAATGTG TCATTGCTGC CATCAGAAAG GCGCCACAGA CCCATTTGGC CAACGGATTC
ATCCAGAGCG CAAGCAAGTT GCGGATCTTG ATCGAATTGT TGATGGGAGA CATTCCTGAG
TTGAAAGTTT TCCATAAGCA AGCAGGATCT TTTGAGCCAT ATTTCTATGT GACCCAGGCC
GTGAAATTGG GTGATTTGAA ATTGTTTGGT CAAGTTTTGA ACAAGTACGA AACTGTATTC
AAACGCGATG ATAATTTTAC ATTGGTCTCG AGATTACGTC AGAACGTCAT AAAAACAGGC
ATCAGAATTA TCTCATTGTC CTACTCCAAG ATTCTGTTAA AGGATATCTG TATCAAGTTG
CACTTGGACT CGGAAGAACT GACAGAATAC ATCGTATCCA AAGCGATCAG AGACGGAGTT
ATAGAAGCCA CTATCAACCA CCAGAAAGGG TTTATGCAAT CCAAGGAGTT ATTGGACGTC
TACTCGACTA AGTTGCCTCA GAACGAGTTT GACCAGAGAA TCAAGTTCTG CTTATCTTTA
CACAATGATA GCGTCAAGTC GATGAGATAT CCTAATGATA GTGATGATAA GGCAGATGCG
GTTAAGAATG AGTCAAAGGA AGATGAGATG GACATAATGA GAGCCATTGA AGAGGGTGAC
TTGGACGACT TCTTGGACTA GATAGAAAGA ATAAAGAGTA AGAAT
 
Protein sequence
MKEAAEEPAP SASTNETILK EIDQSFSLLS KAGSSYDNRY ISKVFRDLGP LRRKIRAADD 
VLPSVVAKLY PSAHTNKVYL LEALGAEENA DEMEVDSTSS NSVNSTNAET PEAELYVHLL
VQVYLLDTNQ LAKADTLNAN IIQLMKLYNR RSLDFIQAKI WFFIARTKEL LGDLVTIRPE
LLSSLRTATL RHDTETTASI ITLLLRNYLL SHDISQAANL VEKTEFPENA GNALVARYYY
YLARINSIQL DYSTAHECVI AAIRKAPQTH LANGFIQSAS KLRILIELLM GDIPELKVFH
KQAGSFEPYF YVTQAVKLGD LKLFGQVLNK YETVFKRDDN FTLVSRLRQN VIKTGIRIIS
LSYSKISLKD ICIKLHLDSE ESTEYIVSKA IRDGVIEATI NHQKGFMQSK ELLDVYSTKL
PQNEFDQRIK FCLSLHNDSV KSMRYPNDSD DKADAVKNES KEDEMDIMRA IEEGDLDDFL
D