Gene PICST_55876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_55876 
Symbol 
ID4837555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp738695 
End bp739885 
Gene Length1191 bp 
Protein Length396 aa 
Translation table12 
GC content44% 
IMG OID640388870 
Productpredicted protein 
Protein accessionXP_001382908 
Protein GI150864184 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0416989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.862936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CCTTATCGCT CCAGAGTAGG GCAAAAACGA CTGCTCTCAA GCAGCCTAAA 
GAGATTTTTG CTTTTGCTCG AGACATAGAC GGCGAATTCG TGTACGACCA GAAAATAGTC
AAAGACGAAA ACGTATCGTA CTACTACTTG CCAGACTCCA AGATTGATGG AAGCATCGAC
TTGCAAGCTG GGTACGCCAA ATTCAAAAAA ATCCCAGAAG AGAAGAACAT GCTGGATATG
AAGTGTTTGC TTACGGCACT CACGAAGTAT GAGCAAGAAC ACAACAACGG CGAAAAAGTA
AATGTAGATA TCATCACATA CCGAGGGTTA ATGACTAAAT TGCTTGCTTT ACCATACAAC
TTGAACGACC CTGTAGATCT CAATGTACTA GCCTATGATG GACAATTGTT TATCAACAGC
GACGAGGAGA TCGAATTGGC AAGAAGAAAA GAAGAAGACG AGCACAAACA ACAGAGTATG
ACTCCAGAAA AGTATGATCA CATGAAGCGG TGTGAATTTA GCGGATACAA GTTTGAAGCC
ATAGCCACAT TGCCCAAGCC CTGGGCCGAC TGTAGTCGTC AACAAATCGA TAAAAGAGGC
AAGAAAATGG TGAACAACTA CGAACAGTAT ATTTCAGTAA TAAAGACTGG CATTGGTGAG
GCCAAGATGC TTTTGGCAGG AGAAGTGGAC TGTGTGTGGG ACTATATTCC AGAAGACGGA
AAAGATGTTC TTTCACATTA TATGGAGTTG AAGACAACTA GAATATTGGA GTCGAACGGC
CAGGTGGTCA ACTTTGAAAA GAAGTTGTTC AAGACGTGGG CCCAGTGTTT CTTGATGGGT
ATCCGTAAAG TGGTGTACGG ATTCCGTGAC GATTCGTTCT TCTTGCGCGA CGTCGAGTTG
TACAAGACGG AGGAGATCCC GTTGCTAATC AAGAACAATG CGCTTACTGA GAACAAATCC
GGGGGAAAGA TCAACTGTAC CACTGCCTTG AAATGGTATG GAGCAGTCAT TGAATGGCTC
TTGCAGGAGA TTCCAAGAGA CGATACTTCC AAGGCCTATC GAGTGAGTTT TGATCCAAGC
ACAAGAACTT TCACGTTAAG AGAGTTGATG GGTAATGAGA ATAGTAGGTT GAGAAACGGC
GAGATGTTGA CCTCGGAATT CAAGCAATGG AGAGAAAGCA TCCAAAAGTG A
 
Protein sequence
MMKTLSLQSR AKTTALKQPK EIFAFARDID GEFVYDQKIV KDENVSYYYL PDSKIDGSID 
LQAGYAKFKK IPEEKNMSDM KCLLTALTKY EQEHNNGEKV NVDIITYRGL MTKLLALPYN
LNDPVDLNVL AYDGQLFINS DEEIELARRK EEDEHKQQSM TPEKYDHMKR CEFSGYKFEA
IATLPKPWAD CSRQQIDKRG KKMVNNYEQY ISVIKTGIGE AKMLLAGEVD CVWDYIPEDG
KDVLSHYMEL KTTRILESNG QVVNFEKKLF KTWAQCFLMG IRKVVYGFRD DSFFLRDVEL
YKTEEIPLLI KNNALTENKS GGKINCTTAL KWYGAVIEWL LQEIPRDDTS KAYRVSFDPS
TRTFTLRELM GNENSRLRNG EMLTSEFKQW RESIQK