Gene PICST_55211 details


Gene Information

Locus tag: PICST_55211
Symbol:
ID: 4836966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1734205
End bp: 1735200
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 12
GC content: 42%
IMG OID: 640388281
Product: predicted protein
Protein accession: XP_001383099
Protein GI: 126133148
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID:
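
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1734205
END_BP = 1735200
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 996
    print(expected_protein_length(GENE_LENGTH_BP))  # 331
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields.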


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.110005
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCAG CTGCTACACC ACAAACTCAG GAAACTCAGG ACAATGATGC CTTTTTAAAG 
AGCATGCCCT TAGAGAATGT CGCACCCTTA TGGCATATCT TGAAGGACTT ATCTCCTCCA
AAGCCAAAGC CAACATCTGT GCCTCACCTC TGGAACTACA AAAAATTGAA ACCAATCTTA
GATGAATCTG GGCGTTTAGT ACCAACTGAA CTTGCCGAAA GAAGAGTGCT TATGTTGGTT
AACCCAAAGT TGACAGGACC ACGTACAACT GAAACTTTAT ACGCTGGTCT TCAATACATT
AAACCTGGTG AAGTTGCTCC AGCACACAGA CATGTTGCCT TTGCTTTTAG ATTCATTCTT
GAGGGACAAG GTGGATTTAC TAATGTTGAA GGTACAAGAA TGACTATGAA GAGAGGTGAT
GTGCTTTTGA CCCCAAGAAA CTGCTGGCAC GATCACGGTA AGGACGGAAG CGGTCCAATG
ATCTGGTTGG ATGGTTTGGA CTTGCCAATT TTCCAAACTA TTCCAGTCAA CTACACTAAC
CACTACGAAG AAGATAGATT CCCAGCAGTT GACAATGATA ATACACCAAT GAAGTTCCCA
TGGCAACCAG TTCAAGATAA GCTTGACAGT ATTAAAGGTG ACTATGCTAT TTTCGAATAC
CGTGACCAGG AAAACCCTGA AAAATTTGTA TCTTCCATTC TCGGTGCTGA AGCTTTGAGA
ATTTCTCCAA ATGCTTCTAC TCCTGTACGT CAGGAAAACA GTTCCTTCGT TTTCTGTGTC
TACGAAGGAA AGGGTCACAC TATCGTCTAC GGTGATGATG GCGAAGAAAA TGTTTTAAAC
TGGGAAAACA GTGATGTGTT CTGTATTCCA TGTAACATGC CATTCAAGCA CTTTAACGAC
AGCAGCAATG AACAAGCCTA CCTCTTTAAC TTCTCGGACA CCCCATTGCT CAAGAACTTA
AATATCCATA GTTCCGATTT GGAAGTTAAG AACTAG
 
Protein sequence
MAPAATPQTQ ETQDNDAFLK SMPLENVAPL WHILKDLSPP KPKPTSVPHL WNYKKLKPIL 
DESGRLVPTE LAERRVLMLV NPKLTGPRTT ETLYAGLQYI KPGEVAPAHR HVAFAFRFIL
EGQGGFTNVE GTRMTMKRGD VLLTPRNCWH DHGKDGSGPM IWLDGLDLPI FQTIPVNYTN
HYEEDRFPAV DNDNTPMKFP WQPVQDKLDS IKGDYAIFEY RDQENPEKFV SSILGAEALR
ISPNASTPVR QENSSFVFCV YEGKGHTIVY GDDGEENVLN WENSDVFCIP CNMPFKHFND
SSNEQAYLFN FSDTPLLKNL NIHSSDLEVK N
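
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the gene sequence maps to the protein sequence under that table. It is illustrative only: the helper `translate` and the constant names are hypothetical, and the codon table is written out by hand rather than taken from any library.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in NCBI order (first base varies slowest,
# third base fastest). Table 12 differs from the standard code only at
# CTG, which encodes S instead of L.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TO_AA[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGCTCCAG CTGCTACACC ACAAACTCAG"))  # MAPAATPQTQ
```

The first ten residues produced this way match the start of the protein sequence shown in the record.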