Gene PICST_90939 details


Gene Information

Locus tag: PICST_90939
Symbol: AROC
ID: 4840678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 423323
End bp: 424518
Gene Length: 1196 bp
Protein Length: 377 aa
Translation table: 12
GC content: 46%
IMG OID: 640391993
Product: Chorismate synthase
Protein accession: XP_001386097
Protein GI: 126139149
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
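
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein is encoded by one contiguous reading frame ending in a single stop codon. The function names are illustrative only and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(423323, 424518)
cds_len = coding_length(377)

print(gene_len)            # 1196
print(cds_len)             # 1134
print(gene_len - cds_len)  # 62
```

Under these assumptions, the 62 bases not accounted for by the reading frame would be flanking sequence included in the listed gene span.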


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.352684
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AGCATGTCCT CATTTGGTAC CTTATTCCGT GTAACAACCT ATGGGGAGTC GCACTGCAAA 
TCAGTCGGCT GTATAGTGGA TGGAGTCCCA CCCAATTTGG AATTGACAGA AGATGATATC
CAACCCCAAT TAACCAGAAG AAGACCAGGA CAGTCGAAAT TGTCGACTCC AAGAAATGAA
AAGGACCGCG TAGAAATCCA GAGTGGTACC GAAAATGGTT TAACCTTGGG TTCACCTATT
GCCATGATTG TGAAAAATGA GGATCACAGA CCTCACGACT ACTCCGAGAC AGACCTTTAC
CCAAGACCAT CGCATGCTGA TTGGACATAC ATACAGAAAT ACGGTACCAA GTCTTCCAGT
GGAGGAGGTA GATCCTCGGC AAGAGAAACA ATCGGTAGAG TTGCTGCTGG AGCCATTGCT
GAAAAGCTCT TGTCCAAGGC TAATGGTGTT GAAATCGTAG CCTTCGTTTC GTCCATTGGC
CCGGTTTCCA TGGCCAGAGA CGCCTCTGAT CCCAAATTCC ACGAATTGTT AAACACTGTA
ACCAGAGAAC AAATCGATGC TACTGGTCCT ATCAGGTGCC CAGATGAAAC TGTAAGAGAA
GACATGGTCA AGGTCATTGA AAAGTACCGT GACGCACAAG ACTCGATTGG TGGGGTTGTC
ACTTGTGTAG TGAGGAACTG TCCCATCGGA TTGGGAGAGC CATGTTTTGA TAAGTTGGAA
GCTAAGTTGG CACATGCCAT GTTGTCGTTG CCAGCTACCA AGGGGTTCGA ATTTGGCTCG
GGTTTCTTGG GTACACAAAT TCCAGGTTCT AAGCACAATG ATCCATTTTA CTACGACGAA
TTGCACAAAA GATTGAGAAC CACCACCAAC TTCTCTGGTG GTATCCAGGG TGGTATCTCC
AATGGTGAAA ATATTTACTT TTCCGTTGCC TTCAAGTCGG CTGCTACCAT TTCCCAGGAA
CAGCCTACTG CCACCTACGA TGGAAAAGAT GGTGTCTTGG CTGCTAGAGG TAGACATGAC
CCAAGCGTAA CACCAAGAGC TGTTCCTATT GTGGAGTCCA TGACTGCTTT GGTGTTGGCT
GACCAGCTTC TTATTCAAAA GGCTAGAGAA TCTGGTGCTG CCATCGTCGG CAATTAAGTA
CATAAGCATG TAAAATAAGC GTAAATAATC TACATAATAA TGAAAAATGT ACTTTT
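
The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the same 10-base blocks shown here; `gc_fraction` is a hypothetical helper, not part of any database tooling, and it ignores every character other than A, C, G and T.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks and any other characters are skipped, so the
    blocked layout used in this record can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence, used here only as sample input.
first_line = "AGCATGTCCT CATTTGGTAC CTTATTCCGT GTAACAACCT ATGGGGAGTC GCACTGCAAA"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1196-base sequence instead of the first line would give the figure to compare against the listed GC content.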
 
Protein sequence
MSSFGTLFRV TTYGESHCKS VGCIVDGVPP NLELTEDDIQ PQLTRRRPGQ SKLSTPRNEK 
DRVEIQSGTE NGLTLGSPIA MIVKNEDHRP HDYSETDLYP RPSHADWTYI QKYGTKSSSG
GGRSSARETI GRVAAGAIAE KLLSKANGVE IVAFVSSIGP VSMARDASDP KFHELLNTVT
REQIDATGPI RCPDETVRED MVKVIEKYRD AQDSIGGVVT CVVRNCPIGL GEPCFDKLEA
KLAHAMLSLP ATKGFEFGSG FLGTQIPGSK HNDPFYYDEL HKRLRTTTNF SGGIQGGISN
GENIYFSVAF KSAATISQEQ PTATYDGKDG VLAARGRHDP SVTPRAVPIV ESMTALVLAD
QLLIQKARES GAAIVGN