Gene Information |
Locus tag | PICST_90939 |
Symbol | AROC |
ID | 4840678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 423323 |
End bp | 424518 |
Gene Length | 1196 bp |
Protein Length | 377 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391993 |
Product | Chorismate synthase |
Protein accession | XP_001386097 |
Protein GI | 126139149 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
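
The Gene Length above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive (an assumption; the record does not state the convention). The following minimal sketch checks that arithmetic and compares the span with the coding length implied by Protein Length. The variable and function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 423323
end_bp = 424518
protein_length_aa = 377

def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

gene_length = span_length(start_bp, end_bp)

# Coding length implied by the protein: one codon per residue plus a stop codon.
coding_length = 3 * (protein_length_aa + 1)

# Bases in the reported span that lie outside the coding region.
flanking = gene_length - coding_length

print(gene_length, coding_length, flanking)
```

Under these assumptions the span is 1196 bp, of which 1134 bp are coding, so the reported gene sequence carries 62 bp outside the open reading frame.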
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.352684 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGCATGTCCT CATTTGGTAC CTTATTCCGT GTAACAACCT ATGGGGAGTC GCACTGCAAA TCAGTCGGCT GTATAGTGGA TGGAGTCCCA CCCAATTTGG AATTGACAGA AGATGATATC CAACCCCAAT TAACCAGAAG AAGACCAGGA CAGTCGAAAT TGTCGACTCC AAGAAATGAA AAGGACCGCG TAGAAATCCA GAGTGGTACC GAAAATGGTT TAACCTTGGG TTCACCTATT GCCATGATTG TGAAAAATGA GGATCACAGA CCTCACGACT ACTCCGAGAC AGACCTTTAC CCAAGACCAT CGCATGCTGA TTGGACATAC ATACAGAAAT ACGGTACCAA GTCTTCCAGT GGAGGAGGTA GATCCTCGGC AAGAGAAACA ATCGGTAGAG TTGCTGCTGG AGCCATTGCT GAAAAGCTCT TGTCCAAGGC TAATGGTGTT GAAATCGTAG CCTTCGTTTC GTCCATTGGC CCGGTTTCCA TGGCCAGAGA CGCCTCTGAT CCCAAATTCC ACGAATTGTT AAACACTGTA ACCAGAGAAC AAATCGATGC TACTGGTCCT ATCAGGTGCC CAGATGAAAC TGTAAGAGAA GACATGGTCA AGGTCATTGA AAAGTACCGT GACGCACAAG ACTCGATTGG TGGGGTTGTC ACTTGTGTAG TGAGGAACTG TCCCATCGGA TTGGGAGAGC CATGTTTTGA TAAGTTGGAA GCTAAGTTGG CACATGCCAT GTTGTCGTTG CCAGCTACCA AGGGGTTCGA ATTTGGCTCG GGTTTCTTGG GTACACAAAT TCCAGGTTCT AAGCACAATG ATCCATTTTA CTACGACGAA TTGCACAAAA GATTGAGAAC CACCACCAAC TTCTCTGGTG GTATCCAGGG TGGTATCTCC AATGGTGAAA ATATTTACTT TTCCGTTGCC TTCAAGTCGG CTGCTACCAT TTCCCAGGAA CAGCCTACTG CCACCTACGA TGGAAAAGAT GGTGTCTTGG CTGCTAGAGG TAGACATGAC CCAAGCGTAA CACCAAGAGC TGTTCCTATT GTGGAGTCCA TGACTGCTTT GGTGTTGGCT GACCAGCTTC TTATTCAAAA GGCTAGAGAA TCTGGTGCTG CCATCGTCGG CAATTAAGTA CATAAGCATG TAAAATAAGC GTAAATAATC TACATAATAA TGAAAAATGT ACTTTT
|
Protein sequence | MSSFGTLFRV TTYGESHCKS VGCIVDGVPP NLELTEDDIQ PQLTRRRPGQ SKLSTPRNEK DRVEIQSGTE NGLTLGSPIA MIVKNEDHRP HDYSETDLYP RPSHADWTYI QKYGTKSSSG GGRSSARETI GRVAAGAIAE KLLSKANGVE IVAFVSSIGP VSMARDASDP KFHELLNTVT REQIDATGPI RCPDETVRED MVKVIEKYRD AQDSIGGVVT CVVRNCPIGL GEPCFDKLEA KLAHAMLSLP ATKGFEFGSG FLGTQIPGSK HNDPFYYDEL HKRLRTTTNF SGGIQGGISN GENIYFSVAF KSAATISQEQ PTATYDGKDG VLAARGRHDP SVTPRAVPIV ESMTALVLAD QLLIQKARES GAAIVGN
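
The record lists Translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. As a rough illustration, the sketch below builds that table in plain Python and translates the first 33 bases of the Gene sequence. The helper name `translate_table12` and the `offset=3` argument (skipping the three bases before the ATG) are assumptions made for this example, not something the record defines.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 12, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). Position 19 (CTG) is S instead of L.
AAS_TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AAS_TABLE_12)
}

def translate_table12(dna, offset=0):
    """Translate DNA from `offset`, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the Gene sequence field, spacing as in the record.
prefix = "AGCATGTCCT CATTTGGTAC CTTATTCCGT GTA"
print(translate_table12(prefix, offset=3))  # prints MSSFGTLFRV
```

The output matches the first ten residues of the Protein sequence field. This prefix contains no CTG codon, so the difference from the standard code would only show up elsewhere in the full sequence.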