Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_75451 |
Symbol | MET3 |
ID | 4851940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3248672 |
End bp | 3250528 |
Gene Length | 1857 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 45% |
IMG OID | 640393648 |
Product | Sulfate adenylyltransferase (Sulfate adenylate transferase) (SAT) (ATP-sulfurylase) |
Protein accession | XP_001386944 |
Protein GI | 126276087 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2046] ATP sulfurylase (sulfate adenylyltransferase) |
TIGRFAM ID | [TIGR00339] ATP sulphurylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.20752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0971418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACTGAGCTCA TATCACTAGG CCTGTTCGTT TGTATATACC TGGGAATATC CGTACACTAG ACTCGATTTA CCCCACATTT CGCAGCTATT TAAAATTTTC AGATTTTTCA GATTTTTCAG ATAGAATTTT CACTCTGAAA TTCACCCGAA CTATTTATTA AACTTATTCC TATTACAATG CCTATTCCTG CTGCCCACGG TGGTGTATTG AACGACTTGG TCATTCGTGA CGCTGGTATC AGAGATCAGT TAATTCAGGA AGCTGCTGGG CTTTCTGCTT TGACTTTAAC TGATAGACAG CTCTGCGACT TGGAATTGAT CTTGAACGGT GGATTCTCTC CTTTGAAAGG TTTCTTGAAC GAAGACGACT ACAAGTCGGT GGTTTCTGAC TTGAGATTAT CTTCCGTAAC CGACAAGAAA TCCGGCAAGG GTTTGTTGTG GCCAATTCCT ATCACATTGG ACGTTTCTCC AGAGACCGCT GCTCAGTACA AGGTTGGTGA TAGAATCGTG TTGAAAGACT TGAGAGACGA AACAAATCTC GCTATTTTGA CCATTGAATC GATCTATAAG CCCGATAAGA AGCTCGAAGC AGAAAGTGTC TTCAGAGGTG ACCCAGAACA CCCAGCCATC AGATATTTGA ACGAAACTGC TGGCGACGTC TACATTGGTG GTTCTCTCCA GGGTTTGAAC TACCCAAGAC ACTATGACTA TGTCGAATCG AGAAAGACTC CTACTGAACT CAGAGCTGAG TTCCAGAAGT TGGGCTGGGA CGACCAGAAC ATCGTTGCTT TCCAAACTAG AAACCCTATG CACAGAGCTC ACAGAGAATT GACCATTAGA GCTGCTAAGG ATATCGGTGA AACTGGCCAC ATCTTGATCC ACCCAGTTGT TGGTTTGACC AAGCCAGGTG ATATTGACCA CCACACCAGA GTCAAGGTGT ACACCCAAAT CTTGAAGAAG TTCCCTGATG GTTTGGCCAC ATTGTCACTC TTGCCTCTTG CCATGAGAAT GGGTGGAGAT AGAGAAGCCT TGTGGCACGC TTTGATCAGA ACCAACTACG GTGTTGACCA CTTCATTGTC GGTAGAGATC ATGCTGGTCC CGGTAAGAAC TCGCAGGGTG TCGACTTCTA CGGTCCTTAC GATGCCCAAG AGTTGTTAGC CAACTACGAA GATGAGTTGA CTATCAAGAT CGTTCCTTTC AGAATGGTCA CTTACTTGCC AGAAGAGGAC AGATACGCTC CTATTGACAC AATTGACACC TCCAAGGTGA AAACGGCCAA CATTTCTGGT ACTGAGTTGA GAAACAGATT GAAGACTGGT GACCATATTC CTGAATGGTT CTCGTATCCA GAAGTAGTCA AGATCTTGAG AGAAACCAAC CCTCCTAGAG CCAAGCAGGG TTTTGCAATC CTCATTGACA ACTCCAGCAA GAACGGTGAC TACCTTGCAT TCGCCTTGCA ATCCACCTTG AACCAGTTCT CCGGTGAACG TCGTATCACC AAGTTGAGCT CGACTCACGT CGATGACTTC ATCATCAATG AGTTGGTCAA GGCTGGTTCC GGAGTTCTCA TTCCAACCAC TACTGGAGTC GACTCCATTG TCAAGTCTAT TGGAAAGGGT AACGTCTTGA CTGTCAAGTC TGGCAAAGAT GCCCAAATTG AGCAAGGTGA ATTTGCTTTG AACGGGAGCG ATTTGTCCGT CGTCATCAAG GAAATCGTTG AGTACTTGCA CCAACAAGGT TTCTACTAAG TTTCTTTATA TTCTTGAACA TACTTTTGGG GTTTAAAATA TCTTCACTTC GCAGCATTTA TAAACGTTTA GCTAGACAAT ACAGTTGCAA TGGCAATATC GATAATT
|
Protein sequence | MPIPAAHGGV LNDLVIRDAG IRDQLIQEAA GLSALTLTDR QLCDLELILN GGFSPLKGFL NEDDYKSVVS DLRLSSVTDK KSGKGLLWPI PITLDVSPET AAQYKVGDRI VLKDLRDETN LAILTIESIY KPDKKLEAES VFRGDPEHPA IRYLNETAGD VYIGGSLQGL NYPRHYDYVE SRKTPTELRA EFQKLGWDDQ NIVAFQTRNP MHRAHRELTI RAAKDIGETG HILIHPVVGL TKPGDIDHHT RVKVYTQILK KFPDGLATLS LLPLAMRMGG DREALWHALI RTNYGVDHFI VGRDHAGPGK NSQGVDFYGP YDAQELLANY EDELTIKIVP FRMVTYLPEE DRYAPIDTID TSKVKTANIS GTELRNRLKT GDHIPEWFSY PEVVKILRET NPPRAKQGFA ILIDNSSKNG DYLAFALQST LNQFSGERRI TKLSSTHVDD FIINELVKAG SGVLIPTTTG VDSIVKSIGK GNVLTVKSGK DAQIEQGEFA LNGSDLSVVI KEIVEYLHQQ GFY
|
| |