Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51037 |
Symbol | ARG2 |
ID | 4851202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1184579 |
End bp | 1186324 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392910 |
Product | acetylglutamate synthase |
Protein accession | XP_001387873 |
Protein GI | 126274188 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG5630] Acetylglutamate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0751076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0746208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAT TGAAGAATTT GAACCGAGAA TTCATCTCCA ACTTGAAAAG TCACAAGCTC ATAACCGATG CTAAACGGAA CTTAATACTC TCCATCCTCA AATCTACTAC TACCAAGAGG GAAGCCCGGA ATTACCTCAA TAAGTACCAG AACCAGTTTG ATTTTGGCGA TTTGAAGATC TCGTCATCAG CAAAATACGA GCAGGATGTT TCCAAATTGA CAAAAAGAGA CTCCCAGCGT GAGTTGTTTG TCAACAGGTA CTTGAACAAA CAGAATCCAT TCATCAATAT CTACGATGAT GAGACCAAAC TCAAAAAAAT CCCTTTGAGA GTAGCATTGT TCAAGTTGAA ATTTCTCAAC ATCGATCCCA AAGAATGGCG TGGCATTGCC GAGACTTTCA AGCGATTGGT AAATTTGGGA ATTTCGCCCA TAGTGTTTCT AGATTATGAC CATCTTCCCA CAGACTCGTT CAAGTATAAC GAATTGTATA TGATTAATCA GGTCAATAAG GTCATGAACT ACCTTGGAAA ACCCGAAGAA GAAGGAAACC TAAAAACGAC GGTTTTGCGG TCATTATTCA CTGTCGAGAA CAAAGAAAGG GGTCCAGTAA TTAATAGTTT GGAATCTATA TTGATTCCCT TGTATCAGGG AATTATTCCT TTTATCCAGC CCATCATTTA CAATGCTGAG AGTACATTTC AGCAGTTTAT CAACTCGAAT CAGCTTTTGT ACAGCTTGTG TGAATCGTTG TTGGACAAGA AGGATCTTCT CTCAGTGGAG AAAATAGTCA TGATTGATCC TATTGGAGGA ATTCCCTCGG TTGAAAGAAA CCAGACGAGC CACGTATTTA TCAACTTGTC TCAAGAATAC TCGGATATAG TTTCCGAATT GTATATTGGA CATATTGAGC CTGATCAACG TGATTTGCAT CTTGCCAACT TGAATACCAT GCATGAAATC TTGACACTAG CTTCCTCCAA ATCGGGCAAT GATGACACGA CGGGGATCAT CACCACTCCA TTCATCATGT CTGTCAATGA TGATCTCATC AATCCGATTA TTTATAATGT TTTGACGGAT AGGCCCATCA TCTCTTCATC TCTACCTAGC TCCAACAACA GGACACCACA GCTTTCAACT TCTATTTTGA AGAAAGGAGT GGATGTGCGA TCGTACGATG CCGACAACTA TGCGAGAAAG TTCACTTTGC ATAATTTAAT AGAAGATGAA CTCGTAGACA AAAATAGGTT GGTAGCTCTT CTAGATGATT CGTTCGGCAA GAACTTGGAC ACAGATTCTT ATTTTGATAG AATCAATAAT TCGCTAGCTA CCCTTGTCAT TGTAGGGGAT TACGACGGTG CTGCTATCAT CACTTGGGAG TATAGTGGTA CCAACAAGAT CGCGTACTTG GACAAGTTCG CCATAGCAAA GAAGAACCAA GGATTACCTG GATTGGCAGA TGTGATCTTC AAGATAATTC TCCTGTCGCA TCCCCATGAG TTGATATGGA GATCTCGGAA AGTAAACCCT GTCAATAAGT GGTACTTTGA GAGATGTGTA GGCTCCATGA GTTCACCTGA GTCCCAATGG AGAATCTTTT ACACGGGTGA TATTTTCAAC CGCAGAATCG ACAAGAGAAG AAAGAGAATA GTTGGGAGTG AAGCTGTAAA CATTTCAGAC AAATTGGTGC AATACAGTGA AATTTGTGAA GGCATTCCTC CTTCTTTCTT TTCGTCTAAG GAATGA
|
Protein sequence | MSKLKNLNRE FISNLKSHKL ITDAKRNLIL SILKSTTTKR EARNYLNKYQ NQFDFGDLKI SSSAKYEQDV SKLTKRDSQR ELFVNRYLNK QNPFINIYDD ETKLKKIPLR VALFKLKFLN IDPKEWRGIA ETFKRLVNLG ISPIVFLDYD HLPTDSFKYN ELYMINQVNK VMNYLGKPEE EGNLKTTVLR SLFTVENKER GPVINSLESI LIPLYQGIIP FIQPIIYNAE STFQQFINSN QLLYSLCESL LDKKDLLSVE KIVMIDPIGG IPSVERNQTS HVFINLSQEY SDIVSELYIG HIEPDQRDLH LANLNTMHEI LTLASSKSGN DDTTGIITTP FIMSVNDDLI NPIIYNVLTD RPIISSSLPS SNNRTPQLST SILKKGVDVR SYDADNYARK FTLHNLIEDE LVDKNRLVAL LDDSFGKNLD TDSYFDRINN SLATLVIVGD YDGAAIITWE YSGTNKIAYL DKFAIAKKNQ GLPGLADVIF KIILLSHPHE LIWRSRKVNP VNKWYFERCV GSMSSPESQW RIFYTGDIFN RRIDKRRKRI VGSEAVNISD KLVQYSEICE GIPPSFFSSK E
|
| |