Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0966 |
Symbol | argC |
ID | 5055468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 858940 |
End bp | 860001 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468522 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_001153198 |
Protein GI | 145591196 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.170153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000021717 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTTGA GGACATGTAT AGTCGGGGCG TCGGGCTTTG TTGGCGGTGA GCTATTGCGC ATTCTTCTCC AGCACAGCGG GGTCGAGGTG GTGTGCGCCA CGTCTAGGAA GTTTAAGGGG GAGTACATCT ACAGGGTGCA CCCCCACTTG AGGGGGTTTA CCCAGCTTAA GTTCGTGGAG CCCACTATCG ACGCTGCGCT TAAGGCAGAC GTGGTCTTCC TCGCTCTGCC CCACGGGGAG TCTGTGAAGT GGGTGCCCAA GCTCTACGAG TCCGGAGTCG CCGTGTTTGA CCTTAGCGCC GATTTTAGGC TAAAGGACCC CAACGCCTAC GTGGAGTGGT ACAAGTGGCC GCAGCCCCAT CCCTACCCCG ACTTGTTGCA GAAGGCGGTG TACGGCCAGC CTGAGCTCCA CAGGAATGAG CTGGTCGGCG CCAAGCTGGT GGCAGTCCCC GGTTGCATGG CCACAGCCTC TATCCTCATG CTGGCCCCCC TAGCTAAGCA CGGCTTCCTC GGCAGGGCGC CTCCCGTGGT AGACGCCAAG ATAGGGTCAA GCGGCGCCGG CGCCGAGGGG TCCATCGTCG ATCTCCACAG CTTCCGCACC TACGTGGTTA GGCCCTACGA GCCTGTCCAC CACCGCCACA TCGCGGAGAT TGAGCAGGAG CTTAGCCTAC TGGCTGGGAA GAAGATCAGG GTGGCCTTTA CCCCCCACGC CGTGGATTTG GTGAGGGGAA TCTTCGCCAC CGGCCACACC TACGTGGAGA AGCTCCCCAC CGAGGCCGAC ATGTGGAAGA TGTACCGCGC CCTCTTCGGC GACTCGAAGT TCATTAGGAT TGTGAAGGAC AGGCTGGGGA TCGCCCGCTA CCCCAACACC AAGTATGTTA TAGGCTCAAA TTTCGTTGAC ATCGGCTTCG AGCTAGACCC GAGGCTCAAC AGGGTAGTCA CCTTCGCCGC GATAGACAAC CTCGTAAGGG GCGCCGCGGG ACAGGCGGTG CAAGCCTTCA ACGTGGCCTT TGGCTTCCCC GAAGACGAGG GCCTGAGATA CATCCCGCTG GCTCCGGTAT GA
|
Protein sequence | MTLRTCIVGA SGFVGGELLR ILLQHSGVEV VCATSRKFKG EYIYRVHPHL RGFTQLKFVE PTIDAALKAD VVFLALPHGE SVKWVPKLYE SGVAVFDLSA DFRLKDPNAY VEWYKWPQPH PYPDLLQKAV YGQPELHRNE LVGAKLVAVP GCMATASILM LAPLAKHGFL GRAPPVVDAK IGSSGAGAEG SIVDLHSFRT YVVRPYEPVH HRHIAEIEQE LSLLAGKKIR VAFTPHAVDL VRGIFATGHT YVEKLPTEAD MWKMYRALFG DSKFIRIVKD RLGIARYPNT KYVIGSNFVD IGFELDPRLN RVVTFAAIDN LVRGAAGQAV QAFNVAFGFP EDEGLRYIPL APV
|
| |