Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4452 |
Symbol | argC |
ID | 6482587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4331140 |
End bp | 4332144 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739691 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_002043385 |
Protein GI | 194445279 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.00023941 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGAATA CGCTGATTGT AGGCGCTAGC GGTTATGCGG GCGCAGAGCT TGTAAGCTAC GTAAATCGCC ACCCACATAT GACCATAACC GCTTTGACCG TCTCAGCGCA AAGCAATGAT GCAGGAAAGT TAATTTCTGA TTTACATCCG CAGTTAAAAG GGATTGTCGA CCTGCCATTG CAACCGATGT CGGATGTACG TGACTTTAGC GCAGATGTTG ATGTGGTGTT CCTGGCGACA GCGCATGAAG TGAGCCACGA TCTGGCGCCG CAATTTTTAC AGGCCGGATG CGTCGTATTC GATCTTTCCG GCGCGTTTCG GGTAAACGAT CGCGCCTTCT ATGAAAAGTA TTATGGATTT ACTCATCAGT ATCCTGAGCT GCTGGAGCAG GCGGTGTACG GTCTGGCGGA GTGGAATGCC GATAAACTGA ACACGGCGAA TCTGATTGCG GTTCCTGGTT GCTATCCAAC CGCAGCGCAG TTGTCGCTAA AACCGCTGAT TGACGGCGGG CTTCTGGATC TGACGCAGTG GCCGGTAATT AATGCCACCA GCGGGGTGAG CGGCGCGGGG CGCAAAGCGG CAATTTCGAA CAGTTTCTGT GAAGTGAGTC TGCAACCGTA TGGCGTGTTT ACGCATCGTC ATCAGCCGGA AATCGCCGTC CATCTGGGCG CGGAGGTCAT CTTCACCCCG CATCTGGGGA ATTTTCCGCG CGGGATTCTG GAAACCATTA CCTGCCGCCT GAAAGCGGGA GTAACCCATG CGCAGGTCGC CGACGTGTTG CAAAAAGCCT ATGGCGACAA ACCGCTGGTG CGCCTGTATG ACAAAGGCGT TCCGGCGTTA AAGAACGTCG TCGGGCTGCC GTTCTGCGAT ATTGGTTTTG CCGTCCAGGG CGAACATCTG ATTGTGGTGG CAACCGAAGA CAACTTGTTG AAAGGCGCAG CGGCGCAGGC AGTACAGTGC GCTAATATTC GTTTTGGCTT CGCTGAAACG CAGTCTCTTA TTTAA
|
Protein sequence | MLNTLIVGAS GYAGAELVSY VNRHPHMTIT ALTVSAQSND AGKLISDLHP QLKGIVDLPL QPMSDVRDFS ADVDVVFLAT AHEVSHDLAP QFLQAGCVVF DLSGAFRVND RAFYEKYYGF THQYPELLEQ AVYGLAEWNA DKLNTANLIA VPGCYPTAAQ LSLKPLIDGG LLDLTQWPVI NATSGVSGAG RKAAISNSFC EVSLQPYGVF THRHQPEIAV HLGAEVIFTP HLGNFPRGIL ETITCRLKAG VTHAQVADVL QKAYGDKPLV RLYDKGVPAL KNVVGLPFCD IGFAVQGEHL IVVATEDNLL KGAAAQAVQC ANIRFGFAET QSLI
|
| |