Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4449 |
Symbol | argC |
ID | 6491051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4336319 |
End bp | 4337323 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642744531 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_002048120 |
Protein GI | 194450761 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.000120471 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGAATA CGCTGATTGT AGGCGCTAGC GGTTATGCGG GCGCAGAGCT TGTAAGCTAC GTAAATCGCC ACCCACATAT GACCATAACC GCTTTGACCG TCTCAGCGCA AAGCAATGAT GCAGGAAAGT TAATTTCTGA TTTACATCCG CAGTTAAAAG GGATTGTCGA CCTGCCATTG CAACCGATGT CGGATGTACG TGACTTTAGC GCAGATGTTG ATGTGGTGTT CCTGGCGACA GCGCATGAAG TGAGCCACGA TCTGGCGCCG CAATTTTTAC AGGCCGGATG CGTCGTATTC GATCTTTCCG GCGCGTTTCG GGTAAACGAT CGCGCCTTCT ATGAAAAGTA TTACGGATTT ACTCATCAGT ATCCTGAGCT GCTGGAGCAG GCGGTGTACG GTCTGGCGGA GTGGAATGTC GATAAACTGA ACACGGCGAA TCTGATTGCG GTTCCTGGTT GCTATCCAAC CGCAGCGCAG TTGTCGCTAA AACCGCTGAT TGACGGCGGG CTTCTGGATC TGACGCAGTG GCCGGTAATT AATGCCACCA GCGGGGTGAG CGGCGCGGGG CGCAAAGCGG CAATTTCGAA CAGTTTCTGT GAAGTGAGTC TGCAACCGTA TGGCGTGTTT ACGCATCGTC ATCAGCCGGA AATCGCTGCC CATCTGGGCG CGGAGGTCAT CTTCACCCCG CATCTGGGGA ATTTCCCGCG CGGGATTCTG GAAACCATTA CCTGCCGCCT GAAAGCGGGA GTAACCCATG CGCAGGTCGC CGACGTGTTG CAAAAAGCCT ATGGCGACAA ACCGCTGGTG CGCCTGTATG ACAAAGGCGT TCCGGCGTTA AAGAACGTCG TCGGGCTGCC GTTCTGCGAT ATTGGTTTTG CCGTCCAGGG CGAACATCTG ATTGTGGTGG CAGCCGAAGA CAACTTGTTG AAAGGCGCAG CGGCGCAGGC AGTACAGTGC GCTAATATTC GTTTTGGCTT CGCTGAAACG CAGTCTCTTA TTTAA
|
Protein sequence | MLNTLIVGAS GYAGAELVSY VNRHPHMTIT ALTVSAQSND AGKLISDLHP QLKGIVDLPL QPMSDVRDFS ADVDVVFLAT AHEVSHDLAP QFLQAGCVVF DLSGAFRVND RAFYEKYYGF THQYPELLEQ AVYGLAEWNV DKLNTANLIA VPGCYPTAAQ LSLKPLIDGG LLDLTQWPVI NATSGVSGAG RKAAISNSFC EVSLQPYGVF THRHQPEIAA HLGAEVIFTP HLGNFPRGIL ETITCRLKAG VTHAQVADVL QKAYGDKPLV RLYDKGVPAL KNVVGLPFCD IGFAVQGEHL IVVAAEDNLL KGAAAQAVQC ANIRFGFAET QSLI
|
| |