Gene Information |
Locus tag | EcSMS35_4405 |
Symbol | argC |
ID | 6144314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4500824 |
End bp | 4501828 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619226 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_001746350 |
Protein GI | 170681939 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.149183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAATA CGCTGATTGT GGGTGCCAGC GGCTACGCTG GCGCAGAGCT AGTGACCTAT GTAAATCGCC ATCCGCATAT GAACATAACC GCTTTGACTG TTTCAGCGCA AAGCAATGAT GCGGGAAAGT TAATCTCCGA TTTGCATCCG CAGCTAAAAG GCATCGTTGA TCTGCCGTTG CAGCCGATGT CGGATATCAG CGAGTTTAGC CCAGGGGTGG ACGTGGTGTT TCTCGCCACC GCCCATGAAG TTAGCCACGA TTTAGCGCCG CAATTTCTTG AAGCCGGCTG CGTGGTGTTC GACCTTTCCG GCGCGTTTCG TGTTAACGAC GCCGCCTTCT ATGAAAAATA TTACGGCTTT ACCCATCAAT ACCCGGAACT GCTGGAACAG GCAGCCTACG GTCTGGCGGA GTGGTGCGGT AATAAATTAA AAGAAGCAAA CTTGATTGCG GTGCCAGGCT GTTATCCGAC GGCAGCACAG CTGGCGCTGA AACCGTTGAT TGATGCCGAT CTTCTTGACC TCATTCAGTG GCCGGTGATC AACGCTACCA GCGGCGTGAG CGGTGCAGGA CGTAAGGCAG CGATTTCAAA TAGTTTTTGT GAAGTTAGCC TGCAACCGTA TGGCGTCTTT ACCCATCGCC ATCAACCAGA GATCGCCACA CATCTCGGTG CTGACGTTAT CTTCACCCCG CATCTGGGCA ATTTCCCGCG CGGCATTCTC GAAACCATTA CCTGCCGCCT GAAATCGGGT GTGACCCAGG CGCAAGTCGC GCAAGTGTTA CAGCAGGCGT ATGCCCATAA ACCGCTGGTG CGGCTGTATG ACAAAGGCGT TCCGGCGCTG AAAAATGTCG TTGGGCTGCC ATTTTGCGAT ATCGGGTTTG CCGTTCAGGG CGAGCATCTG ATTATTGTGA CGACCGAAGA CAACTTACTG AAAGGCGCGG CGGCACAAGC GGTACAGTGC GCCAATATTC GTTTCGGCTA TGCGGAAACG CAGTCTCTTA TTTAA
|
Protein sequence | MLNTLIVGAS GYAGAELVTY VNRHPHMNIT ALTVSAQSND AGKLISDLHP QLKGIVDLPL QPMSDISEFS PGVDVVFLAT AHEVSHDLAP QFLEAGCVVF DLSGAFRVND AAFYEKYYGF THQYPELLEQ AAYGLAEWCG NKLKEANLIA VPGCYPTAAQ LALKPLIDAD LLDLIQWPVI NATSGVSGAG RKAAISNSFC EVSLQPYGVF THRHQPEIAT HLGADVIFTP HLGNFPRGIL ETITCRLKSG VTQAQVAQVL QQAYAHKPLV RLYDKGVPAL KNVVGLPFCD IGFAVQGEHL IIVTTEDNLL KGAAAQAVQC ANIRFGYAET QSLI
|
| |
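The fields above can be cross-checked against each other: the gene length follows from the start and end coordinates, the protein length from the gene length minus the stop codon, and the protein sequence from translating the gene sequence. The following is a minimal sketch of those checks. The function names (`clean`, `span_length`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. It assumes coordinates are 1-based and inclusive, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11's alternative start codons are not modeled, which does not matter here because the gene begins with ATG.

```python
# Codon-to-amino-acid mapping with bases in TCAG order.
# Assumption: NCBI translation table 11 assigns the same amino acids as the
# standard code and differs only in permitted start codons (not modeled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Values taken from the record above.
gene_length = span_length(4500824, 4501828)
protein_length = gene_length // 3 - 1  # one codon is the stop codon
first_ten = translate("ATGTTGAATA CGCTGATTGT GGGTGCCAGC")

print(gene_length, protein_length, first_ten)  # 1005 334 MLNTLIVGAS
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) extends the same check to the whole record, and `gc_fraction` on the full gene sequence can be compared with the reported GC content.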