Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2965 |
Symbol | argA |
ID | 6145876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3037873 |
End bp | 3039204 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617834 |
Product | N-acetylglutamate synthase |
Protein accession | YP_001744986 |
Protein GI | 170680431 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0548] Acetylglutamate kinase [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR01890] amino-acid N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.80689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTAAAGG AACGTAAAAC CGAGTTGGTC GAGGGATTCC GCCATTCGGT TCCCTATATC AATACCCACC GGGGAAAAAC GTTTGTCATA ATGCTCGGCG GTGAAGCCAT TGAGCATGAG AATTTCTCCA GTATCGTTAA TGATATCGGG TTGCTGCACA GCCTCGGCAT CCGTCTGGTC GTGGTCTATG GAGCGCGCCC GCAGATCGAT GCAAATCTGG CAGCGCATCA CCACGAACCG CTATATCACA AAAATATACG CGTGACCGAC GCCAAAACAC TGGAACTGGT GAAGCAGGCT GCGGGAACAT TGCAACTGGA TATTACTGCT CGCCTGTCGA TGAGTCTCAA TAACACGCCG CTGCAGGGCG CGCATATCAA CGTCGTCAGT GGCAATTTTA TTATTGCCCA GCCGCTGGGC GTGGATGACG GTGTGGATTA CTGCCATAGC GGGCGTATCC GGCGGATTGA CGAAGATGCG ATTCATCGTC AACTGGACAG CGGTGCGATA GTGCTGCTGG GGCCGGTCGC GGTTTCAGTC ACTGGCGAGA GCTTTAATCT GACCTCGGAA GAGATTGCCA CTCAACTGGC CATCAAACTG AAAGCTGAGA AGATGATTGG TTTTTGCTCT TCACAGGGCG TCACTAATGA CGACGGTGAT ATTGTCTCAG AACTTTTCCC TAACGAAGCG CAAGCGCGGG TAGAAGCCCA GGAAGAGAAA GGCGATTACA ACTCCGGTAC GGTGCGCTTT TTGCGTGGCG CAGTGAAAGC CTGCCGCAGC GGCGTGCGTC GCTGTCATTT AATCAGTTAT CAGGAAGATG GCGCGCTGTT GCAAGAGTTG TTCTCACGTG ACGGTATCGG TACGCAGATT GTGATGGAAA GCGCCGAGCA AATTCGTCGC GCAACAATCA ACGATATTGG CGGCATTCTG GAGTTGATTC GCCCACTGGA GCAGCAAGGT ATTCTGGTAC GCCGTTCTCG CGAGCAGCTG GAGATGGAAA TCGACAAATT CACCATTATT CAGCGCGATA ACACGACTAT TGCCTGCGCC GCGCTCTATC CGTTCCCGGA AGAGAAGATT GGGGAAATGG CCTGTGTGGC AGTTCACCCG GATTACCGCA GCTCATCACG GGGCGAGGTT CTGCTGGAAC GCATTGCCGC TCAGGCGAAG CAGAGCGGCT TAAGCAAATT GTTTGTGCTG ACCACGCGCA GTATTCACTG GTTCCAGGAA CGTGGATTTA CCCCAGTGGA TATTGATTTA CTGCCCGAGA GCAAAAAGCA GTTGTACAAC TACCAGCGTA AATCCAAAGT TTTGATGGCG GATTTAGGGT AA
|
Protein sequence | MVKERKTELV EGFRHSVPYI NTHRGKTFVI MLGGEAIEHE NFSSIVNDIG LLHSLGIRLV VVYGARPQID ANLAAHHHEP LYHKNIRVTD AKTLELVKQA AGTLQLDITA RLSMSLNNTP LQGAHINVVS GNFIIAQPLG VDDGVDYCHS GRIRRIDEDA IHRQLDSGAI VLLGPVAVSV TGESFNLTSE EIATQLAIKL KAEKMIGFCS SQGVTNDDGD IVSELFPNEA QARVEAQEEK GDYNSGTVRF LRGAVKACRS GVRRCHLISY QEDGALLQEL FSRDGIGTQI VMESAEQIRR ATINDIGGIL ELIRPLEQQG ILVRRSREQL EMEIDKFTII QRDNTTIACA ALYPFPEEKI GEMACVAVHP DYRSSSRGEV LLERIAAQAK QSGLSKLFVL TTRSIHWFQE RGFTPVDIDL LPESKKQLYN YQRKSKVLMA DLG
|
| |