Gene EcSMS35_2965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2965 
SymbolargA 
ID6145876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3037873 
End bp3039204 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content53% 
IMG OID641617834 
ProductN-acetylglutamate synthase 
Protein accessionYP_001744986 
Protein GI170680431 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0548] Acetylglutamate kinase
[COG1246] N-acetylglutamate synthase and related acetyltransferases 
TIGRFAM ID[TIGR01890] amino-acid N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.80689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAAAGG AACGTAAAAC CGAGTTGGTC GAGGGATTCC GCCATTCGGT TCCCTATATC 
AATACCCACC GGGGAAAAAC GTTTGTCATA ATGCTCGGCG GTGAAGCCAT TGAGCATGAG
AATTTCTCCA GTATCGTTAA TGATATCGGG TTGCTGCACA GCCTCGGCAT CCGTCTGGTC
GTGGTCTATG GAGCGCGCCC GCAGATCGAT GCAAATCTGG CAGCGCATCA CCACGAACCG
CTATATCACA AAAATATACG CGTGACCGAC GCCAAAACAC TGGAACTGGT GAAGCAGGCT
GCGGGAACAT TGCAACTGGA TATTACTGCT CGCCTGTCGA TGAGTCTCAA TAACACGCCG
CTGCAGGGCG CGCATATCAA CGTCGTCAGT GGCAATTTTA TTATTGCCCA GCCGCTGGGC
GTGGATGACG GTGTGGATTA CTGCCATAGC GGGCGTATCC GGCGGATTGA CGAAGATGCG
ATTCATCGTC AACTGGACAG CGGTGCGATA GTGCTGCTGG GGCCGGTCGC GGTTTCAGTC
ACTGGCGAGA GCTTTAATCT GACCTCGGAA GAGATTGCCA CTCAACTGGC CATCAAACTG
AAAGCTGAGA AGATGATTGG TTTTTGCTCT TCACAGGGCG TCACTAATGA CGACGGTGAT
ATTGTCTCAG AACTTTTCCC TAACGAAGCG CAAGCGCGGG TAGAAGCCCA GGAAGAGAAA
GGCGATTACA ACTCCGGTAC GGTGCGCTTT TTGCGTGGCG CAGTGAAAGC CTGCCGCAGC
GGCGTGCGTC GCTGTCATTT AATCAGTTAT CAGGAAGATG GCGCGCTGTT GCAAGAGTTG
TTCTCACGTG ACGGTATCGG TACGCAGATT GTGATGGAAA GCGCCGAGCA AATTCGTCGC
GCAACAATCA ACGATATTGG CGGCATTCTG GAGTTGATTC GCCCACTGGA GCAGCAAGGT
ATTCTGGTAC GCCGTTCTCG CGAGCAGCTG GAGATGGAAA TCGACAAATT CACCATTATT
CAGCGCGATA ACACGACTAT TGCCTGCGCC GCGCTCTATC CGTTCCCGGA AGAGAAGATT
GGGGAAATGG CCTGTGTGGC AGTTCACCCG GATTACCGCA GCTCATCACG GGGCGAGGTT
CTGCTGGAAC GCATTGCCGC TCAGGCGAAG CAGAGCGGCT TAAGCAAATT GTTTGTGCTG
ACCACGCGCA GTATTCACTG GTTCCAGGAA CGTGGATTTA CCCCAGTGGA TATTGATTTA
CTGCCCGAGA GCAAAAAGCA GTTGTACAAC TACCAGCGTA AATCCAAAGT TTTGATGGCG
GATTTAGGGT AA
 
Protein sequence
MVKERKTELV EGFRHSVPYI NTHRGKTFVI MLGGEAIEHE NFSSIVNDIG LLHSLGIRLV 
VVYGARPQID ANLAAHHHEP LYHKNIRVTD AKTLELVKQA AGTLQLDITA RLSMSLNNTP
LQGAHINVVS GNFIIAQPLG VDDGVDYCHS GRIRRIDEDA IHRQLDSGAI VLLGPVAVSV
TGESFNLTSE EIATQLAIKL KAEKMIGFCS SQGVTNDDGD IVSELFPNEA QARVEAQEEK
GDYNSGTVRF LRGAVKACRS GVRRCHLISY QEDGALLQEL FSRDGIGTQI VMESAEQIRR
ATINDIGGIL ELIRPLEQQG ILVRRSREQL EMEIDKFTII QRDNTTIACA ALYPFPEEKI
GEMACVAVHP DYRSSSRGEV LLERIAAQAK QSGLSKLFVL TTRSIHWFQE RGFTPVDIDL
LPESKKQLYN YQRKSKVLMA DLG