Gene EcolC_0897 details


Gene Information

Locus tag: EcolC_0897
Symbol:
ID: 6064574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 970132
End bp: 971463
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 53%
IMG OID: 641600300
Product: N-acetylglutamate synthase
Protein accession: YP_001723893
Protein GI: 170018939
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0548] Acetylglutamate kinase
        [COG1246] N-acetylglutamate synthase and related acetyltransferases
TIGRFAM ID: [TIGR01890] amino-acid N-acetyltransferase
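
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, and a quick consistency check can be sketched as follows. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(970132, 971463)
print(length_bp)                  # 1332
print(protein_length(length_bp))  # 443
```

Both values agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields of this record.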


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.763096
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTAAAGG AACGTAAAAC CGAGTTGGTC GAGGGATTCC GCCATTCGGT TCCCTATATC 
AATACCCACC GGGGAAAAAC GTTTGTCATC ATGCTCGGCG GTGAAGCCAT TGAGCATGAG
AATTTCTCCA GTATCGTTAA TGATATCGGG TTGTTGCACA GCCTCGGCAT CCGTCTGGTG
GTGGTCTATG GCGCACGTCC GCAGATCGAC GCAAATCTGG CTGCGCATCA CCACGAACCG
CTGTATCACA AGAATATACG TGTGACCGAC GCCAAAACAC TGGAACTGGT GAAGCAGGCT
GCGGGAACAT TGCAACTGGA TATTACTGCT CGCCTGTCGA TGAGTCTCAA TAACACGCCG
CTGCAGGGCG CGCATATCAA CGTCGTCAGT GGCAATTTTA TTATTGCCCA GCCGCTGGGC
GTCGATGACG GCGTGGATTA CTGCCATAGC GGGCGTATCC GGCGGATTGA TGAAGACGCG
ATCCATCGTC AACTGGACAG CGGTGCAATA GTGCTAATGG GGCCGGTCGC TGTTTCAGTC
ACTGGCGAGA GCTTTAACCT GACCTCGGAA GAGATTGCCA CTCAACTGGC CATCAAACTG
AAAGCTGAAA AGATGATTGG TTTTTGCTCT TCCCAGGGCG TCACTAATGA CGACGGTGAT
ATTGTCTCCG AACTTTTCCC TAACGAAGCG CAAGCGCGGG TAGAAGCCCA GGAAGAGAAA
GGCGATTACA ACTCCGGTAC GGTGCGCTTT TTGCGTGGCG CAGTGAAAGC CTGCCGCAGC
GGCGTGCGTC GCTGTCATTT AATCAGTTAT CAGGAAGATG GCGCGCTGTT GCAAGAGTTG
TTCTCACGCG ACGGTATCGG TACGCAGATT GTGATGGAAA GCGCCGAGCA GATTCGTCGC
GCAACAATCA ACGATATTGG CGGTATTCTG GAGTTGATTC GCCCACTGGA GCAGCAAGGT
ATTCTGGTAC GCCGTTCTCG CGAGCAGCTG GAGATGGAAA TCGACAAATT CACCATTATT
CAGCGCGATA ACACGACTAT TGCCTGCGCC GCGCTCTATC CGTTCCCGGA AGAGAAGATT
GGGGAAATGG CCTGTGTGGC AGTTCACCCG GATTACCGCA GTTCATCAAG GGGTGAAGTT
CTGCTGGAAC GCATTGCCGC TCAGGCGAAG CAGAGCGGCT TAAGCAAATT GTTTGTGCTG
ACCACGCGCA GTATTCACTG GTTCCAGGAA CGTGGATTTA CCCCAGTGGA TATTGATTTA
CTGCCCGAGA GCAAAAAGCA GTTGTACAAC TACCAGCGTA AATCCAAAGT GTTGATGGCG
GATTTAGGGT AA
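
A GC content value like the one in this record can be recomputed from a sequence printed in 10-base blocks. The sketch below assumes GC content is defined as the fraction of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper name. It is demonstrated only on the final row of the gene sequence, not on the full gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last row of the gene sequence: 12 bases, 4 of them G, none C.
last_row = "GATTTAGGGT AA"
print(f"{gc_percent(last_row):.1f}")  # 33.3
```

Passing the complete gene sequence (all rows joined) to the same function gives the whole-gene figure.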
 
Protein sequence
MVKERKTELV EGFRHSVPYI NTHRGKTFVI MLGGEAIEHE NFSSIVNDIG LLHSLGIRLV 
VVYGARPQID ANLAAHHHEP LYHKNIRVTD AKTLELVKQA AGTLQLDITA RLSMSLNNTP
LQGAHINVVS GNFIIAQPLG VDDGVDYCHS GRIRRIDEDA IHRQLDSGAI VLMGPVAVSV
TGESFNLTSE EIATQLAIKL KAEKMIGFCS SQGVTNDDGD IVSELFPNEA QARVEAQEEK
GDYNSGTVRF LRGAVKACRS GVRRCHLISY QEDGALLQEL FSRDGIGTQI VMESAEQIRR
ATINDIGGIL ELIRPLEQQG ILVRRSREQL EMEIDKFTII QRDNTTIACA ALYPFPEEKI
GEMACVAVHP DYRSSSRGEV LLERIAAQAK QSGLSKLFVL TTRSIHWFQE RGFTPVDIDL
LPESKKQLYN YQRKSKVLMA DLG
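
The gene begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first and last rows of the gene sequence. It is a hand-rolled illustration, not the database's own pipeline; the codon table uses the standard amino acid assignments, which table 11 shares, and `translate_cds` is a hypothetical name.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "GTGGTAAAGG AACGTAAAAC CGAGTTGGTC GAGGGATTCC GCCATTCGGT TCCCTATATC"
print(translate_cds(first_row))                      # MVKERKTELVEGFRHSVPYI
print(translate_cds("GATTTAGGGT AA", is_start=False))  # DLG
```

The two outputs match the first 20 and last 3 residues of the protein sequence above.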