Gene EcolC_4058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4058 
SymbolargC 
ID6065244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4476069 
End bp4477073 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content53% 
IMG OID641603481 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001726984 
Protein GI170022030 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.689571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATA CGCTGATTGT GGGTGCCAGC GGCTACGCTG GCGCAGAGCT AGTGACCTAT 
GTAAATCGCC ATCCGCATAT GAACATAACC GCTTTGACTG TTTCAGCGCA AAGCAATGAT
GCGGGAAAGT TAATCTCCGA TTTGCATCCG CAGCTAAAAG GCATCGTTGA TCTGCCGTTG
CAGCCGATGT CGGATATCAG CGAGTTTAGC CCAGGGGTGG ACGTAGTGTT TCTCGCCACC
GCCCATGAAG TTAGCCACGA TTTAGCGCCG CAGTTTCTTG AAGCGGGCTG CGTGGTGTTC
GACCTTTCCG GCGCGTTTCG TGTTAACGAC GCCACCTTCT ATGAAAAATA TTACGGCTTT
ACCCATCAAT ACCCGGAACT GTTGGAACAG GCAGCCTACG GTCTGGCGGA GTGGTGCGGT
AATAAATTAA AAGAAGCGAA TTTGATTGCG GTGCCGGGCT GTTATCCGAC GGCGGCACAG
CTGGCGCTGA AACCGTTGAT TGATGCCGAT CTTCTTGACC TCAATCAGTG GCCGGTGATC
AACGCCACCA GCGGCGTGAG CGGTGCAGGG CGTAAAGCGG CCATTTCAAA CAGCTTTTGT
GAAGTTAGCC TGCAACCGTA TGGCGTCTTT ACTCATCGCC ATCAACCAGA GATCGCCACA
CACCTCGGTG CTGACGTTAT CTTCACCCCA CATCTGGGCA ATTTCCCGCG CGGCATTCTC
GAAACCATTA CCTGCCGCCT GAAATCGGGT GTGACCCAGG CGCAAGTCGC GCAAGTGTTA
CAGCAGGCGT ATGCCCATAA ACCGCTGGTG CGGCTGTATG ACAAAGGCGT TCCGGCGCTG
AAAAATGTCG TTGGGCTGCC ATTTTGCGAT ATCGGGTTTG CCGTTCAGGG CGAGCATTTG
ATTATTGTGG CGACCGAAGA CAACTTACTG AAAGGCGCGG CGGCACAAGC GGTACAGTGC
GCCAATATTC GTTTCGGCTA TGCGGAAACG CAGTCTCTTA TTTAA
 
Protein sequence
MLNTLIVGAS GYAGAELVTY VNRHPHMNIT ALTVSAQSND AGKLISDLHP QLKGIVDLPL 
QPMSDISEFS PGVDVVFLAT AHEVSHDLAP QFLEAGCVVF DLSGAFRVND ATFYEKYYGF
THQYPELLEQ AAYGLAEWCG NKLKEANLIA VPGCYPTAAQ LALKPLIDAD LLDLNQWPVI
NATSGVSGAG RKAAISNSFC EVSLQPYGVF THRHQPEIAT HLGADVIFTP HLGNFPRGIL
ETITCRLKSG VTQAQVAQVL QQAYAHKPLV RLYDKGVPAL KNVVGLPFCD IGFAVQGEHL
IIVATEDNLL KGAAAQAVQC ANIRFGYAET QSLI