Gene EcHS_A4192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4192 
SymbolargC 
ID5594140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4184667 
End bp4185671 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content53% 
IMG OID640923294 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001460753 
Protein GI157163435 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAATA CGCTGATTGT GGGTGCCAGC GGCTACGCTG GCGCAGAGCT AGTGACCTAT 
GTAAATCGCC ATCCGCATAT GAACATAACC GCTTTGACTG TTTCAGCGCA AAGCAATGAT
GCGGGAAAGT TAATCTCCGA TTTGCATCCG CAGCTAAAAG GCATCGTTGA TCTGCCGTTG
CAGCCGATGT CGGATATCAG CGAGTTTAGC CCAGGGGTGG ACGTAGTGTT TCTCGCCACC
GCCCATGAAG TTAGCCACGA TTTAGCGCCG CAGTTTCTTG AAGCGGGCTG CGTGGTGTTC
GACCTTTCCG GCGCGTTTCG TGTTAACGAC GCCACCTTCT ATGAAAAATA TTACGGCTTT
ACCCATCAAT ACCCGGAACT GTTGGAACAG GCAGCCTACG GTCTGGCGGA GTGGTGCGGT
AATAAATTAA AAGAAGCGAA TTTGATTGCG GTGCCGGGCT GTTATCCGAC GGCGGCACAG
CTGGCGCTGA AACCGTTGAT TGATGCCGAT CTTCTTGACC TCAATCAGTG GCCGGTGATC
AACGCCACCA GCGGCGTGAG CGGTGCAGGG CGTAAAGCGG CCATTTCAAA CAGCTTTTGT
GAAGTTAGCC TGCAACCGTA TGGCGTCTTT ACTCATCGCC ATCAACCAGA GATCGCCACA
CACCTCGGTG CTGACGTTAT CTTCACCCCA CATCTGGGCA ATTTCCCGCG CGGCATTCTC
GAAACCATTA CCTGCCGCCT GAAATCGGGT GTGACCCAGG CGCAAGTCGC GCAAGTGTTA
CAGCAGGCGT ATGCCCATAA ACCGCTGGTG CGGCTGTATG ACAAAGGCGT TCCGGCGCTG
AAAAATGTCG TTGGGCTGCC ATTTTGCGAT ATCGGGTTTG CCGTTCAGGG CGAGCATTTG
ATTATTGTGG CGACCGAAGA CAACTTACTG AAAGGCGCGG CGGCACAAGC GGTACAGTGC
GCCAATATTC GTTTCGGCTA TGCGGAAACG CAGTCTCTTA TTTAA
 
Protein sequence
MLNTLIVGAS GYAGAELVTY VNRHPHMNIT ALTVSAQSND AGKLISDLHP QLKGIVDLPL 
QPMSDISEFS PGVDVVFLAT AHEVSHDLAP QFLEAGCVVF DLSGAFRVND ATFYEKYYGF
THQYPELLEQ AAYGLAEWCG NKLKEANLIA VPGCYPTAAQ LALKPLIDAD LLDLNQWPVI
NATSGVSGAG RKAAISNSFC EVSLQPYGVF THRHQPEIAT HLGADVIFTP HLGNFPRGIL
ETITCRLKSG VTQAQVAQVL QQAYAHKPLV RLYDKGVPAL KNVVGLPFCD IGFAVQGEHL
IIVATEDNLL KGAAAQAVQC ANIRFGYAET QSLI