Gene GM21_3347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3347 
SymbolargC 
ID8138714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3874725 
End bp3875765 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content64% 
IMG OID644870965 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_003023130 
Protein GI253701941 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.0512328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGG TCGCCATCGT CGGCGCAAGC GGTTATACCG GTGTTGAACT GCTGCGGATT 
CTCCATTCCC ATCCCGAGGT CGCCGTCACC TGCGTCACCT CCGAACAAAG CGCCGGGCGC
CCGGTCAGCT CCGTCTTCCC GAGCCTGCGC GGCAGGTGCG ATATCGTCCT GGAGAACCTG
GAGCCGGTAG GGATCGCGGA GAAGGTCGAC ATCGTCTTCA CGGCGCTGCC GCATAAGGCC
GCCATGGAAG TGGTGCCCAC CTTCATGAAG ATGGGAAAGG ACGTGATCGA TCTTTCCGCC
GACTATCGCA TTCACGACGC GGACACCTAC GGCAAGTGGT ACGAGCCGCA CCTGAACCCG
GAGCTTTTGC CGGAGGCGGT GTACGGCATC CCGGAGCTGC GTCGCGCCGA GATCGCCGAG
GCCTCGCTGA TCGCGAACCC CGGCTGCTAC CCGACCAGCG TCATCCTCGG GCTCGCGCCG
CTTTTGAAGG GGAAGGTGAT CGATCCCAGG TCCATCATCG TGGACGCCGC CTCCGGCACC
TCCGGCGCTG GGCGCGGAGC CAAGGTGGAC AACCTCTACT GTGAGGTGAA CGAAGGGTTC
CGCGCCTACG GCGTGGGTGG CGTGCACAGG CACATCCCGG AGATAGAGCA GGAGCTGTCG
CTTCTGGCCG GGAGCCCGCT CAACATCACC TTCACTCCGC ACCTGGTTCC CATGGACCGC
GGCATCTTGT CGACCATCTA CTCCCAGACG GCCGGCAGCG TCAAGGCCGC CGATCTGATC
GCTCTGTACG AGGCGTTTTA CGACGGCGAG CCTTTCGTCA GGGTGCTGCC GGAGGGTGTT
CTCCCCTCCA CGGCGCACGT CAGGGGCTCC AACTTCTGCG ACATCGGCAT CACGGTCGAC
CAGAGGACCG GGCGGGTCAT CGTCATCTCC GCCATAGATA ACCTGGTGAA GGGGGCTTCC
GGGCAGGCGG TGCAGAACAT GAACCTGATG TGCGGCCTCC CCGAGACCCT CGGACTGGAT
CTCCTGCCGG TCTTTCCTTA A
 
Protein sequence
MLKVAIVGAS GYTGVELLRI LHSHPEVAVT CVTSEQSAGR PVSSVFPSLR GRCDIVLENL 
EPVGIAEKVD IVFTALPHKA AMEVVPTFMK MGKDVIDLSA DYRIHDADTY GKWYEPHLNP
ELLPEAVYGI PELRRAEIAE ASLIANPGCY PTSVILGLAP LLKGKVIDPR SIIVDAASGT
SGAGRGAKVD NLYCEVNEGF RAYGVGGVHR HIPEIEQELS LLAGSPLNIT FTPHLVPMDR
GILSTIYSQT AGSVKAADLI ALYEAFYDGE PFVRVLPEGV LPSTAHVRGS NFCDIGITVD
QRTGRVIVIS AIDNLVKGAS GQAVQNMNLM CGLPETLGLD LLPVFP