Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0559 |
Symbol | argC |
ID | 3784779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 645222 |
End bp | 646250 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810641 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_411259 |
Protein GI | 82701693 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0973109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAGA TTGGCATCGT AGGTGGAACC GGATATACCG GAGTAGAGTT GCTGCGCATC TTGTCTCAAC ATCCGGAGGT GAATATCGAG GCGATTACAT CGCGCAAGGA AGCTGGCATG GATGTCGCCC AGTTGTTTCC AAGTCTGCGT GGACGGATAG AGTTAAAATT TTCCGATCCT GCAGAGGCAA ACCTCGAAAA ATGCGATGTG GTATTTTTTG CCACTCCGAA TGGCATTGCC ATGAAACAGG TTCCATCCCT GCTCGATGCA GGCGTGCGGA TAATTGATCT TGCCGCCGAT TTTCGCATCA AGGATATCGC CGTCTGGGAA AAATGGTATG GAATGCCCCA TGCGTGCCCC GAACTGGTTG CGGAGGCGGT CTACGGTCTG CCGGAAATCA ATCGCGACAG GATAAAGACA GCGCGGCTCA TAGCCAATCC CGGCTGCTAT CCCACGGCCG TGCAGCTGGG TTTCCTTCCC CTGGTGGAAT CGGGTGCCGC GGATCTGGAT CATCTCGTCG CAGACGCGAA ATCGGGCGTA TCAGGCGCGG GACGCAACGC GGAAATTCAT ACCCTTTTTG CTGAGGCAGC GGACAACTTC AAGGCCTATG GAGTATCCGG GCATCGCCAT CTCCCCGAGA TCAGGCAGGG CTTGTCGCAA GCGGCAAAAC ACCCCGTCGG TCTGACATTC GTACCACACC TCACCCCTAT GATTCGCGGC ATTCATGCCA CGCTTTATGT AAAACTCCTT AAAGAAGTCG ACTTGCAGGC GTTATATGAA AACCGCTACG TGAATGAACC GTTTGTCGAC GTGCTTCCGG CAGGCTCCCA TCCGGAAACC CGCTCCGTGC GGGGTTCGAA TCTCTGCCGC ATCGCCGTCC ATCGACCCCA GGGGGGGGAT ACGGCCGTCA TACTCTCGGT TACAGACAAC CTTGTCAAGG GTGCGGCCGG ACAGGCAGTG CAGAATATGA ATCTCATGTT CGGCTTGCCG GAAACCCTTG CCATCACCCA TCTTCCCTTG TTTCCGTAA
|
Protein sequence | MLKIGIVGGT GYTGVELLRI LSQHPEVNIE AITSRKEAGM DVAQLFPSLR GRIELKFSDP AEANLEKCDV VFFATPNGIA MKQVPSLLDA GVRIIDLAAD FRIKDIAVWE KWYGMPHACP ELVAEAVYGL PEINRDRIKT ARLIANPGCY PTAVQLGFLP LVESGAADLD HLVADAKSGV SGAGRNAEIH TLFAEAADNF KAYGVSGHRH LPEIRQGLSQ AAKHPVGLTF VPHLTPMIRG IHATLYVKLL KEVDLQALYE NRYVNEPFVD VLPAGSHPET RSVRGSNLCR IAVHRPQGGD TAVILSVTDN LVKGAAGQAV QNMNLMFGLP ETLAITHLPL FP
|
| |