Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1234 |
Symbol | argC |
ID | 7083894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1367240 |
End bp | 1368277 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698250 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_002354889 |
Protein GI | 217969655 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000699076 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAAGG TTGGAATCGT AGGCGGTACC GGATATACCG GCGTCGAGCT CTTGCGTCTG CTGGCGCGCC ACCCCGGGGT CGAGCTGACC GCGATCACCT CGCGCGGCGA GGCCGGCATG GCGGTCTCCG ACATGTTCCC CAGCCTGCGC CGCCGGGTCG ATCTCAAGTT CGTCACCCCG CAGGACGCCG CCCTGGAGAA GTGCGACGCG GTGTTCTTCG CCACGCCCAA CGGCATCGCG ATGAAGCAGG CGGCCGAGTT GGTCGCTGCC GGGGTGCGGG TGATCGACCT CGCCGCCGAC TTCCGCATCC GCGACGTCGC CGAGTGGCAG AAGTGGTACG GCATGGAGCA TGCGGCGCCG GAACTCGTCG CCGAGGCGGT GTATGGCCTG CCCGAGGTCA ATCGCGCGCA GATCCGCGGC GCGCGCGTGC TCGCCAACCC GGGCTGCTAC CCCACCGCGG TCCAGCTCGG CTTCCTGCCC TTGGTCGAGG CCGGGCTGAT CGACACCGAT CACCTGATCG CCGATGCCAA GTCGGGCGTC TCGGGCGCCG GCCGCAAGGC CGAGGTGCAC ACCCTGCTTC CGGAGGCCGC CGACTCCTTC AAGGCCTATG GCGTGCCGGG CCATCGCCAT CTGCCGGAGA TCCGTCAGGG GCTGGCTCTG GCCGCCGGGC ACGCGGTCGG GCTGACCTTC GTGCCCCACC TGACGCCGAT GATCCGCGGC ATCCACGCCA CGCTGTACGG TCGTCTGAAG CCGGGGGGCG ACGCTGCCGA CCTGCAGGGG CTGTACGAGA AGCGCTATGC TGGCGAGGCC TTCGTCGACG TCATGCCCGC GGGCAGCCAT CCCGAGACGC GCTCGGTGCG CGCCTCCAAC CTGTGCCGCA TCGCGGTCCA TCGCCCGCAG GGCGGCGACA CCGTGGTGGT GTTGTCGGTG ATCGACAATC TGGTCAAGGG CGCGGCAGGC CAGGCGGTGC AGAACCTCAA CATCATGTTC GGTTTCGACG AGGAAACCGG GCTCGACATC GTGCCGGTCA GCCCCTGA
|
Protein sequence | MVKVGIVGGT GYTGVELLRL LARHPGVELT AITSRGEAGM AVSDMFPSLR RRVDLKFVTP QDAALEKCDA VFFATPNGIA MKQAAELVAA GVRVIDLAAD FRIRDVAEWQ KWYGMEHAAP ELVAEAVYGL PEVNRAQIRG ARVLANPGCY PTAVQLGFLP LVEAGLIDTD HLIADAKSGV SGAGRKAEVH TLLPEAADSF KAYGVPGHRH LPEIRQGLAL AAGHAVGLTF VPHLTPMIRG IHATLYGRLK PGGDAADLQG LYEKRYAGEA FVDVMPAGSH PETRSVRASN LCRIAVHRPQ GGDTVVVLSV IDNLVKGAAG QAVQNLNIMF GFDEETGLDI VPVSP
|
| |