Gene Information |
Locus tag | MCA2084 |
Symbol | argA |
ID | 3104535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2243667 |
End bp | 2245013 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637171238 |
Product | N-acetylglutamate synthase |
Protein accession | YP_114514 |
Protein GI | 53803879 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0548] Acetylglutamate kinase; [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR00761] acetylglutamate kinase; [TIGR01890] amino-acid N-acetyltransferase |
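The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2243667
end_bp = 2245013

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1347
print(protein_length)  # 448
```

Both results agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields listed in the record.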
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.212235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCAG ACACAGCCCC CCAGACCTTC GTACGCTGGC TCCGTAGTGC CGCTCCCTAC ATCCATGCGC ACCGGAACCG CACCTTCGTC GTCAGTTTCG GCGGCGAAGC CCTGACCGAT GGGCGCTTCG CGGAGCTGGT CCACGACTTC GCCTTGCTCC ACAGCTTGGG GGTCGGGCTG GTGCTGGTGC ACGGTACCCG TCCTCAAGTC GAGCGGCGCC TGCGCAGCCG CGGTGCGGAA CTGCGCTACC ACCAGGGTCT GCGGATCACC GATGCGACCG GGCTGGAATG CGTGCAGGAG GCCGCCGGCG CCGTCCGTGT CGAGATCGAG GCCCTGCTGT CGATGGGCGC CGCCAATTCA CCCATGGCCG GGGCACGGAT CCGGGTGGCC TCGGGCAATT TCGTGGTCGC CAAGCCTTTG GGCGTACGCG ACGGCGTGGA TTTCGGCTAC ACCGGCGAGG TGCGCCGGGT GGACGCCGGG GCGATGCGCC GGCTGCTGGA CGAAGGCAGC GTGGTTCTGG TGTCTCCGCT CGGCTATTCC CCCACCGGGG AAATCTTCAA CCTCAGCGCC GAGGAGGTGG CGACTGCCGT GGCCGGAGCC CTCGGGGCCG ACAAGCTTCT GTTGCTGATG GAACAGCCCT GCGCCGGTCC CGACGGCCGG ATGATCCACC AGATCACGGT GCAGGAAGCC GAGCGTCTGC TGGAACGCGG CGGTTCGCTG GAACCGGGCG TGGCCGCCCA CCTGGCCGCC GCCCTCAAGG CCTGCCGCTC CGGCGTGGCC CGCGCCCATC TGCTGGACCG CCACATCGAC GGCGCCCTGC TGCTGGAGCT GTTCACCCGC GACGGCGTCG GCACCCTCGT CAGCGCCAAT CCGTTCGAGG AATTGCGCCC GGCCCGCATC AACGACGTGG TCGGCATCCT GGATCTGATC CGGCCGCTCG AACAGACCGG CGCCTTGGTG AACCGCTCCC GTGAACGCCT GGAGACTGAA ATCGAGGACT ACGTCGTCAT CGAGCGCGAC GGACTGATCG TAGGCTGCGC CGCCCTGCAT CCATTTCCGG CCGAATCCGT GGGCGAGTTC GCCTGCCTCG CCCTGCACCC CGAATACCGG GGCGAAAGCC GGGGCGAAAG GCTGCTGGAA TTCATCGAGC AGCGGGCCCG CAAGCTCGGC CTGGCGCGGC TCTTCGCACT GACCACCCAG GCCATGCAGT GGTTCCGCGA ACGCGGTTTC GAGCCTGCCT CACTCACGGA CCTGCCTGCG CAGCGGCGTG CCAGCTACAG CCCCGAGCGC AACTCCCAGG TGCTGATCAA GACGATCAGG CCGTGTGAAG TTCCCCGAAA AAGCTGA
|
Protein sequence | MNPDTAPQTF VRWLRSAAPY IHAHRNRTFV VSFGGEALTD GRFAELVHDF ALLHSLGVGL VLVHGTRPQV ERRLRSRGAE LRYHQGLRIT DATGLECVQE AAGAVRVEIE ALLSMGAANS PMAGARIRVA SGNFVVAKPL GVRDGVDFGY TGEVRRVDAG AMRRLLDEGS VVLVSPLGYS PTGEIFNLSA EEVATAVAGA LGADKLLLLM EQPCAGPDGR MIHQITVQEA ERLLERGGSL EPGVAAHLAA ALKACRSGVA RAHLLDRHID GALLLELFTR DGVGTLVSAN PFEELRPARI NDVVGILDLI RPLEQTGALV NRSRERLETE IEDYVVIERD GLIVGCAALH PFPAESVGEF ACLALHPEYR GESRGERLLE FIEQRARKLG LARLFALTTQ AMQWFRERGF EPASLTDLPA QRRASYSPER NSQVLIKTIR PCEVPRKS
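The sequences above are displayed in space-separated groups of ten letters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, run here only on the first three groups of the gene sequence as a short demo. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Join the whitespace-separated groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C letters among all letters in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-letter groups of the gene sequence, copied from the record.
excerpt = "ATGAATCCAG ACACAGCCCC CCAGACCTTC"
seq = clean_sequence(excerpt)

print(len(seq))   # 30
print(seq[:3])    # ATG
print(gc_fraction(seq))
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field of the record; the excerpt alone is too short to be representative.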