Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2046 |
Symbol | |
ID | 4710120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2249335 |
End bp | 2250669 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639856519 |
Product | N-acetylglutamate synthase |
Protein accession | YP_001003612 |
Protein GI | 121998825 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0548] Acetylglutamate kinase [COG1246] N-acetylglutamate synthase and related acetyltransferases |
TIGRFAM ID | [TIGR01890] amino-acid N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.398173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCG CAGACCTCGA CCCGCAGCAG TTCGTGGAGT GGTTCCGCCA CGCGGCACCG TACATCAACG CCCACCGCGG GCGGACCTTC GTGATCGCCT TCCCCGGTGC CGCCGTCGAG CCGCCGCAGA TCGATGGCCT GGTCCACGAC CTGGCCGTCC TGGCCAGTCT GGGGGTGCGC CTGGTGCTGG TCCCCGGCGC CCGCCCCCAG GTGGAGCAGC GCCTGCAGCG CCGCGGCCTG GCAGCCCGCT ACGCCCAGGG CAGCACCGGG CTGCGCATCA CCGACGCCGA GGCCCTGGAG TGCGTGCGCG ACGCCATCGG CGAGGTGCGC ACGCGGCTCG AGGCGCGCCT CTCCACCGGG CTGGCCACCT CACCCATGGC CGGACTGCGC ATCCGGGTGG CCTCCGGCAA CGTCATCACC GCCCGCCCGG TGGGCGTCCG CGACGGCATC GATCACCTCT ACACCGGCGA GGTCCGGCGG GTGGACGCCG AGGCCCTGCG CCGGCGGCTC GACGACGGCG ACATCGCCCT GGTCTCGCCG CTGGGCTACT CGCCCACCGG CGAGGCGTTC AACCTCTCCG CCGAGTCGGT GGCGCGGGCC GTGGGCGAGG CCCTGGCCGC CGACAAGCTC ATCCACCTGA CCCGGCACGC ACCGCTGCAC GAGGCCGACG GCACCCCGGT GCGCGAACTC ACCCCGCGCG AGGCGCGGGC CCGGCTGGAC ACCGACGGCC TGGCCGGCGA CGCCCGGCGC CTGCTGCACA GCGCCCACCA GGCCTGCCTG GGCGGCGTGG CCCGGGTGCA CCTGCTCGAC CGCAATCAGC ACGGCGCCCT GCTCCTGGAG CTCTTCACCC GCGACGGCGT CGGCACCCTG ATCGCCCCGG AGCCGTTCGA GTCCCTGCGC ACCGCTGGGG TGGACGACAT CCCGGGGATC CTGGGGCTGA TCCGCCCGCT GGAGGAGAGC GGCGCGCTGG TCTACCGCCC CCAGGAGCTG CTTGAGGAGC AGATCGGCAC CTTCACCGTG GCCGAACGCG ACGGCGCCGT GATCGCCACC GGGGCGCTGC TGCCCTGGCC GGCGGAGGAC GCCGGGGAGA TCGCGTGCCT GGCGGTCGAT CCGGACTACC GCGGCGCCGG GCGCGCCGAT ACGCTGGTCC GGCGCCTGGA GCAGCAGGCC CGCAACCACG GCCTGCGCCG GCTGTTCGTG CTCACCACCC GGGCCGAGCA CTGGTTCCGC GAGCGCGGTT TCGAACCCGC CGGCCCCGAG GCCCTGCCGG CGGCCCGCCG GGCCCTCTAC GACCAGACCC GGGGCTCGAA GGTCCTGGTC CGGGACATCC CGTAA
|
Protein sequence | MRLADLDPQQ FVEWFRHAAP YINAHRGRTF VIAFPGAAVE PPQIDGLVHD LAVLASLGVR LVLVPGARPQ VEQRLQRRGL AARYAQGSTG LRITDAEALE CVRDAIGEVR TRLEARLSTG LATSPMAGLR IRVASGNVIT ARPVGVRDGI DHLYTGEVRR VDAEALRRRL DDGDIALVSP LGYSPTGEAF NLSAESVARA VGEALAADKL IHLTRHAPLH EADGTPVREL TPREARARLD TDGLAGDARR LLHSAHQACL GGVARVHLLD RNQHGALLLE LFTRDGVGTL IAPEPFESLR TAGVDDIPGI LGLIRPLEES GALVYRPQEL LEEQIGTFTV AERDGAVIAT GALLPWPAED AGEIACLAVD PDYRGAGRAD TLVRRLEQQA RNHGLRRLFV LTTRAEHWFR ERGFEPAGPE ALPAARRALY DQTRGSKVLV RDIP
|
| |