Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2026 |
Symbol | |
ID | 4710379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2228816 |
End bp | 2230018 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639856499 |
Product | arginine biosynthesis bifunctional protein ArgJ |
Protein accession | YP_001003592 |
Protein GI | 121998805 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0302452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGGCG AGGAAGCCGT GATCCATCCG GTGCCGGGGT TCCGGCTCGG TACCGTGAGT GCCGGCATCC GCAAGCCGGG GCGGCCTGAT CTGGTGGTCA TGGCCTTGCC CCCGGAAGGG CGCGCGGCGG GGGTGTTCAC CCGCAATCGC TTCCGCGCCG CTCCGGTCCG CATCGCCGAG CGCCACCTGG CTGCGACCGC GCCGCGTTAC CTGCTGGTCA ATACCGGCTT CGCCAACGCC GGGACCGGCG AGCGCGGCGA GGCCGACGCC CTGGCCTGCT GCCAGGCCCT GGCTGATCAG GCCGGGTGCG TGCTCGAGGC GGTGGTGCCC TTCTCCACCG GCGTGATCGG TGAGCCCCTC CCCGTCGAGC GCGTCACCGC CGGGATCCCC GGGGCCCTGG ACGCGCTGGA CGAGAACGGC TGGCAGGCCG CCGCGGCGGG GATCCAGACC ACCGACACCC GCGACAAGAT CGCTAGCTAC CAGGTCGACC TCGCCGGCGG GACCTGCACG GTGACCGGGA TCGCCAAGGG GGCGGGGATG ATCCGCCCGG ACATGGCCAC CATGCTGGCC TTCGTGGCCA CCGACGCCGA TCTCTCCGAC GCCGCCCTGG ATGCCTCTCT GCGCCGTGCC ACCGAGCGCT CCTTCAACCG CGTAACGGTG GACGGCGACA CCTCCACCAA CGACGCCTGT CTGCTGGCCG CCACCGGTCG CGGCCCGCGG GTGCCCGATC ACGGCGGCGA CTTCGAGCGC TTCCAGGCGG CCGTCGAGGC GGTCTGCATT GACCTGGCGC GGGCCATCGC CGCCGACGGG GAGGGGGCGA CGCGGCTGGT GGATGTGGTC GTCGAGGGGG CGCGGGACAC CGCCGAGGCG GAGCGGGTGG CGTTCACCGT GGCGGAGTCG CCGCTGGTCA AGACGGCGCT GGCCGCCGCC GACCCCAACT GGGGGCGGAT CCTGGCCGCC GTGGGGCGCG CCGGCCTGGA CGACCTCGAC GTTGATGCCG TGGCGCTCAC CATCGGCGAT CAGGTGGTTG CCGAGCACGG CGGCGCCGCG GCGGGCTACG ACGAAGCGGC CGCCGCGGCG CACCTCTCCG GCTCCGAGGT GAGGCTCGGC ATCGAGCTCG GCCGCGGGCC GGCAGCGGCC ACCGTGTGGA CCTGCGACTT CACGGCGGAG TACGTCCGCA TCAACGCCGA GTATCGAACC TGA
|
Protein sequence | MSGEEAVIHP VPGFRLGTVS AGIRKPGRPD LVVMALPPEG RAAGVFTRNR FRAAPVRIAE RHLAATAPRY LLVNTGFANA GTGERGEADA LACCQALADQ AGCVLEAVVP FSTGVIGEPL PVERVTAGIP GALDALDENG WQAAAAGIQT TDTRDKIASY QVDLAGGTCT VTGIAKGAGM IRPDMATMLA FVATDADLSD AALDASLRRA TERSFNRVTV DGDTSTNDAC LLAATGRGPR VPDHGGDFER FQAAVEAVCI DLARAIAADG EGATRLVDVV VEGARDTAEA ERVAFTVAES PLVKTALAAA DPNWGRILAA VGRAGLDDLD VDAVALTIGD QVVAEHGGAA AGYDEAAAAA HLSGSEVRLG IELGRGPAAA TVWTCDFTAE YVRINAEYRT
|
| |