Gene Information |
Locus tag | Haur_4139 |
Symbol | |
ID | 5736000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5287056 |
End bp | 5288243 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641281293 |
Product | arginine biosynthesis bifunctional protein ArgJ |
Protein accession | YP_001546899 |
Protein GI | 159900652 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
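The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon (the record lists the gene as unspliced and not a pseudogene). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 5287056
END_BP = 5288243
GENE_LENGTH_BP = 1188
PROTEIN_LENGTH_AA = 395


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1188 bp span corresponds to 396 codons, that is, 395 residues plus the stop codon, which matches the listed protein length.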
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCT TTCGTTTTGC CGCCGGCTTC CGCAGTGCTG CGGGGCGATG TGGCTTGAAG GCCAGTGGTA ATCCTGATTT AAGCTTACTT GTTGCTGATA ATGTTTGCAC CGGGGCTGGG GTTTTTACTA CCAGCCTCGT CAAAGCCGCG CCAGTGCTCT ACGATCAAGC AGTTTTGGCC GAGCATGCCA GCGAAATTCG GGCAATTATT GCCAATGCTG GCTGTGCCAA CGCTTGTACC GGAGCGCAGG GCGATGCGGC GGCTCGTGAG ATGGCACGTT TAGCGGCTGA AGCAGTTGGT TGCGAGCCAC ACCAAGTTTT GGTGCTCTCA ACGGGCGTAA TCGGCCATCA ACTGAATGTT GAAAAAGTTG CCAAGGGCGT GGCGGCAATT GCGCCTGAAC TGGGCGTTGA GCATGCTCCA GCGCTGTCCG AGGCGATTAT GACCACCGAT ACCCGCCCCA AAACGTCGAG CGCCACGGCG GTGATCGATG GAGTTGAGGT AACGGTAGCT GGGGTGGCCA AAGGCGCAGG CATGATCCAT CCGATGATGG CAACCATGCT TTCAATTGTC ACCACCGATG CAGCAATCGA TGCCGATTTG GCCCAAAGTT TGTTGCGCGA AGTCACCGAT GCATCATTTA ACTGTGTAAC GGTGGATGGC GACCCGAGTA CCAACGATAC GCTATTGTTG TTGGCCTCAG GCGTGAGTGG TGTGACGATC AATGCCAGTA ATATTGCAGC CTTCCGCCAA GCGCTTGAAA TTGTCTGCAT TGATTTGGCC AAACAAATTG CTGCCGATGG CGAAGGCGCA ACCAAGCTGA TTACGATTAC GGTTGATCAT GCGCCGAGTG TGGCTGCCGC CCGCACCGTT GCCCGCAAAA TTGCCTGCTC ACCCTTGGTC AAAACCGCGA TTCACGGCGG CGATCCCAAT TGGGGGCGAA TTTTGGCAGC AGCCGGAGTC GCGGGTGTGC CATTCGATCC CAGCCACGTT GAATTGTGGT TGGGCGAGGT GCAATTAGTT GCTGGTGGCA CGCCCACCAA CTACAACGAA CGCGAAGCCG CCAGCCAAAT CGGCGGCCAA CAAGTGGCAA TTCGCCTAAA TCTTGGGGCT GGCGCGGCCA CTGGCTACGC TTGGACCTGC GATTTTAGCG CGGAATATGT GCGAATTAAC GCTGATTATC GGACGTAG
|
Protein sequence | MSIFRFAAGF RSAAGRCGLK ASGNPDLSLL VADNVCTGAG VFTTSLVKAA PVLYDQAVLA EHASEIRAII ANAGCANACT GAQGDAAARE MARLAAEAVG CEPHQVLVLS TGVIGHQLNV EKVAKGVAAI APELGVEHAP ALSEAIMTTD TRPKTSSATA VIDGVEVTVA GVAKGAGMIH PMMATMLSIV TTDAAIDADL AQSLLREVTD ASFNCVTVDG DPSTNDTLLL LASGVSGVTI NASNIAAFRQ ALEIVCIDLA KQIAADGEGA TKLITITVDH APSVAAARTV ARKIACSPLV KTAIHGGDPN WGRILAAAGV AGVPFDPSHV ELWLGEVQLV AGGTPTNYNE REAASQIGGQ QVAIRLNLGA GAATGYAWTC DFSAEYVRIN ADYRT
|
| |
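The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence with the codon assignments of translation table 11. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and it does not handle the alternative start codons that table 11 permits. All function names are hypothetical; the example input is the first three blocks of the gene sequence.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (third base varies fastest).
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    prefix = clean_sequence("ATGAGTATCT TTCGTTTTGC CGCCGGCTTC")
    print(translate(prefix))  # MSIFRFAAGF, the first block of the protein sequence
    print(gc_fraction(prefix))
```

Passing the full gene sequence through `clean_sequence` and `translate` should allow a comparison with the listed protein sequence once its spacing is removed in the same way.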