Gene Haur_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4139 
Symbol 
ID5736000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5287056 
End bp5288243 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content56% 
IMG OID641281293 
Productarginine biosynthesis bifunctional protein ArgJ 
Protein accessionYP_001546899 
Protein GI159900652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATCT TTCGTTTTGC CGCCGGCTTC CGCAGTGCTG CGGGGCGATG TGGCTTGAAG 
GCCAGTGGTA ATCCTGATTT AAGCTTACTT GTTGCTGATA ATGTTTGCAC CGGGGCTGGG
GTTTTTACTA CCAGCCTCGT CAAAGCCGCG CCAGTGCTCT ACGATCAAGC AGTTTTGGCC
GAGCATGCCA GCGAAATTCG GGCAATTATT GCCAATGCTG GCTGTGCCAA CGCTTGTACC
GGAGCGCAGG GCGATGCGGC GGCTCGTGAG ATGGCACGTT TAGCGGCTGA AGCAGTTGGT
TGCGAGCCAC ACCAAGTTTT GGTGCTCTCA ACGGGCGTAA TCGGCCATCA ACTGAATGTT
GAAAAAGTTG CCAAGGGCGT GGCGGCAATT GCGCCTGAAC TGGGCGTTGA GCATGCTCCA
GCGCTGTCCG AGGCGATTAT GACCACCGAT ACCCGCCCCA AAACGTCGAG CGCCACGGCG
GTGATCGATG GAGTTGAGGT AACGGTAGCT GGGGTGGCCA AAGGCGCAGG CATGATCCAT
CCGATGATGG CAACCATGCT TTCAATTGTC ACCACCGATG CAGCAATCGA TGCCGATTTG
GCCCAAAGTT TGTTGCGCGA AGTCACCGAT GCATCATTTA ACTGTGTAAC GGTGGATGGC
GACCCGAGTA CCAACGATAC GCTATTGTTG TTGGCCTCAG GCGTGAGTGG TGTGACGATC
AATGCCAGTA ATATTGCAGC CTTCCGCCAA GCGCTTGAAA TTGTCTGCAT TGATTTGGCC
AAACAAATTG CTGCCGATGG CGAAGGCGCA ACCAAGCTGA TTACGATTAC GGTTGATCAT
GCGCCGAGTG TGGCTGCCGC CCGCACCGTT GCCCGCAAAA TTGCCTGCTC ACCCTTGGTC
AAAACCGCGA TTCACGGCGG CGATCCCAAT TGGGGGCGAA TTTTGGCAGC AGCCGGAGTC
GCGGGTGTGC CATTCGATCC CAGCCACGTT GAATTGTGGT TGGGCGAGGT GCAATTAGTT
GCTGGTGGCA CGCCCACCAA CTACAACGAA CGCGAAGCCG CCAGCCAAAT CGGCGGCCAA
CAAGTGGCAA TTCGCCTAAA TCTTGGGGCT GGCGCGGCCA CTGGCTACGC TTGGACCTGC
GATTTTAGCG CGGAATATGT GCGAATTAAC GCTGATTATC GGACGTAG
 
Protein sequence
MSIFRFAAGF RSAAGRCGLK ASGNPDLSLL VADNVCTGAG VFTTSLVKAA PVLYDQAVLA 
EHASEIRAII ANAGCANACT GAQGDAAARE MARLAAEAVG CEPHQVLVLS TGVIGHQLNV
EKVAKGVAAI APELGVEHAP ALSEAIMTTD TRPKTSSATA VIDGVEVTVA GVAKGAGMIH
PMMATMLSIV TTDAAIDADL AQSLLREVTD ASFNCVTVDG DPSTNDTLLL LASGVSGVTI
NASNIAAFRQ ALEIVCIDLA KQIAADGEGA TKLITITVDH APSVAAARTV ARKIACSPLV
KTAIHGGDPN WGRILAAAGV AGVPFDPSHV ELWLGEVQLV AGGTPTNYNE REAASQIGGQ
QVAIRLNLGA GAATGYAWTC DFSAEYVRIN ADYRT