Gene Hhal_2046 details

Gene Information

Locus tag: Hhal_2046
Symbol:
ID: 4710120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2249335
End bp: 2250669
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 75%
IMG OID: 639856519
Product: N-acetylglutamate synthase
Protein accession: YP_001003612
Protein GI: 121998825
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0548] Acetylglutamate kinase
        [COG1246] N-acetylglutamate synthase and related acetyltransferases
TIGRFAM ID: [TIGR01890] amino-acid N-acetyltransferase
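
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2249335
END_BP = 2250669


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1335 444
```

Under these assumptions the computed values agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields.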


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.398173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCTCG CAGACCTCGA CCCGCAGCAG TTCGTGGAGT GGTTCCGCCA CGCGGCACCG 
TACATCAACG CCCACCGCGG GCGGACCTTC GTGATCGCCT TCCCCGGTGC CGCCGTCGAG
CCGCCGCAGA TCGATGGCCT GGTCCACGAC CTGGCCGTCC TGGCCAGTCT GGGGGTGCGC
CTGGTGCTGG TCCCCGGCGC CCGCCCCCAG GTGGAGCAGC GCCTGCAGCG CCGCGGCCTG
GCAGCCCGCT ACGCCCAGGG CAGCACCGGG CTGCGCATCA CCGACGCCGA GGCCCTGGAG
TGCGTGCGCG ACGCCATCGG CGAGGTGCGC ACGCGGCTCG AGGCGCGCCT CTCCACCGGG
CTGGCCACCT CACCCATGGC CGGACTGCGC ATCCGGGTGG CCTCCGGCAA CGTCATCACC
GCCCGCCCGG TGGGCGTCCG CGACGGCATC GATCACCTCT ACACCGGCGA GGTCCGGCGG
GTGGACGCCG AGGCCCTGCG CCGGCGGCTC GACGACGGCG ACATCGCCCT GGTCTCGCCG
CTGGGCTACT CGCCCACCGG CGAGGCGTTC AACCTCTCCG CCGAGTCGGT GGCGCGGGCC
GTGGGCGAGG CCCTGGCCGC CGACAAGCTC ATCCACCTGA CCCGGCACGC ACCGCTGCAC
GAGGCCGACG GCACCCCGGT GCGCGAACTC ACCCCGCGCG AGGCGCGGGC CCGGCTGGAC
ACCGACGGCC TGGCCGGCGA CGCCCGGCGC CTGCTGCACA GCGCCCACCA GGCCTGCCTG
GGCGGCGTGG CCCGGGTGCA CCTGCTCGAC CGCAATCAGC ACGGCGCCCT GCTCCTGGAG
CTCTTCACCC GCGACGGCGT CGGCACCCTG ATCGCCCCGG AGCCGTTCGA GTCCCTGCGC
ACCGCTGGGG TGGACGACAT CCCGGGGATC CTGGGGCTGA TCCGCCCGCT GGAGGAGAGC
GGCGCGCTGG TCTACCGCCC CCAGGAGCTG CTTGAGGAGC AGATCGGCAC CTTCACCGTG
GCCGAACGCG ACGGCGCCGT GATCGCCACC GGGGCGCTGC TGCCCTGGCC GGCGGAGGAC
GCCGGGGAGA TCGCGTGCCT GGCGGTCGAT CCGGACTACC GCGGCGCCGG GCGCGCCGAT
ACGCTGGTCC GGCGCCTGGA GCAGCAGGCC CGCAACCACG GCCTGCGCCG GCTGTTCGTG
CTCACCACCC GGGCCGAGCA CTGGTTCCGC GAGCGCGGTT TCGAACCCGC CGGCCCCGAG
GCCCTGCCGG CGGCCCGCCG GGCCCTCTAC GACCAGACCC GGGGCTCGAA GGTCCTGGTC
CGGGACATCC CGTAA
 
Protein sequence
MRLADLDPQQ FVEWFRHAAP YINAHRGRTF VIAFPGAAVE PPQIDGLVHD LAVLASLGVR 
LVLVPGARPQ VEQRLQRRGL AARYAQGSTG LRITDAEALE CVRDAIGEVR TRLEARLSTG
LATSPMAGLR IRVASGNVIT ARPVGVRDGI DHLYTGEVRR VDAEALRRRL DDGDIALVSP
LGYSPTGEAF NLSAESVARA VGEALAADKL IHLTRHAPLH EADGTPVREL TPREARARLD
TDGLAGDARR LLHSAHQACL GGVARVHLLD RNQHGALLLE LFTRDGVGTL IAPEPFESLR
TAGVDDIPGI LGLIRPLEES GALVYRPQEL LEEQIGTFTV AERDGAVIAT GALLPWPAED
AGEIACLAVD PDYRGAGRAD TLVRRLEQQA RNHGLRRLFV LTTRAEHWFR ERGFEPAGPE
ALPAARRALY DQTRGSKVLV RDIP
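
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before it can be analysed. The sketch below shows one way to strip the grouping and compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as a sample, so the printed figure describes that sample and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short sample;
# paste the full block to check the record's GC content figure.
sample = "ATGCGCCTCG CAGACCTCGA CCCGCAGCAG TTCGTGGAGT GGTTCCGCCA CGCGGCACCG"
seq = clean_sequence(sample)
print(len(seq), round(100 * gc_fraction(seq)))
```

Splitting on whitespace and rejoining handles both the spaces between blocks and the line breaks between rows in a single step.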