Gene Haur_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4144 
Symbol 
ID5736005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5291529 
End bp5292539 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content53% 
IMG OID641281298 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001546904 
Protein GI159900657 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.606943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGTG TTGGTATTTT TGGCGTAACT GGCTATGCTG GATACGAACT AGCTCGCTGG 
TTGCAGCGCC ACCCAAGCGC CAGCGTGGTT TGGGCTGTTT CCGAATCGTC GGCGGGCAAA
CGCTTGGCGC AGGTTGTGCC TGGCCCACTT GACATGCCGT TGTTGGCTGC GAATCAGGTT
GATTGGCAGG CGGTTGATCT GATTTTCACG GGGTTGCCGC ATGGTGTCGC TGCCCAAACG
GTGGCAGAGG CCCGCAAGCA TGGTGTCAAA GCCATCGACC TCTCCGCCGA CCTTCGCCTG
GACAGCCCCG CCGCTTACAC CCGCTGGTAC GACCACACGC ATCCCCACCC TGAACTTTTG
AATGCTCCCT ATGGCTTACC TGAACTGAAT CGCGCTGTAT TGGTTGATGT GCCAGCGATC
GCCAACCCAG GTTGCTATCC CACTAGTGTT TTGCTTGGTT TAGCACCTTT GCTGGAACAA
GGCTGGTGGC AAACTGGCCA AACCATCATC ATTAATGCTG CTTCGGGAGT TTCGGGCGCT
GGCCGTGCAC CCAAACAGCA CTTACATTTT GTCGAGGCTC ACGAAAATTA TAGCCCTTAC
AACATTGGCC ATACCCATCG CCATGTTGGC GAAATTGAGC AAGAACTGAG CAAGTTGGCA
AACGCACCAG TTAATACGAT TTTTGCACCA CACCTCTTGC CGACCCAACG CGGTATTTTA
AGCACAATCT ATGTGCCAAT TCAGCCAGAG CTGGATTTGG CCAGCATTCA TGCGCTTTAT
CGCCAACGTT ATGCCGCTGA ACCATTCGTC AATGTGCTCG ATCAAGGTCA GTTGGCAAGT
TTGGCGCATG TTGTGCATAC CAACGATTGT GCAATTGGCT TGACGCTCGC TCAGCCTGGC
ATGTTGATCG TCACAGCGGC GATTGATAAT TTGCTCAAGG GTGCTTCGGG TCAAGCAATT
CAAAATATGA ATATTATGTT TGGTTTGCCT GAAACCACGG GCTTGCGCTA G
 
Protein sequence
MIRVGIFGVT GYAGYELARW LQRHPSASVV WAVSESSAGK RLAQVVPGPL DMPLLAANQV 
DWQAVDLIFT GLPHGVAAQT VAEARKHGVK AIDLSADLRL DSPAAYTRWY DHTHPHPELL
NAPYGLPELN RAVLVDVPAI ANPGCYPTSV LLGLAPLLEQ GWWQTGQTII INAASGVSGA
GRAPKQHLHF VEAHENYSPY NIGHTHRHVG EIEQELSKLA NAPVNTIFAP HLLPTQRGIL
STIYVPIQPE LDLASIHALY RQRYAAEPFV NVLDQGQLAS LAHVVHTNDC AIGLTLAQPG
MLIVTAAIDN LLKGASGQAI QNMNIMFGLP ETTGLR