Gene Smed_0404 details


Gene Information

Locus tag: Smed_0404
Symbol: hisZ
ID: 5321238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 434278
End bp: 435411
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 66%
IMG OID: 640789339
Product: ATP phosphoribosyltransferase regulatory subunit
Protein accession: YP_001326096
Protein GI: 150395629
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis
TIGRFAM ID:
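
The start, end, and length fields in this record can be checked against each other. The sketch below is only an illustration, not part of the database record. It assumes that the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 434278
end_bp = 435411
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp == gene_length_bp)           # True
print(span_bp % 3 == 0)                    # True
print(residue_count == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1134 bp is 378 codons, which is 377 residues plus one stop codon.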


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.133547
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0561317
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTCA TTGATCTTCC CGGTTTCGCC GGCGACCTCC TTGCGGATTT CGAACGCCGG 
AACACGCTGC GCGTCGACAC GCCGGTCATC CAGCCTGCCG AGCCTTTCCT CGACATGGCC
GGGGAAGACC TGCGCCGGCG GATCTTCATG ACCGAGAGCG AAACCGGCGA GAGCCTGTGC
CTGCGGCCGG AGTTCACCAT CCCCGTCTGC CTGCGACATA TCGAGACCGC AACCGGAACG
CCGCAACGCT ACGCCTATCT GGGCGAGGTG TTCCGGCAGC GCCGCGACGG ATCGAGCGAG
TTCTACCAGG CGGGCATCGA AGATCTGGGC GATCCGGATA CGGCCGCCGC TGATGCGCGG
GTGGTCGGTG ATGCCTTGTT CGTCCTTTCC AATCGACTGC CGGGCGAGCG GCTGAAGGTC
ACGCTTGGCG ACCAGTCGGT CTTCGAAGCG GTGATTGCCG CCTGTGGCCT GCCCGGCGGT
TGGCAGAAAC GGCTCATTCA TGCCTTCGGG GATCAGAAGC AGTTGGACAG GCTCTTGGCC
GAGCTGGCCG ACCCGAAATC GCCCGGCGTC TTCGGCCACG ACGTCGAGCG CGTCGCAGCC
TTGGGCATGC TCGACGACGA AGAGCGGCTC GTCGCTCATA TCGGCGAGAC GATGGAGGCG
ACCGGTTATT CGACCAATGC CAGCCGCTCG CCCCGCGATA TCGCCCGGCG CCTGAAGGAA
AAGGTCGAAC TTGCAGCCAC CCGGCTGGAC AAGGAAGCGC TTGCCGTCAT GCGCGCGTTC
CTCGCTCTTG ATCTGCCGCT CGCCGACGCT CCGGCCGCGC TCCACAGCTT CGCCGGCAAG
GCGCGTCTGA GGATCGACGA CGCGCTGGAA CTCTTCGATG CGCGCGTGGC TGCGCTCGCA
TTGGCGGGCG CCGATCCCGG CCCGATGCGT TACCGCGCCG CCTTCGGACG ACCGCTCGAC
TATTACACGG GCCTCGTCTT CGAAATCCAC GTCGAAGGCA CCCCCGCAGT GCTCGCCGGC
GGCGGCCGGT TCGACCGCCT CCTCACCTTG CTCGGCGCTC GTGAGCATAT TCCGGCCGTC
GGCTTTTCTC TTTGGCTCGA CCGGATAGAA CAGGCTGCGG GGAGAGAGAA ATGA
 
Protein sequence
MPLIDLPGFA GDLLADFERR NTLRVDTPVI QPAEPFLDMA GEDLRRRIFM TESETGESLC 
LRPEFTIPVC LRHIETATGT PQRYAYLGEV FRQRRDGSSE FYQAGIEDLG DPDTAAADAR
VVGDALFVLS NRLPGERLKV TLGDQSVFEA VIAACGLPGG WQKRLIHAFG DQKQLDRLLA
ELADPKSPGV FGHDVERVAA LGMLDDEERL VAHIGETMEA TGYSTNASRS PRDIARRLKE
KVELAATRLD KEALAVMRAF LALDLPLADA PAALHSFAGK ARLRIDDALE LFDARVAALA
LAGADPGPMR YRAAFGRPLD YYTGLVFEIH VEGTPAVLAG GGRFDRLLTL LGAREHIPAV
GFSLWLDRIE QAAGREK
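
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below is an illustration only. It strips the spacing, translates a DNA string, and computes GC percentage. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and a codon containing anything other than A, C, G, or T raises `KeyError`.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: standard amino-acid assignments, as shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCCTCTCA TTGATCTTCC CGGTTTCGCC GGCGACCTCC TTGCGGATTT CGAACGCCGG"
last_row = "GGCTTTTCTC TTTGGCTCGA CCGGATAGAA CAGGCTGCGG GGAGAGAGAA ATGA"

print(translate(first_row))  # MPLIDLPGFAGDLLADFERR
print(translate(last_row))   # GFSLWLDRIEQAAGREK
```

The two printed strings match the first 20 and last 17 residues of the protein sequence in the record. The last row is in frame because the 18 preceding rows hold 60 bases each. Passing the full gene sequence to `gc_percent` gives a value to compare with the reported GC content.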