Gene GM21_0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0566 
SymbolhisZ 
ID8135881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp690803 
End bp692125 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID644868183 
ProductATP phosphoribosyltransferase regulatory subunit 
Protein accessionYP_003020398 
Protein GI253699209 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis 
TIGRFAM ID[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.1336e-23 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACTAACC CCTCATGCAT TGAAGCTCCG CTCCCCAAAG GGGTGAGCGA CTTCCTCCCG 
GAAACCGCCG ACAAGATCAC CTTCATCGCG GACAGCATCC ACCGAGTCTT CGAACTATGG
GGCTTCAGGC GGATGATCAC CCCGCTGCTC GAGTTCGAGC ATGTGCTCGC TCTGGGCATG
GGCGACGAAC TGCGGAGCAA GACCTTCCGC TTCGACGACC GCCAGACCGG ACGGCTCCTC
GCCATTCCGC CGGACATCAC GCCGCAGGTG GCACGCATCG TCGCGACGCG AATGCACGCC
CTCCCCCTCC CCCATCGCAT CTACTATTCC GGACGCGTGC TCAGACAGGC CCAGATGCAG
TCGGGCAAAA GCCGCGAGAT CTTTCAGTCC GGGGTAGAGC TGATCGGGCT GGACTCCCCC
GAGGCGGACG CCGAGATGGT GGCGATGGCG GTGGAGGTGC TGAAAAACCT CGGTTTCACC
GGCTTCAAGA TCGACCTGGG ACAGGTGGAG TTCTACCGCG GCATCATGGA CGCCTCCGGC
CTCTCCACCG AAGTGCGGAA ACAGTTGCAG GAAGCGATCA GCAAGAAGGA AGTCAGCGCG
GTGCGCTCCA TCCTTGAATC GGCCGGGGCT CCCGACCGGG TCAAGGAAGA GATCGCCCTG
CTGCCCAGAC TCTACGGCGG GCGCGAGGTG TTGCAGGAAG CGCGCAGTAT CGCCGGAAAC
GAGCGCTCGC TGCGGGCCCT GGACAACCTC GCCCAGGTGG TCGACATCCT GGACATATAC
GGGGTCGCCG AGCATCTAAC CATCGACCTG GGCGAGATCC GCGGGCTGGA CTACCACAGC
GGGATCACCT TCGAAGGTTT CGTCCCCGGC GTCGGGGAAG CGATCTGCAG CGGCGGCCGC
TACGACGACC TCACCGCCAA GTACGGCTAT CCGGCGCACG CCACAGGCTT CGCCTTCAAC
ATCCTGGCCC TGCTGTCCAG CATGTCCAAG AGGCCGGAGG TCGAGGCGTC AAGCGGCCGC
GACTTCCTGA TCTTCAACAA CAAGGACGAG CGCCGCGAGG CGCTGGAAGT GGCGCAGAAG
CTGAGAAGCC TCGGCTACAC CTGCGCCAGG GATATCATCA AGCGTGACTT CGACAGCTCG
CTTGAATACG CGAAGAAAAT GAACATCCGA TTGCTGCTTG TGATCGGCGC CGAGGGGTGC
GCGGCCGACC AGCTTTACCT GGTGCGGGTG GCGGACCGTC GAAGCATCAC CGTAAGCGAA
GAGGAGTTGT TCGACAAGGA TCTTGATTTG AAATTCGATC TGCAGGGGGA GAATCATGGC
TAA
 
Protein sequence
MTNPSCIEAP LPKGVSDFLP ETADKITFIA DSIHRVFELW GFRRMITPLL EFEHVLALGM 
GDELRSKTFR FDDRQTGRLL AIPPDITPQV ARIVATRMHA LPLPHRIYYS GRVLRQAQMQ
SGKSREIFQS GVELIGLDSP EADAEMVAMA VEVLKNLGFT GFKIDLGQVE FYRGIMDASG
LSTEVRKQLQ EAISKKEVSA VRSILESAGA PDRVKEEIAL LPRLYGGREV LQEARSIAGN
ERSLRALDNL AQVVDILDIY GVAEHLTIDL GEIRGLDYHS GITFEGFVPG VGEAICSGGR
YDDLTAKYGY PAHATGFAFN ILALLSSMSK RPEVEASSGR DFLIFNNKDE RREALEVAQK
LRSLGYTCAR DIIKRDFDSS LEYAKKMNIR LLLVIGAEGC AADQLYLVRV ADRRSITVSE
EELFDKDLDL KFDLQGENHG