Gene RPD_4117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4117 
SymbolhisZ 
ID4024639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4581817 
End bp4582974 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID637964325 
ProductATP phosphoribosyltransferase regulatory subunit 
Protein accessionYP_571237 
Protein GI91978578 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA CCGCCGCCGC TCGAGCGACC GGATCCGCCG AGTGGGCGGA GGCGCTGCTG 
CAATCGTTCA CCAAGGCCGG CTATGTCCGG GCCGAACCGG CGATCCTGCA GCCTGCGGAG
CCGTTCCTCG ACCTCTCCGG CGAGGATATC CGCAAGAACC TCTACCTCAC CACCGACGGC
AGCGGCGAAG AGTTGTGCCT GCGGCCCGAC CTGACGATTC CCGTGGCGCG GGACTATCTC
GCCTCGCCGG GCGCGGGTCA GCCGACCGGC TTCTGTTATC TCGGCCCGGT GTTCCGGCAG
CGCAGCGGCA AGCCGAGCGA GTTCCTCCAG GCCGGGATCG AATCATTCGG CCGCCAGGAC
CGCGCCGCGG CCGACGCCGA GATGCTGGCG CTGGGACTGG AAGCCACGAC GGCGTTCGGC
GTCGGCGAGG TCGACATCCG CACCGGCGAC GTCGCGCTGT TCTCCGCGCT GATCGATGCG
CTCGGCCTGT ATCCGGTGTG GCGGCGGCGG CTGATGAAGG ATTTCAACCG CAAGGTGAGC
CTCGCGCAGG ACCTCGAGCG GCTGACGCTC GCGACCTCCG GCGGCAACGA ATATGAGGGC
GTGCTCGCGG CGCTGGCCGG CTCCGACCGC AAGGCGGCGC TGGCGCTGGT CACCGACCTG
ATGTCGATCG CCGGCGCGAC CACGCTCGGC GGCCGCTCGG TCGCTGAGAT CGCCGACCGC
TTTCTCGAAC AATCGACGCT GAAGAGCGGC GCCTTGCCGC GCGACGCGCT GCAGAAAATT
CAACGCTTTC TCGCGATCAG CGGCGATCCG AACGAGGCGC TGACGCAGCT GCGCGCGCTT
GCCGCCGACG CCAAGCTCGC GATCGAGCCG GCGATCGATC AGTTCGAGAG CCGGATCGGT
TTCATGGCCG CGCGCGGCAT CGATCTGAAG AAAACGCGGT TCTCGACTTC GTTCGGCCGC
GGCGTCGATT ATTATACCGG GTTCGAATTC GAACTGCACC GCGCTGGCAA CGGCGACGAT
CCGCTGGTCG CCGGCGGGCG CTATGACGGG TTGATGACTC AGCTCGGCGC CGCCGCGCCG
ATCCCCGCGG TCGGCTTCTC GATCTGGATC GAGGCGATGA CGCAGTCCGG CCCCGCCAAA
ACTGGGAGCG CGTCATGA
 
Protein sequence
MTKTAAARAT GSAEWAEALL QSFTKAGYVR AEPAILQPAE PFLDLSGEDI RKNLYLTTDG 
SGEELCLRPD LTIPVARDYL ASPGAGQPTG FCYLGPVFRQ RSGKPSEFLQ AGIESFGRQD
RAAADAEMLA LGLEATTAFG VGEVDIRTGD VALFSALIDA LGLYPVWRRR LMKDFNRKVS
LAQDLERLTL ATSGGNEYEG VLAALAGSDR KAALALVTDL MSIAGATTLG GRSVAEIADR
FLEQSTLKSG ALPRDALQKI QRFLAISGDP NEALTQLRAL AADAKLAIEP AIDQFESRIG
FMAARGIDLK KTRFSTSFGR GVDYYTGFEF ELHRAGNGDD PLVAGGRYDG LMTQLGAAAP
IPAVGFSIWI EAMTQSGPAK TGSAS