Gene RPD_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3802 
Symbol 
ID4024318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4241433 
End bp4243232 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content69% 
IMG OID637964006 
Productallophanate hydrolase 
Protein accessionYP_570924 
Protein GI91978265 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02713] allophanate hydrolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA CCATCGCCGA GATCGTCGCC GCCCATCGCG CCGGCGCAAG CACGCCCGCG 
CAGACCGTGG CGCGTTGCTA TCAGCGGATC CGCGCACATA GCGATCCCGC GATCTTCATC
ACCCTGCGCG ATGAGGCCGA GGCGGTTGCC GAAGCCGTCG CGCTCGCCGC CAGGGATCCG
TCGCTGCCGC TCTATGGCGT GCCGGTTGCG GTGAAGGACA ATATCGACGT CGCCGGCCTG
CCGACCACGG CGGCCTGCCC GGCCTTCGCC TATCGGCCGT CGAAGGATTC CACGGCGGTG
GCGAGGCTGC GCGCCGCGGG CGCGATCGTC ATCGGCAAAA CCAATCTCGA TCAGTTCGCC
ACCGGCCTCG TCGGCGTGCG GTCGCCCTAC GGGATTCCAC GCAACGCAAT GCGTCCTGAC
CTCGTGCCCG GCGGTTCGAG TTCGGGATCG GCCGTCGCGG TCGCGGCCGG CCTCGTGCCG
CTGTCGCTCG GCACCGACAC CGCGGGCTCC GGCCGCGTGC CCGCGATGCT CAACAACATC
GTCGGGCTGA AGCCGAGCCT CGGCCTGATC TCGACCACAG GCCTCGTGCC GGCGTGCCGC
ACGCTGGATT GCATCTCGGT GTTCGCGCTG ACGGTCGACG ACGCGATGAC CGCGCTGCGC
GTAATGGGCG CCCCGGACGC TACCGATCCC TATTCGCGCG ATCGGGCTCT TGCGACGATG
ACGGCCACCC CGGTCAGGCT GCGGCTCGGT GTGCCGCGAC GCGACCAATG GCAATTCTTC
GGCGATCAGC AAGCCGAGCA AGCCTACGCC GACGCGCTGC GACGCTGGAC CGCCCTCGGC
GCGGAATTGA TCGACGTCGA TATTGAGCCG CTCTACGAGA CCGCGCGTTT GCTCTACGAA
GGACCGTGGG TCGCCGAGCG CTATCTCGCC ATTCGCGAGC TGATCGACAC GACGCCCGAC
GCGGTGCATC CGGTGACGCG GGCGATCACG CTGGGCGGCA AGGGGATCAC CGCGGCCGAC
ACCTTCGCCG CGCTCTATCG CCTGCAGGCG CTGCGCACGA TCGCGGAGCC GGCCTTCGCC
GCAATCGATG CGCTGGTTCT GCCGACCGCG CCGACCGCCT ACACGGTCGA TGAGGTGCTC
GCCGAGCCGA TCGCACTCAA CAGCCGGCTG GGCACCTACA CCAACTTCGT CAATCTGCTC
GACCTTTGCG GCCTCGCGCT GCCCGCCTCG ATTCGCGCCG ACGGGATTCC GTTCGGCATC
ACTTTGCTGG CGCCGGGCGG CCACGATGCG CAGCTCGCCA GGATCGGACG GCTCTTCCAC
GCCGATACCG CGCTGCCGAT GGGGGCCACG GGGCGGAGGC AACCCGATCT CACTCCGCTG
GACCCGCCAG CCGACAGGGA GGCGATTGCG ATCGCCGTGG TCGGCGCGCA TCTGTCCGGC
ATGGCGCTGA ACGGCGAACT CACCACCCTC GGCGGACGGC TCGCACGCGC GACGACGACC
GGGCCGGACT ACAAGCTCTA TGCGTTGGAG GGCACCACGC CACCGAAACC CGGCATGCTG
CGCGTCGCCC CCGGCACGGG CGCCGCGATC GCTGTCGAAG TGTGGTCGCT GTCGCCCGCC
GCATTCGGGC ATTTCGTCGG CGCGATCCCG CAGCCGCTGT CGATCGGTAC CGTCACGTTG
GCGGACGGCG CAAAGGTCAA GGGCTTTCTC GTCGAGCCAG CGGCGCTCGA CGGCGCCCGG
GAGATCACGC ATTTCGGCGG CTGGCGCGCT TACATGGCGG AGCTGGCCGC AACCGGGTGA
 
Protein sequence
MTETIAEIVA AHRAGASTPA QTVARCYQRI RAHSDPAIFI TLRDEAEAVA EAVALAARDP 
SLPLYGVPVA VKDNIDVAGL PTTAACPAFA YRPSKDSTAV ARLRAAGAIV IGKTNLDQFA
TGLVGVRSPY GIPRNAMRPD LVPGGSSSGS AVAVAAGLVP LSLGTDTAGS GRVPAMLNNI
VGLKPSLGLI STTGLVPACR TLDCISVFAL TVDDAMTALR VMGAPDATDP YSRDRALATM
TATPVRLRLG VPRRDQWQFF GDQQAEQAYA DALRRWTALG AELIDVDIEP LYETARLLYE
GPWVAERYLA IRELIDTTPD AVHPVTRAIT LGGKGITAAD TFAALYRLQA LRTIAEPAFA
AIDALVLPTA PTAYTVDEVL AEPIALNSRL GTYTNFVNLL DLCGLALPAS IRADGIPFGI
TLLAPGGHDA QLARIGRLFH ADTALPMGAT GRRQPDLTPL DPPADREAIA IAVVGAHLSG
MALNGELTTL GGRLARATTT GPDYKLYALE GTTPPKPGML RVAPGTGAAI AVEVWSLSPA
AFGHFVGAIP QPLSIGTVTL ADGAKVKGFL VEPAALDGAR EITHFGGWRA YMAELAATG