Gene RPB_4057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4057 
Symbol 
ID3911864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4627321 
End bp4629120 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content71% 
IMG OID637885961 
Productallophanate hydrolase 
Protein accessionYP_487661 
Protein GI86751165 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02713] allophanate hydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGAAA CCATCGCCGA CATTCTCGCC GCGCATCGCG CCGGCACCAC GACGCCCGCG 
CAGACCATCG CGCGCTGCTA TCAGCGCATC CGCGCTCATG CCGATCCCGC GCTGTTCATC
ACGCTGCGCG ACGAGGCGGA CGCCGTGGCC GAAGCCGTGG CGCTCGCCGC CCGTGACCCG
TCGCTGCCGC TCTACGGCGT GCCGGTCGCC GTCAAGGACA ATATCGACGT CGCCGGCCTG
CCGACCACCG CGGCCTGTCC GGCTTTCGCG TATCAGCCGG CGCAGGATTC CACCGCGGTT
GCGAAGCTGC GCGCCGCAGG CGCGATCATC ATCGGCAAGA CCAATCTCGA TCAGTTCGCC
ACCGGCCTGG TCGGCGTGCG CTCGCCCTAC GGCATTCCGC GCAACGCGAT GCGCGCCGAT
CTGGTGCCGG GCGGCTCGAG TTCGGGTTCG GCGGTCGCGG TCGGCGCCGG CCTGGTGCCG
CTGTCCCTCG GCACCGACAC CGCCGGCTCC GGCCGCGTCC CGGCGATGCT CAACAACATC
GTCGGGCTGA AGCCGAGCCT CGGCCTGATC TCGACCACCG GCCTGGTGCC GGCGTGCCGC
ACGCTGGATT GCATCTCGGT GTTCGCGCTG ACCGTGGACG ACGCGATGAT CGCGCTGCGG
GTGATGGGCA CCCCCGACGC CACCGATCCG TATTCGCGCG CTCGGCCGAT CGCGCCGATG
TCGGCGATGC CCGACAGGCC ACGGCTCGGC GTACCGCGGC CCGATCAGTT GCAGTTTTTC
GGCGATCAGC AATCCGAACA GGCCTATGCC GACGCGCTGC AACGCTGGAC GTCGCTCGGC
GCCGAACTGA TCGAGATCGA TGTCGCGCCC TTATACGAGA CCGCGCGGCT GCTCTATGAC
GGCCCGTGGG TCGCCGAGCG CTATCTCGCG ATCCGCGAGC TGATCGACGC GACCCCCGAC
GCGCTGCATC CGGTGACGCG GCAGATCACG CTCGGCGGCA AGGCGATCAG CGCCGCCGAC
ACATTCGCAG CGCTGTATCG ACTGCAGGCG TTGCGCAAGC TCGCGGAGCC CGCCTTCGCG
GCGATCGACG CGCTGGTGCT GCCGACGGCG CCGACCGCCT ACACGGTCGA ACAGGTGCTG
GCCGATCCGG TCACGCTCAA CAGCCGGCTC GGCACCTACA CCAATTTCGT CAATCTGCTC
GACCTTTGCG GGCTCGCGAT CCCGGCGTCG ATCCGCGCCG ACGGCATCCC GTTCGGCGTC
ACGCTGCTGG CGCCGGGCGG CCGCGATGCC GAGCTCGCCA GCCTCGGCCG CATGTTCCAC
GCCGACACCG CGCTGCCGAT GGGCGCGACG GGCGTCGCCC TGCCGCCGCT GGCGGCGCTG
GATGCGGACG CCGGCACGGA CCACATTCAG ATCGCGGTGG TCGGCGCGCA TCTGTCCGGC
ATGGCGCTGA ACGGCGAGCT GACGTCGCTG GACGGCCGAC TGTTGCGCGC GACCGCGACG
GCGCCGGACT ACAAGCTCTA TGCGCTGAAC GGCACGGTGC CGCCGAAGCC CGGCATGCTG
CGCGTCGCAG CAGGCGCAGG CGCGGCGATC GCGCTGGAAA TCTGGTCGCT GTCGCCGGCC
GCGTTCGGCC GCTTCGTCGC TGCGATCCCG CAGCCCTTGT CGATCGGCAC CCTGAAGCTG
GCGGACGGCG CGCTGGTCAA AGGCTTCCTG GTCGAGCCCG CCGCGCTCGA GGTCGCCCGC
GACATCACCC ATTTCGGCGG CTGGCGGGCC TATATGGCGG AGCTGGCGAA AGCGGGGTGA
 
Protein sequence
MTETIADILA AHRAGTTTPA QTIARCYQRI RAHADPALFI TLRDEADAVA EAVALAARDP 
SLPLYGVPVA VKDNIDVAGL PTTAACPAFA YQPAQDSTAV AKLRAAGAII IGKTNLDQFA
TGLVGVRSPY GIPRNAMRAD LVPGGSSSGS AVAVGAGLVP LSLGTDTAGS GRVPAMLNNI
VGLKPSLGLI STTGLVPACR TLDCISVFAL TVDDAMIALR VMGTPDATDP YSRARPIAPM
SAMPDRPRLG VPRPDQLQFF GDQQSEQAYA DALQRWTSLG AELIEIDVAP LYETARLLYD
GPWVAERYLA IRELIDATPD ALHPVTRQIT LGGKAISAAD TFAALYRLQA LRKLAEPAFA
AIDALVLPTA PTAYTVEQVL ADPVTLNSRL GTYTNFVNLL DLCGLAIPAS IRADGIPFGV
TLLAPGGRDA ELASLGRMFH ADTALPMGAT GVALPPLAAL DADAGTDHIQ IAVVGAHLSG
MALNGELTSL DGRLLRATAT APDYKLYALN GTVPPKPGML RVAAGAGAAI ALEIWSLSPA
AFGRFVAAIP QPLSIGTLKL ADGALVKGFL VEPAALEVAR DITHFGGWRA YMAELAKAG