Gene RPC_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1668 
Symbol 
ID3972591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1800691 
End bp1801986 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content66% 
IMG OID637924783 
Productallantoate amidohydrolase 
Protein accessionYP_531548 
Protein GI90423178 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCGTC CGATCGATGA CGCGAATCGC AATCTGGACA TTCGCATGAG CAAGCTCGCC 
TCCAATCTGC AAATCGATTC CGCAAGGCTG TGGAGCACGA TCAACGACAC CGCGAAATTC
GGCGGCACGC CGAAGGGCGG GGTGCGGCGG CTGACGCTGA GCGCCGAAGA CAAGCAGGTC
CGCGACTGGT TTCGCCAAGC GCTCGAGGCC GCCGGCTGCG AGGTGCATGT CGATGCGCTC
GGCAACATGT TCGCGCTGCG CCGTGGCCGC GACATGAGCA AGCCGCCGAT CGGGCTCGGC
TCGCATCTCG ACACCCAGCC GACCGGCGGC AAGTTCGACG GCATTCTCGG TTCGCTCGCC
GCCCTCGAAG TGGTGCGCAC GCTGAACGAC GCCGGCATCG AGACCGAGCT GCCTTTGTGC
GTCGCCAACT GGACCAACGA GGAAGGCTCG CGCTACGCGC CGGCGATGAT GGGATCGGCG
GCCTATGTCG GCGACTTCAC CGTCGAGGAC ATTTTGGCGC GCAAGGACGG CGAGGGCATC
AGCGTCGCCG CGGCACTCGA CGGCATCGGC TATCGCGGCA GCGAGGCGGT CGGGACGCAG
AAATTCACCA GCTTCGTCGA GCTGCATATC GAACAAGGCC CGATCCTGGA AGCCGAAGGC
AAGACCATCG GCGTGGTGGA TTCCGGGCAG GGCGTGTTGT GGTACGATGG CCAGATCGTG
GGCTTCGAAA GCCACGCCGG CTCGACGCCG ATGCGGCTGC GCCGCGACGC GCTGGCGACG
CTTTCCGAGA TCGTGCTTGC GGTGGAGCGG ATCGCTACCG AACTCGGCCC CAACGCGGTC
GGCACCATCG GCGAAGCGGC GATCGCGCGG CCATCGCGCA ACGTCATTCC CGGCGAGATC
GCCTTCACCA TCGACATGCG CAGCGCCGAC GCGTCGATCA TGGATGCGCT CGACAAGAAT
TTGCGCGCTG CCGCGGCGGA GATCGCCGGC CGCCGCAAGG TCGAAATCCC GCTCGATCTG
GTGTGGCGGA TCGAGCCGAC GCATTTCGAC GCCAAGCTGG TCGACGCGGT GCAGCGAGCC
GCCGGCGAGC TCGGCTACAG CCATCGCCGC ATTACTTCCG GCGCCGGCCA CGACTCCTGC
AACCTCGCCA CCGCAATGCC GGCGGCGATG ATCTTCGTGC CGTGCAAGGA CGGCGTTAGC
CACAACGAAT TGGAAGACGC CACCGAGGCC GATTGCGGCG CCGGTGCCAA CGTGCTGCTG
CATACCGTGC TGGCGCTCGC CGGCGTGGCG AAGTAA
 
Protein sequence
MHRPIDDANR NLDIRMSKLA SNLQIDSARL WSTINDTAKF GGTPKGGVRR LTLSAEDKQV 
RDWFRQALEA AGCEVHVDAL GNMFALRRGR DMSKPPIGLG SHLDTQPTGG KFDGILGSLA
ALEVVRTLND AGIETELPLC VANWTNEEGS RYAPAMMGSA AYVGDFTVED ILARKDGEGI
SVAAALDGIG YRGSEAVGTQ KFTSFVELHI EQGPILEAEG KTIGVVDSGQ GVLWYDGQIV
GFESHAGSTP MRLRRDALAT LSEIVLAVER IATELGPNAV GTIGEAAIAR PSRNVIPGEI
AFTIDMRSAD ASIMDALDKN LRAAAAEIAG RRKVEIPLDL VWRIEPTHFD AKLVDAVQRA
AGELGYSHRR ITSGAGHDSC NLATAMPAAM IFVPCKDGVS HNELEDATEA DCGAGANVLL
HTVLALAGVA K