Gene RPB_3086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3086 
Symbol 
ID3910887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3517479 
End bp3518648 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content64% 
IMG OID637884991 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_486696 
Protein GI86750200 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.985379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTG AGAACTGGAC CGCGCGCTAC CTGGACGAAC TCAAGGAGTT TCGTCACGAT 
CTGCACCGCA ACCCCGAGTT GCTGTACGAC GTTCATCGCA CCGCCGAGCG GGTCGCGGCG
CGGTTGCGCG AAGCCGGCGT CGACGAGGTG CACGAAGGCA TCGGCGGAAC GGGCGTCGTC
GGAATCATTT ACGGCCAATC GCGTAGTTCC GGCCGGATGA TCGGGCTGCG CGCCGATATG
GATGCTTTGC CCATTCTCGA GGCAACTGGC GCGGAATGGG CCTCGCAAGT CCCCGGCAAG
ATGCACGCAT GCGGCCACGA CGGCCACACC ACCATGCTGT TGGGTGCGGC GCTTGGCCTC
GTCGAGAGCC GGGCCTTCGA CGGCGGCGTC GCGCTGATCT TCCAGCCCGC CGAAGAAGGC
GGTGCGGGCG CCAAGGCCAT GCTCGATGAT GGCCTGCTGC AGCGCTTTCC GATCCAGGAA
TTCTACGGGA TGCACAACCG CCCGGGACTT CCCTTGGGGA CGTTTGCGAC GGGCCCAGGG
CCGCAAATGG GTTCGGTCGA CGAGATCATC ATCTCGATCG AAGGTCGCGG CGGCCACGCC
GCTCAGCCGC ATGCCACGGT CGATCCGGTG GTCGTTGCCG CGGCGTTGAT TCAGGCGACT
CAGGCGATCG TCTCGCGCAA TCTCGATCCA CTGCAATCCG CGGTTATTTC GATCACACAG
ATGACCGCCG GTGACGCGTT CAACGTCATT CCTCAAACGG TGACATTGCG CGGCACGGTT
CGCACGCTGG ACGAGCCGAC CCGCGACATG GTTGAAAAGC GGCTGCGCGA ACTCACCGAG
AGTATTTCGG CGGGGTTCAG CGCTGTCGGT ACGCTGTCTT ATCTGCGGCA CTACCCTGTG
ATGAGAAATT CCGAAGTCGG CGTCGACCGC GCGGTCGCGG CGGCCGGAGA AGTGGCCGGC
GTCGCGCACG TCGACGCCAC AATGGCTCCG ACGCTGGGCG GCGAGGACTT CGCGTTCATG
CTGAACGAGC GGCCTGGCGC GATGATCATG ATCGGTAACG GCGATAGCGC CCCGCTGCAT
CATCCGCGCT TCGATTTCAA CGACGACGTC ATCCCGTGGG GCTGTTCGTA TTGGACCGCC
TTGGTCCGCC AGCGCATGCC GTTGGTCTGA
 
Protein sequence
MPIENWTARY LDELKEFRHD LHRNPELLYD VHRTAERVAA RLREAGVDEV HEGIGGTGVV 
GIIYGQSRSS GRMIGLRADM DALPILEATG AEWASQVPGK MHACGHDGHT TMLLGAALGL
VESRAFDGGV ALIFQPAEEG GAGAKAMLDD GLLQRFPIQE FYGMHNRPGL PLGTFATGPG
PQMGSVDEII ISIEGRGGHA AQPHATVDPV VVAAALIQAT QAIVSRNLDP LQSAVISITQ
MTAGDAFNVI PQTVTLRGTV RTLDEPTRDM VEKRLRELTE SISAGFSAVG TLSYLRHYPV
MRNSEVGVDR AVAAAGEVAG VAHVDATMAP TLGGEDFAFM LNERPGAMIM IGNGDSAPLH
HPRFDFNDDV IPWGCSYWTA LVRQRMPLV