Gene RPB_1734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1734 
Symbol 
ID3908259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1980211 
End bp1981380 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content68% 
IMG OID637883628 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_485353 
Protein GI86748857 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.169236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0172136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTGA TCAACCGCGT CGCCGACCTG CAACCCGATA TCATGGCCTG GCGTCACGAC 
CTCCATCAGC ACCCCGAACT GATGTACGAC GTCGACCGCA CCGCCGATTT CGTCGCCCAG
CGCCTGCGCG AATTCGGCTG CGACGAGGTG GTGACGGGAC TCGGCCGCAC CGGCGTGGTC
GGTGTGATCC GCGGCCGCAA GCCGGCGAGC GGCGACCTCA AGGTGATCGG GCTGCGCGCC
GACATGGACG CGCTGCCGAT CGAGGAGGCG ACCGGTCTAC CCTATGCCTC CAAGGTGCCC
GGCAAGATGC ACGCCTGCGG CCATGACGGC CACACCGCGA TGCTGCTCGG CGCCGCGCGC
TATCTCGCCG AGACCCGCAA TTTCGCAGGC AGTGTAGTGG TGATCTTCCA GCCGGCCGAG
GAGGGCGGCG CGGGGGCCGC GGCGATGATC AAGGACGGGC TGATGGACCG CTTCGGCATC
GAGCAGGTCT ACGGCATGCA CAACGGCCCC GGCATCCCGG TCGGCTCCTT CGCCATCAGC
CCGGGCGCGA TCATGGCCTC GACCGATTCG GTCGACATCC GCATCGAGGG CGTCGGCGGC
CACGCCGCGC GGCCGCATAT GTGCGTCGAC TCGGTGCTGG TGGGCGCCCA GCTCGTCACC
GCGCTGCAGT CGATCGTGTC GCGCACGGTC GATCCGCTGG AATCGGCGGT GATCTCGATC
TGCGAATTCC ACGCCGGCAA CGCCCGCAAC GTCATCCCGC AGATCGCCGA ACTGAAAGGC
ACGGTCCGCA CCCTGAAGGC CGAAGTTCGC GACCTGGTCG AGAAGCGCAT CCACGAGGTC
GCGGCCGGCG TTGCGCAGTC GACCGGCGCC AGGATCGACA TCGTCTACGA GCGCGGCTAC
CCGGTGGTGG TCAACCATGC CGAGCAGACC GAGGTGGCGC AGCGGATCGC CCGCGACATC
GCCGGCGAGT CCAACGTGAC GTCGATGCCG CCGCTGATGG GCGCCGAGGA TTTCGCCTAT
ATGCTGGAAG CGCGGCCGGG CGCGTTCATC TTCCTCGGCA ATGGCGACAG CGCCGGGCTG
CATCACCCGG CCTACAACTT CAACGACGAC GCCATCGTCT ACGGCACCTC GTACTGGATC
AAACTGGTCG AGAACCAACT CGCGGCGTGA
 
Protein sequence
MPLINRVADL QPDIMAWRHD LHQHPELMYD VDRTADFVAQ RLREFGCDEV VTGLGRTGVV 
GVIRGRKPAS GDLKVIGLRA DMDALPIEEA TGLPYASKVP GKMHACGHDG HTAMLLGAAR
YLAETRNFAG SVVVIFQPAE EGGAGAAAMI KDGLMDRFGI EQVYGMHNGP GIPVGSFAIS
PGAIMASTDS VDIRIEGVGG HAARPHMCVD SVLVGAQLVT ALQSIVSRTV DPLESAVISI
CEFHAGNARN VIPQIAELKG TVRTLKAEVR DLVEKRIHEV AAGVAQSTGA RIDIVYERGY
PVVVNHAEQT EVAQRIARDI AGESNVTSMP PLMGAEDFAY MLEARPGAFI FLGNGDSAGL
HHPAYNFNDD AIVYGTSYWI KLVENQLAA