Gene RPB_1257 details


Gene Information

Locus tag: RPB_1257
Symbol:
ID: 3909191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1438572
End bp: 1439801
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 64%
IMG OID: 637883151
Product: formamidase
Protein accession: YP_484878
Protein GI: 86748382
COG category: [C] Energy production and conversion
COG ID: [COG2421] Predicted acetamidase/formamidase
TIGRFAM ID:
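
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the stop codon is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_1257.
length = gene_length(1438572, 1439801)
print(length, protein_length(length))  # 1230 409
```

Both results match the Gene Length and Protein Length fields of the record.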


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.66975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.173821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGAGA CACTGATCAA GGTCGATCTC ACGCAGTCCG CCTACGACAA CGAGATGGTC 
CACAACCGCT GGCACCCGGA CATTCCGATG GCGGCGTGGG TCAATCCCGG CGACGACTTC
ATCGTCGAGA CCTACGACTG GACCGGCGGC TTCATCAAGA ACAACGACAG CGCCGACGAC
GTCCGCGATA TCGACCTGTC GATCGTGCAC TTCCTGTCGG GGCCGATCGG CGTCAAGGGC
GCCGAGCCGG GCGACCTGCT GGTGGTCGAC CTGCTCGACG TCGGCCCGAT GAAGGAGAGC
CTGTGGGGCT TCAACGGCTT CTTCTCCAAG CAGAACGGCG GCGGCTTCCT GACCGATCAC
TTCCCGCTGG CGCAGAAGTC GATCTGGGAC TTCAAGGGCA TGTACACCTC GTCGCGCCAC
ATCCCGGGCG TGAACTTCGC CGGCCTGATC CATCCCGGCC TGATCGGCTG TCTGCCGGAT
CCGAAGCTGC TCGCAACCTG GAACGAGCGC GAGACCGGCC TGATCGCCAC CAACCCGACC
CGCGTGCCCG GCCTCGCCAA TCCGCCGTTC GGCCCGACCG CGCATATGGG CAAGCTCACC
GGCGACGCCA AGGCGAAAGC CGGAGCGGAA GGCGCCCGCA CCGTGCCGCC GCGCGAGCAC
GGCGGCAATT GCGACATCAA GGACCTGTCG CGCGGCTCCA AGATCTTCTT CCCGGTCTAT
GTGCCGGGCG GCGGCCTGTC GATGGGCGAC CTGCATTTCA GCCAGGGCGA CGGCGAGATC
ACCTTCTGCG GCGCCATCGA GATGGCCGGC TGGCTGCACA TCAAGGTCGA CATCATCAAG
GACGGCGTCT CGAAATACGG CATCAAGAAT CCGATCTTCA AGCCGTCGCC GGTGACGCCG
AACTACAAGG ACTATCTGAT CTTCGAAGGC ATCTCGGTCG ACGAGCAGGG CCAGCAGCAT
TATCTCGACG TCACCGTCGC GTATCGCCAG GCCTGCCTGA ACGCCATCGA GTATCTGAAG
AAGTTCGGCT ACTCCGGCGC CCAGGCCTAT TCGATCCTCG GCACCGCCCC GGTGCAGGGC
CACATCTCCG GCGTCGTCGA CGTCCCCAAC GCCTGCGCCA CGCTGTGGCT GCCGACCGAG
ATCTTCGATT TCGACATGAT GCCGTCCTCG GCCGGCCCGG TCAAACACAT CAAGGGCGAC
ATCCAGATGC CGATCTCGCA GGACAAGTAA
 
Protein sequence
MPETLIKVDL TQSAYDNEMV HNRWHPDIPM AAWVNPGDDF IVETYDWTGG FIKNNDSADD 
VRDIDLSIVH FLSGPIGVKG AEPGDLLVVD LLDVGPMKES LWGFNGFFSK QNGGGFLTDH
FPLAQKSIWD FKGMYTSSRH IPGVNFAGLI HPGLIGCLPD PKLLATWNER ETGLIATNPT
RVPGLANPPF GPTAHMGKLT GDAKAKAGAE GARTVPPREH GGNCDIKDLS RGSKIFFPVY
VPGGGLSMGD LHFSQGDGEI TFCGAIEMAG WLHIKVDIIK DGVSKYGIKN PIFKPSPVTP
NYKDYLIFEG ISVDEQGQQH YLDVTVAYRQ ACLNAIEYLK KFGYSGAQAY SILGTAPVQG
HISGVVDVPN ACATLWLPTE IFDFDMMPSS AGPVKHIKGD IQMPISQDK
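
The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short script. This is a sketch rather than the database's own method. It assumes the sequences are pasted in as plain strings with the grouping spaces still present, and it builds the codon table from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map for translation table 11, listed in NCBI order
# (bases T, C, A, G; third codon position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Note: alternative start codons (such as GTG or TTG) are not converted
    to M here; this gene starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "ATGCCAGAGA CACTGATCAA GGTCGATCTC ACGCAGTCCG CCTACGACAA CGAGATGGTC"
print(translate(first_row))  # MPETLIKVDLTQSAYDNEMV
```

Running `translate` and `gc_content` on the full gene sequence should reproduce the listed protein sequence and a GC fraction close to the reported 64%.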