Gene RPD_3859 details


Gene Information

Locus tag: RPD_3859
Symbol:
ID: 4024375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4297463
End bp: 4298692
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 62%
IMG OID: 637964063
Product: formamidase
Protein accession: YP_570981
Protein GI: 91978322
COG category: [C] Energy production and conversion
COG ID: [COG2421] Predicted acetamidase/formamidase
TIGRFAM ID:
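
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(4297463, 4298692)
print(length, protein_length_aa(length))  # prints: 1230 409
```

Under these assumptions the computed values match the listed Gene Length (1230 bp) and Protein Length (409 aa).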


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGAGA CATTGATCAA GGTCGACCTC ACGCAGTCCG CTTACGACAA CGAGATGGTC 
CACAACCGCT GGCATCCGGA CATTCCGATG GCGGCGTGGG TGAATCCCGG CGACGACTTC
ATCGTCGAGA CTTATGACTG GACCGGCGGC TTCATCAAGA ACAATGATTC CGCGGACGAC
GTCCGCGATA TCGATCTGTC GATCGTGCAC TTCCTGTCGG GTCCGATCGG CGTCAAGGGC
GCGGAGCCCG GCGACCTCCT GGTCGTCGAC CTGCTCGATG TCGGCCCGAT GAAGGAGAGT
CTCTGGGGCT TCAACGGCTT CTTTTCCAAG CAGAACGGCG GCGGTTTCCT CACCGATCAT
TTCCCGCTGG CGCAGAAGTC GATCTGGGAC TTCAAGGGCA TGTACACCTC GTCCCGTCAC
ATCCCGGGCG TGAACTTCGC GGGCCTGATC CATCCCGGTC TGATCGGATG CTTGCCCGAT
CCGAAGCTGC TGTCGACCTG GAACGAGCGC GAGACCGGCC TGATCGCCAC CAACCCGACG
CGCGTGCCCG GCCTGGCCAA TCCGCCGTTC GGCCCGACCG CCCACATGGG CAAGCTGACC
GGCGATGCGA AAGCCAAGGC CGGCGCCGAG GGCGCCCGCA CCGTGCCGCC GCGCGAACAC
GGCGGCAATT GCGACATCAA GGATCTGTCG CGCGGCTCGA AGATCTACTT CCCGGTCTAC
GTGCCGGGCG GCGGTCTTTC CATGGGCGAT CTGCACTTCA GCCAGGGCGA CGGCGAGATC
ACCTTCTGCG GCGCGATCGA GATGGCCGGC TGGCTGCACA TCAAGGTCGA CATCATCAAG
GACGGCGTGT CGAAATACGG CATCAAGAAT CCGATCTTCA AGCCGTCGCC GGTGACGCCG
AACTACAAGG ACTATCTGAT CTTCGAAGGC ATCTCGGTCG ACGAGCAAGG CAAGCAGCAT
TATCTCGACG TCACCGTCGC CTATCGCCAG GCCTGCCTCA ACGCCATCGA ATATCTGAAG
AAGTTCGGCT ATTCCGGCGC CCAGGCCTAC TCGATCCTCG GCACCGCGCC GGTGCAGGGC
CATATCTCGG GCGTCGTCGA CGTTCCGAAC GCTTGCGCCA CGCTGTGGCT GCCGACCGAG
ATCTTCGACT TCGACATGAT GCCGACCTCT GCCGGTCCCA TCAAACATAT CAAGGGCGGC
ATCGACATGC CGATCTCGCA AGACAAGTAA
 
Protein sequence
MPETLIKVDL TQSAYDNEMV HNRWHPDIPM AAWVNPGDDF IVETYDWTGG FIKNNDSADD 
VRDIDLSIVH FLSGPIGVKG AEPGDLLVVD LLDVGPMKES LWGFNGFFSK QNGGGFLTDH
FPLAQKSIWD FKGMYTSSRH IPGVNFAGLI HPGLIGCLPD PKLLSTWNER ETGLIATNPT
RVPGLANPPF GPTAHMGKLT GDAKAKAGAE GARTVPPREH GGNCDIKDLS RGSKIYFPVY
VPGGGLSMGD LHFSQGDGEI TFCGAIEMAG WLHIKVDIIK DGVSKYGIKN PIFKPSPVTP
NYKDYLIFEG ISVDEQGKQH YLDVTVAYRQ ACLNAIEYLK KFGYSGAQAY SILGTAPVQG
HISGVVDVPN ACATLWLPTE IFDFDMMPTS AGPIKHIKGG IDMPISQDK
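
As a quick consistency check between the two sequences above, the gene sequence can be translated and compared with the protein sequence. This is a minimal sketch, assuming the standard codon assignments (translation table 11 shares them with the standard table and differs only in which start codons are permitted) and that the spaces separating the 10-base blocks should be ignored. `CODON_TABLE` and `translate` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G); '*' marks a stop codon.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGCCAGAGA CATTGATCAA GGTCGACCTC ACGCAGTCCG CTTACGACAA CGAGATGGTC"
print(translate(first_line))  # prints: MPETLIKVDLTQSAYDNEMV
```

The result matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` gives the whole protein for comparison.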