Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1257 |
Symbol | |
ID | 3909191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1438572 |
End bp | 1439801 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883151 |
Product | formamidase |
Protein accession | YP_484878 |
Protein GI | 86748382 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.66975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.173821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGAGA CACTGATCAA GGTCGATCTC ACGCAGTCCG CCTACGACAA CGAGATGGTC CACAACCGCT GGCACCCGGA CATTCCGATG GCGGCGTGGG TCAATCCCGG CGACGACTTC ATCGTCGAGA CCTACGACTG GACCGGCGGC TTCATCAAGA ACAACGACAG CGCCGACGAC GTCCGCGATA TCGACCTGTC GATCGTGCAC TTCCTGTCGG GGCCGATCGG CGTCAAGGGC GCCGAGCCGG GCGACCTGCT GGTGGTCGAC CTGCTCGACG TCGGCCCGAT GAAGGAGAGC CTGTGGGGCT TCAACGGCTT CTTCTCCAAG CAGAACGGCG GCGGCTTCCT GACCGATCAC TTCCCGCTGG CGCAGAAGTC GATCTGGGAC TTCAAGGGCA TGTACACCTC GTCGCGCCAC ATCCCGGGCG TGAACTTCGC CGGCCTGATC CATCCCGGCC TGATCGGCTG TCTGCCGGAT CCGAAGCTGC TCGCAACCTG GAACGAGCGC GAGACCGGCC TGATCGCCAC CAACCCGACC CGCGTGCCCG GCCTCGCCAA TCCGCCGTTC GGCCCGACCG CGCATATGGG CAAGCTCACC GGCGACGCCA AGGCGAAAGC CGGAGCGGAA GGCGCCCGCA CCGTGCCGCC GCGCGAGCAC GGCGGCAATT GCGACATCAA GGACCTGTCG CGCGGCTCCA AGATCTTCTT CCCGGTCTAT GTGCCGGGCG GCGGCCTGTC GATGGGCGAC CTGCATTTCA GCCAGGGCGA CGGCGAGATC ACCTTCTGCG GCGCCATCGA GATGGCCGGC TGGCTGCACA TCAAGGTCGA CATCATCAAG GACGGCGTCT CGAAATACGG CATCAAGAAT CCGATCTTCA AGCCGTCGCC GGTGACGCCG AACTACAAGG ACTATCTGAT CTTCGAAGGC ATCTCGGTCG ACGAGCAGGG CCAGCAGCAT TATCTCGACG TCACCGTCGC GTATCGCCAG GCCTGCCTGA ACGCCATCGA GTATCTGAAG AAGTTCGGCT ACTCCGGCGC CCAGGCCTAT TCGATCCTCG GCACCGCCCC GGTGCAGGGC CACATCTCCG GCGTCGTCGA CGTCCCCAAC GCCTGCGCCA CGCTGTGGCT GCCGACCGAG ATCTTCGATT TCGACATGAT GCCGTCCTCG GCCGGCCCGG TCAAACACAT CAAGGGCGAC ATCCAGATGC CGATCTCGCA GGACAAGTAA
|
Protein sequence | MPETLIKVDL TQSAYDNEMV HNRWHPDIPM AAWVNPGDDF IVETYDWTGG FIKNNDSADD VRDIDLSIVH FLSGPIGVKG AEPGDLLVVD LLDVGPMKES LWGFNGFFSK QNGGGFLTDH FPLAQKSIWD FKGMYTSSRH IPGVNFAGLI HPGLIGCLPD PKLLATWNER ETGLIATNPT RVPGLANPPF GPTAHMGKLT GDAKAKAGAE GARTVPPREH GGNCDIKDLS RGSKIFFPVY VPGGGLSMGD LHFSQGDGEI TFCGAIEMAG WLHIKVDIIK DGVSKYGIKN PIFKPSPVTP NYKDYLIFEG ISVDEQGQQH YLDVTVAYRQ ACLNAIEYLK KFGYSGAQAY SILGTAPVQG HISGVVDVPN ACATLWLPTE IFDFDMMPSS AGPVKHIKGD IQMPISQDK
|
| |