Gene Information |
Locus tag | RPD_3859 |
Symbol | |
ID | 4024375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4297463 |
End bp | 4298692 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637964063 |
Product | formamidase |
Protein accession | YP_570981 |
Protein GI | 91978322 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
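The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4297463
end_bp = 4298692

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon and so contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1230
print(protein_length)  # 409
```

Both results agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.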
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAGAGA CATTGATCAA GGTCGACCTC ACGCAGTCCG CTTACGACAA CGAGATGGTC CACAACCGCT GGCATCCGGA CATTCCGATG GCGGCGTGGG TGAATCCCGG CGACGACTTC ATCGTCGAGA CTTATGACTG GACCGGCGGC TTCATCAAGA ACAATGATTC CGCGGACGAC GTCCGCGATA TCGATCTGTC GATCGTGCAC TTCCTGTCGG GTCCGATCGG CGTCAAGGGC GCGGAGCCCG GCGACCTCCT GGTCGTCGAC CTGCTCGATG TCGGCCCGAT GAAGGAGAGT CTCTGGGGCT TCAACGGCTT CTTTTCCAAG CAGAACGGCG GCGGTTTCCT CACCGATCAT TTCCCGCTGG CGCAGAAGTC GATCTGGGAC TTCAAGGGCA TGTACACCTC GTCCCGTCAC ATCCCGGGCG TGAACTTCGC GGGCCTGATC CATCCCGGTC TGATCGGATG CTTGCCCGAT CCGAAGCTGC TGTCGACCTG GAACGAGCGC GAGACCGGCC TGATCGCCAC CAACCCGACG CGCGTGCCCG GCCTGGCCAA TCCGCCGTTC GGCCCGACCG CCCACATGGG CAAGCTGACC GGCGATGCGA AAGCCAAGGC CGGCGCCGAG GGCGCCCGCA CCGTGCCGCC GCGCGAACAC GGCGGCAATT GCGACATCAA GGATCTGTCG CGCGGCTCGA AGATCTACTT CCCGGTCTAC GTGCCGGGCG GCGGTCTTTC CATGGGCGAT CTGCACTTCA GCCAGGGCGA CGGCGAGATC ACCTTCTGCG GCGCGATCGA GATGGCCGGC TGGCTGCACA TCAAGGTCGA CATCATCAAG GACGGCGTGT CGAAATACGG CATCAAGAAT CCGATCTTCA AGCCGTCGCC GGTGACGCCG AACTACAAGG ACTATCTGAT CTTCGAAGGC ATCTCGGTCG ACGAGCAAGG CAAGCAGCAT TATCTCGACG TCACCGTCGC CTATCGCCAG GCCTGCCTCA ACGCCATCGA ATATCTGAAG AAGTTCGGCT ATTCCGGCGC CCAGGCCTAC TCGATCCTCG GCACCGCGCC GGTGCAGGGC CATATCTCGG GCGTCGTCGA CGTTCCGAAC GCTTGCGCCA CGCTGTGGCT GCCGACCGAG ATCTTCGACT TCGACATGAT GCCGACCTCT GCCGGTCCCA TCAAACATAT CAAGGGCGGC ATCGACATGC CGATCTCGCA AGACAAGTAA
Protein sequence | MPETLIKVDL TQSAYDNEMV HNRWHPDIPM AAWVNPGDDF IVETYDWTGG FIKNNDSADD VRDIDLSIVH FLSGPIGVKG AEPGDLLVVD LLDVGPMKES LWGFNGFFSK QNGGGFLTDH FPLAQKSIWD FKGMYTSSRH IPGVNFAGLI HPGLIGCLPD PKLLSTWNER ETGLIATNPT RVPGLANPPF GPTAHMGKLT GDAKAKAGAE GARTVPPREH GGNCDIKDLS RGSKIYFPVY VPGGGLSMGD LHFSQGDGEI TFCGAIEMAG WLHIKVDIIK DGVSKYGIKN PIFKPSPVTP NYKDYLIFEG ISVDEQGKQH YLDVTVAYRQ ACLNAIEYLK KFGYSGAQAY SILGTAPVQG HISGVVDVPN ACATLWLPTE IFDFDMMPTS AGPIKHIKGG IDMPISQDK
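As a rough check that the gene and protein sequences correspond, the sketch below translates the first 30 bases and the final 9 bases of the gene sequence. It is a minimal illustration, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs only in which start codons are permitted, which does not matter here because the gene begins with ATG).

```python
# Standard genetic code in the conventional TCAG ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
gene_start = "ATGCCAGAGA CATTGATCAA GGTCGACCTC"
# Final nine bases of the gene sequence (last two codons plus the stop).
gene_tail = "GACAAGTAA"

print(translate(gene_start))  # MPETLIKVDL
print(translate(gene_tail))   # DK
```

The two outputs match the first ten residues (MPETLIKVDL) and the last two residues (DK) of the protein sequence. Because the listed gene sequence already starts with ATG and ends with TAA, it appears to be given in coding orientation, so no reverse complement is applied even though the gene lies on the minus strand.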