Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0022 |
Symbol | |
ID | 8011270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 20099 |
End bp | 21478 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644822613 |
Product | Formamidase |
Protein accession | YP_002973873 |
Protein GI | 241202777 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.846742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0101399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCC GTATCGACTA TCGAAAGACT TCAGCTGCGG TTTCCAGAAA TTTCAACGGG TTTCAGGAGC CCTTGCAAAA GATCAGCAAG TGGGCTGCGG TTCCTCGCAT CCCAGCACAC TGCGTAAGGT CAAAATTTGC ATGTGCTGCA CCTGCAGTTG CGATGTTCCT TGTTGGAAGC GGTGCCTTCG CCGATACCAG TGACGTCAAG GGGCCAAGGC CAGTGATCGT CGCGAAGTCG GGCGAACATT GCAAGGATGA TCCAAACTGC TTTAACCGAG TTCACTATGC GGTTAAGCCA GTTGCGCGTG TGAAGCCTGG CCAGAAATTC ATTCTCGAGA CCCGTGATGG GCTGGATTCC GACCTTGATT TTTCGTCGAC CGCGGAGGAC GTCGCGGCCG TCGACCTGAA CCGCTGCCAC CCCCTGACGG GCCCCGTCTA TATTGAAGGG GCCAAGAAGG GCGATTCTAT TGCCGTGACA GTGGTGGACA TCGAGCCTGA CGAATTCGGG ACGACCACGG TCGTTCCGGG ATTCGGTTTC CTGAGAGATC TGTTCACCGA TCCATACATT GTTCACTGGG ATCTGAACCG CCTGGAAGCA CGCTCCAAAG ACATGCCCGG CATCTCGGTG CCAAACAACT CCTTCATGGG TACCATTGGT GTGCTTCCAG ACAAGGAAGA ACTTCAAAAA TGGCTTAAGC GCGAGCAGGA GCTTGCCGAT GCAGGCGGTG CAGTGCTGAC GCCGCAGCCA GTTGAAGCGC TACCAGCAGA TCTCTGTGGC GTCGATGGCA CAGCAAAGTC CGAATGCCTA CGCACCGTTC CGCCGCGTGA GAACGGAGGG AACGTTGACG CCAGGGAAAC AATTGTCGGT ACGACGATAT TGTTGCCATG CTTCATCGAC GGTTGTGGTC TGTTTGCCGG TGACGTCCAC TTTGCCATGG GCGGCGGCGA GGTGGCCGGA ACCGCCATCG AGACGGGTGG TAGGGTGACA CTCGAGGCCC AGGTCAGACC TGGTGGTGCC AAGCTTCAGA CTACAATGCA TTTCGAAGGC GGTTCACAGC TCAAGCAGCT TGCGCCATCG AGTTTCTACG CGATCTCAGG TCTCCCGGTT AAGAGTGAAG GCGAACTGCC TGTTTTTGAG ACCTACCTTG GTGGTGAGAA GATTGCTCCG CTCGCAAACC TCTCGGAGGA TCTCACGCTA GCAGCGCGAA ACGCCACGCT CAACATGATC GATTTTCTGG TCAAGACCAA GGGACTCACC CGCGAGCAGG CTTACGTTCT GACCAGTGTC GCTGTCGACC TCAACATAGC GCAGGTCGTC GACTACCCCA ATGTGGGGGT TACGGCGATC TTGAATCGCG ACGTGTTCAA GGAGCAATGA
|
Protein sequence | MNSRIDYRKT SAAVSRNFNG FQEPLQKISK WAAVPRIPAH CVRSKFACAA PAVAMFLVGS GAFADTSDVK GPRPVIVAKS GEHCKDDPNC FNRVHYAVKP VARVKPGQKF ILETRDGLDS DLDFSSTAED VAAVDLNRCH PLTGPVYIEG AKKGDSIAVT VVDIEPDEFG TTTVVPGFGF LRDLFTDPYI VHWDLNRLEA RSKDMPGISV PNNSFMGTIG VLPDKEELQK WLKREQELAD AGGAVLTPQP VEALPADLCG VDGTAKSECL RTVPPRENGG NVDARETIVG TTILLPCFID GCGLFAGDVH FAMGGGEVAG TAIETGGRVT LEAQVRPGGA KLQTTMHFEG GSQLKQLAPS SFYAISGLPV KSEGELPVFE TYLGGEKIAP LANLSEDLTL AARNATLNMI DFLVKTKGLT REQAYVLTSV AVDLNIAQVV DYPNVGVTAI LNRDVFKEQ
|
| |