Gene Information |
Locus tag | Rleg2_4873 |
Symbol | |
ID | 6977967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 511458 |
End bp | 512627 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643394031 |
Product | amidohydrolase |
Protein accession | YP_002278849 |
Protein GI | 209546931 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
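The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 511458
END_BP = 512627
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the reported gene length (1170 bp) and protein length (389 aa) agree with the listed coordinates.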
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.458115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGC TAGACTTAGG CGGGTTCCTG CCCGAGCTTC GCGACATCCG CCAGCACTTG CATAGCATTC CGGAAATAGG CCTGGAAGAG CATGAGACCT CTGACTTCAT TGCCGCAAAG CTTGAGGGAT GGGGGTTTCA GATCACTCGA TACCTTGGAA AGACTGGATT GGTCGCGTCG TTGCGCCGCG GCGCCGGAAA ACGCTCAATT GGACTGCGCG CGGACTTTGA CGCACTCCCA ATCATGGAAG AAACGAAACT GCCTTATGCC AGCGGGCATT CCGGCGTTAT GCACGCTTGC GGGCATGATG GCCATGCCGC AATGCTCCTG GGTGCGGCCT GGTTGCTGTC ACACACAAAC GATTTTTCCG GAACCGTGCA TTTCATATTC CAGCCGGCGG AAGAGAATTT TGGCGGCGCT CAACTGATGA TCGACGATGG CCTCCTCGAT TGGTTCCCTT GCGATGAAAT CTTTGCGCTG CATAACTGGC CGGGTCTGAC AGCGGGCGTT TTCTCTTCGA GGCCTGGCCC TATCGCAGCC TCCATCGACG CAGTCACCCT GACGATCAGG GGACTTGGAG GGCATGGCGC TGAACCGGAA AAAAGCATCG ATCCGGTTGT CGTTGGCTCA AGCATTGTAA TGGCTTTACA GACGCTGGCC TCTCGCACTG TCTCACCACA CTCCCCTTGC GTCGTCACTG TCGGTGCGTT TAATGCCGGC TCAGTCTGCA ATGTTATCCC AGATACGGCC AAGCTCGAAA TTTCGATCCG CGCAACTGAC CCGGCCGTAC GAGACGATAT TCGGGCGAAA ATCGAGACCA TTGCGAGACT GCAGGCCGAA AGCTTCCGCG CTACGGCCGA GTTCGAATGG ATAGTCGGCT ACCCTGCGAC AATCAACGAT GTGACGGCAT TCGAGCAGGT GCAACGCACC GTTACCAATC ATTTCGGGCC CGCGTTCTTC AAGCTGTGTG ACAAGCCATT TATGGGAAGC GAGGACTTTT CCTTCCTGCT TGAAAAAATT CCCGGCGCCT ACGTCCTGAT CGGAAATGGC GACAGCGCAA ATCTTCATAC GTCAAAGTAT GATTTCAATG ATGAAATCTT GGGCCCGGGC ATTGCATTCT TCGCGCATCT TGTGACGGAT GTTCTTAGGG ATGAAGCAGT GCAAGCATGA
|
Protein sequence | MPELDLGGFL PELRDIRQHL HSIPEIGLEE HETSDFIAAK LEGWGFQITR YLGKTGLVAS LRRGAGKRSI GLRADFDALP IMEETKLPYA SGHSGVMHAC GHDGHAAMLL GAAWLLSHTN DFSGTVHFIF QPAEENFGGA QLMIDDGLLD WFPCDEIFAL HNWPGLTAGV FSSRPGPIAA SIDAVTLTIR GLGGHGAEPE KSIDPVVVGS SIVMALQTLA SRTVSPHSPC VVTVGAFNAG SVCNVIPDTA KLEISIRATD PAVRDDIRAK IETIARLQAE SFRATAEFEW IVGYPATIND VTAFEQVQRT VTNHFGPAFF KLCDKPFMGS EDFSFLLEKI PGAYVLIGNG DSANLHTSKY DFNDEILGPG IAFFAHLVTD VLRDEAVQA
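The sequences above are displayed in space-separated blocks of ten characters. The sketch below is a hedged illustration of how such a field might be normalized before further analysis, and how the start and stop codons could be checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def has_terminal_stop(cds: str) -> bool:
    """True if the last three bases of the (ungrouped) CDS form a stop codon."""
    cds = ungroup(cds)
    return len(cds) >= 3 and cds[-3:] in STOP_CODONS


# Excerpts copied from the Gene sequence field: its first two and last two blocks.
first_blocks = "ATGCCGGAGC TAGACTTAGG"
last_blocks = "ATGAAGCAGT GCAAGCATGA"

assert ungroup(first_blocks).startswith("ATG")  # the CDS begins with an ATG start codon
assert has_terminal_stop(last_blocks)           # the CDS ends with TGA
```

Applying `ungroup` to the full Gene sequence field should yield a string whose length matches the Gene Length field, though that check is not run here.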