Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4779 |
Symbol | |
ID | 8007032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 150893 |
End bp | 152053 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644821709 |
Product | amidohydrolase |
Protein accession | YP_002972969 |
Protein GI | 241113134 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.488683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0993236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCG ACAAGGATGC GCTGCATGCG GAAATGACAG CGTGGAGACG CGATCTCCAC GCTCATCCCG AATTTGGCTT CGAGGAGCGG CGGACATCCG CCTTCGTTGC GGCCAAGCTG CGGGAATTCG GCTTCGACGA GGTCACCGAG GGCATCGGCG GCACCGGCGT CGTCGGAACG CTGAAACGCG GCAACGGCAA TCGCGCCATT GCCCTGCGTG CCGATATGGA TGCGCTCAGG ATCAACGAAC AGGCGGAGCT CTCGCACCGG TCCCAAAACC CGGGAATCAT GCATGCTTGC GGTCACGACG GCCACACCGC CATGCTGCTC GGCGCAGCAA AGGTCCTGGC CGGGGAAGGC GGTTTCGACG GCACGGTACG CTTCATCTTC CAGCCTGCAG AAGAATGGGG CAAAGGCGCG CTGGCAATGA TCGCCGATGG GCTCTTCGAA AGATTCCCCT TCGACGAGAT CTACGGCATC CACAACATGC CGGGGATCGA CATTGGCCGC TTCCATACGC GTCCTGAAGC GATCATGTCC GCCGAGGACA ATTTCGAGAT TACGCTGACC GGCGTCGGCG GCCACGCCGC CCGGCCTCAC TGGGGCAATG AAGTGCTCGT CGCGGCCTGC GCGCTCGTGA CCAATCTGCA GACCATCGTC TCGCGGCGAC TGGATCCGGC CGACATCGCC GTCGTCTCCG TCACTGAGCT GATCACCGAC GGCACGAGGA ATGCGCTTCC CGGCTTCGCC CGCATTCTGG GCGACGCCCG CAGCTTTCGC TCGGAGATCA GCGAGACGAT CGAGAAGCAG ATGCGCGTGA TCGCCGAGGG TACCGCCATG ACGCACAACA TCAAGGCTGA CGTCGTCTAC ACCAGGGAAT TCATCCCTCT CATGAACGAT CCGTCGTTGA CGGAGGAGGC CTTGAGCGTC GCACGCGATC TGTACGACGC TTCAAATGTC GCCATCGCGA GCAAGCCCAT GACCGGATCC GAAGACTTCG CGCAGTTCCT TACGCGGGTT CCGGGCTGTT TCGTGTTCCT TGGCAACGGC GAGCATTCGC CGCCACTTCA TAACCCGACC TATGACTTCA ACGATGCCGG CCTCCTGCAT GGGGCAAACT TCCACGCAGG GATTGTGCGT CGACGGCTTC AGACAAGCTG A
|
Protein sequence | MTIDKDALHA EMTAWRRDLH AHPEFGFEER RTSAFVAAKL REFGFDEVTE GIGGTGVVGT LKRGNGNRAI ALRADMDALR INEQAELSHR SQNPGIMHAC GHDGHTAMLL GAAKVLAGEG GFDGTVRFIF QPAEEWGKGA LAMIADGLFE RFPFDEIYGI HNMPGIDIGR FHTRPEAIMS AEDNFEITLT GVGGHAARPH WGNEVLVAAC ALVTNLQTIV SRRLDPADIA VVSVTELITD GTRNALPGFA RILGDARSFR SEISETIEKQ MRVIAEGTAM THNIKADVVY TREFIPLMND PSLTEEALSV ARDLYDASNV AIASKPMTGS EDFAQFLTRV PGCFVFLGNG EHSPPLHNPT YDFNDAGLLH GANFHAGIVR RRLQTS
|
| |