Gene Information |
Locus tag | Rleg2_4553 |
Symbol | |
ID | 6977647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 190585 |
End bp | 191580 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643393730 |
Product | urea amidolyase related protein |
Protein accession | YP_002278548 |
Protein GI | 209546630 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
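The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(190585, 191580)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Under these assumptions both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.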
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.16518 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGA TTTGCGAAAG CGGCCCTTTC AACACGGTGC AGGATCTCGG ACGTCCCGGC TATCGCGACA TCGGCGTATC GGCGAGCGGC GCGATGGATC CGTTTGCGGT CAGGATCGGC AATATTCTCG TCGGCAATGA CGAGAATGCG GCGGTGATCG AGGTGCAGAC CTTCCCGTTC AGCCTGCGTT TCGAGCGGCG CACCGTCTTT GCCGTGACCG GCGCCGACGG CCATATCAAT CTCGACGGAT CGGAACTCAT TCCCTGGTGC GCCTATACGG CAGAGCCCGG ACAGGTCCTC CAGCTGCAGC AGCCCCCGCG GCTGGCGCGC TCCTATATTG CGGTCGGAGG TGGGCTGGAT ATTCCTGTCG TCATGGGTTC GCGAAGCACG TCGCTGCGCG GCGGCTTCGG CGGCAATGCC GGCCGGCCTC TGGCGAAGGG CGACCGGATC CCGGTCGGCG AGGATTTGGA GATCGCCATG TTGCCGGCCA CGGGCCTTGC CGTCGTCGAG CCCGCCGTGG CGCTGCGCGA GGTGTTCCCG GGACCCGTCG ACGGCGCCGT GCCGATCCGC GCCCTGCCTG CCGGCGAGCA TGATCTTTTC GCCGGAGACG GCGAAGCTTT CTGGAGCCAG ACCTGGAGGA TTTCCTCGCG CAGCGACCGC ACCGGCTACC GCCTGTCCGG CGAGCCGATC ACGCCGACCG CCTCCATCGA GATGCGCTCC CATGGCGTCG TGCCCGGCGT CATCCAGGTT CCGCCCGGCG GCGAACCGAT CGTGCAGATG AGCGATGCAA ACACCGCCGG CGGATATCCG AAGATCGCCG GCGTGATCGA GTGCGATCTC TGGCGGCTTG GGCAAGCCCG AATCGGCGCC CGCCTGAAAT TCGTCCGGTC GACGCATGCG GAAGCGCGCG CGGTCGAACA GGCCGTCGCC CGCTATGTCG AGGATGTCAG GCTGACATCC GGGATGGTCA AGCGCGCCCT GAAGGCAATG GCCTAA
|
Protein sequence | MIEICESGPF NTVQDLGRPG YRDIGVSASG AMDPFAVRIG NILVGNDENA AVIEVQTFPF SLRFERRTVF AVTGADGHIN LDGSELIPWC AYTAEPGQVL QLQQPPRLAR SYIAVGGGLD IPVVMGSRST SLRGGFGGNA GRPLAKGDRI PVGEDLEIAM LPATGLAVVE PAVALREVFP GPVDGAVPIR ALPAGEHDLF AGDGEAFWSQ TWRISSRSDR TGYRLSGEPI TPTASIEMRS HGVVPGVIQV PPGGEPIVQM SDANTAGGYP KIAGVIECDL WRLGQARIGA RLKFVRSTHA EARAVEQAVA RYVEDVRLTS GMVKRALKAM A
|
| |
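The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to compute GC content and translate such a sequence. It is a minimal illustration, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record above.
prefix = "ATGATCGAGA TTTGCGAAAG CGGCCCTTTC"
print(translate(prefix))   # MIEICESGPF
print(gc_content(prefix))  # 0.5
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field.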