Gene Information |
Locus tag | Rleg2_5220 |
Symbol | |
ID | 6978314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 851825 |
End bp | 853006 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394334 |
Product | amidohydrolase |
Protein accession | YP_002279152 |
Protein GI | 209547234 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.158205 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGA CGAACGACTA TGCCAACCTT TCGGATTTCG AGCCTGCCAA AGCCGAGCTG ACGGCGATCC GCCGTCACCT GCACGCCAAT CCTGAGCTTT CCTTCGAAGA AGCCGAAACC GCGCGTTTCG TCGCCGAAAA GCTGGAGGCC TGGGGTTACC ATGTGACACG CAATGTCGGC GGCCATGGCG TCGTTGCCCG TATGATTGTA GGGAGCGGCA CGAAGAGCAT CGCAATTCGC GCCGACATGG ACGCCCTGCC GATAACGGAG CAGACCGGTC TCGACCATGC CAGCAAGGTG GCAGGCAAGA TGCATGCCTG CGGCCATGAC GGCCATACCG CTATGCTTCT GGGCGCCGCC GAATATCTGG CCCGGACCCG CCGCTTCAAC GGCACCGTGA CGCTGATCTT CCAGCCGGCC GAAGAGGCAG GTGCGGTCAG CGGCGCCCCG GCGATGATCG CCGATGGCCT CTTCGAGCGC TTTCCTTTCG ACGTGATCTA CGGCCTGCAC AATCATCCGG GCGCCCCCGA AGGAACTTTC CTGATGCGCA CCGGCCCGCT GATGGCGGCC GCCGATACGG CCGAGATCAC GATAACGGGC AAGGGCGGCC ATGCCTCACG CCCGCATCTG ACGATCGATC CTGTCGTCGT CGCCTGCCAC CTCGTCGTCA CACTGCAGAC CGTCGTGTCG CGCAGCGTCG ACCCGACGCA GACTGCCGTG GTGACCGTCG GTGCCATCCA CAGCGGCGAA GCATCGAATG TCATTCCCGA GAATGCCAAG CTCTTGATGA CCGTCCGCTC CTTCGATCCC AAAGTGCGCG AGCTTTTGGA AACGCGCATC CGCAAGCTGT CCGAATCCAT TGCCGAAGGT TTCGGCGCCA AGGCGGAGAT CGATTACGTC CATGGCCATC CCGTCGTCGT GAACTCTGAG ACCGAGACCG AATTTGCCTG GACGGTCGCC GAGGAACTGG TCGGCGCCGA CAGGGTGACA ACCTGCGGTC TCATCCCGGG CAGTGAAGAC TTCTCGCATT TCCTCGAGCA CAAGCCCGGC GCCTTCCTCC GTCTCGGTAA TGGTGTCAAC TCGGCTATCC TGCACAGCGC CAGATATGAC TTCGCGGATG AAAGCCTGAC GGCGGGTGCC GCCATGTGGG CGCGGTTGAC CGAACGCTAT CTCGATGAAT AG
|
Protein sequence | MPLTNDYANL SDFEPAKAEL TAIRRHLHAN PELSFEEAET ARFVAEKLEA WGYHVTRNVG GHGVVARMIV GSGTKSIAIR ADMDALPITE QTGLDHASKV AGKMHACGHD GHTAMLLGAA EYLARTRRFN GTVTLIFQPA EEAGAVSGAP AMIADGLFER FPFDVIYGLH NHPGAPEGTF LMRTGPLMAA ADTAEITITG KGGHASRPHL TIDPVVVACH LVVTLQTVVS RSVDPTQTAV VTVGAIHSGE ASNVIPENAK LLMTVRSFDP KVRELLETRI RKLSESIAEG FGAKAEIDYV HGHPVVVNSE TETEFAWTVA EELVGADRVT TCGLIPGSED FSHFLEHKPG AFLRLGNGVN SAILHSARYD FADESLTAGA AMWARLTERY LDE
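
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the terminal stop codon (the gene sequence above ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Rleg2_5220.
length = gene_length_bp(851825, 853006)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields listed in the record.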