Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1668 |
Symbol | |
ID | 6980405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1697903 |
End bp | 1699885 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396393 |
Product | Amidohydrolase 3 |
Protein accession | YP_002281183 |
Protein GI | 209549266 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.6106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC GTCGATCTTT TCTGGGAGCA GCATCGAGCC TGGCGTTCTC GAACCTCTTT TCCCCGGCGA AGGCCGCCGA TCCCAACCAG ACCGGAGTAG ACACCATGCA TCCCGACCTG ATCCTTCACA ACGGCCGCGT CACCACGCTT GACCGGTCCA ATCCGAATGC AACGGCTATC GCCGTTAAGG ACGGTCTCTT TGTCGAAGTC GGATCCGACA GCGAGATCAT GGCGCTGGCT GGGTCCAGCA CCAGGGTCGT CGACCTCAAG GGTAAGCGGG TCTTGCCCGG GCTCATCGAC AACCACACCC ACGTGGTCCG CGGCGGCCTG AACTACAACA TGGAGCTGCG TTGGGACGGC GTCCGGTCGC TCGCCGATGC GATGGACATG CTAAAGCGCC AGGTGGCGGT CACGCCCGCG CCGCAGTGGG TGCGCGTCGT CGGCGGGTTC AGCGAACATC AGTTTGCGGA AAAGCGCCTC CCGACGATTG AGGAAATCAA TGCTGTCGCG CCCGACACGC CGGTGTTCCT TCTGCACCTC TACGATCGCG CACTGCTCAA CGGGGCGGCT TTGCGTGCAG TCGGATACAC CCGTGACACG CCAAACCCGC CCGGCGGAGA GATCACCCGC GACGCCAATG GCAATCCCAC AGGCATGCTG CTGGCCAAGC CGAATGCAGG GGTTCTGTAC TCGACACTCG CCAAGGGACC GAAACTTCCT TTGGACTACC AGGTCAACTC GACCCGCCAC TTCATGCGCG AGCTCAACCG CCTGGGCGTG ACGGGCGTCA TCGATGCCGG CGGCGGCTTC CAGAACTATC CTGATGACTA CGAAGTCATC CAGAAGCTCT CCGATGCGAA CCAGATGACC GTTCGCCTTG CCTACAATCT CTTTACCCAG AAGCCCAAGG AAGAGAAACA GGACTTCCTG AAGTGGACGC AGTCGGTCAA ATACAAGCAG GGCAACGACT ACTTCCGCCA CAACGGTGCC GGCGAGATGC TTGTCTTCTC CGCCGCAGAT TTCGAGGATT TCCGCCAGCC TCGTCCGGAG ATGGCTCCGG AAATGGAAGG CGAGCTGGAG GAGGTCGTCC GTGTTCTGGC TGAAAACCGC TGGCCCTGGC GTCTGCACGC CACCTATGAC GAAACGATTT CCCGAGCCCT CGACGTGTTT GAGAAGGTCG ACAAGGATAT CCCGCTAGAA GGTCTGAACT GGTTCTTCGA TCACGCCGAA ACGATCTCCG AACGTTCGAT CGACCGGATC GCGGCGCTTG GCGGCGGCAT CGCCACCCAG CATCGCATGG CCTATCAGGG GGAATACTTC GTCGAACGCT ACGGTCACGG TGTAGCCGAG GCGACGCCGC CGATCCGCCG CATGCTCGAA AAGGGCGTGA ATGTCTCGGC AGGCACCGAC GCCACCCGCG TTGCCTCTTA CAACCCTTGG GTTTCGCTCT CCTGGATGGT CACCGGCAAG ACGGTCGGCG GCATGCAGCT CTATCCGCGC GCCAACTGCC TCGATCGCGA GACGGCGTTG CGCATGTGGA CCGAGAAGGT CACATGGTTC TCCAATGAGG AGGGCAAGAA GGGCCGTATC GAGAAGGGCC AGTTCGCCGA TCTGGTGGTG CCGGACAAGG ACTTCTTCTC CTGCGCGGAA GACGAGATCT CCTTCCTCAC TTCGGAACTG ACCATGGTCG GCGGCAAGAT TGTCTATGGG GCAGGCGACT TCAAGACGCT CGACGAGAAC GACGTGCCGC CGGCGATGCC CGACTGGTCT CCCGTCCGCA GATTCGGCGG ATACGCGGCC TGGGGCGAAC CGGAAGGCGC AGGCTCGCGC TCTTTGCGCC GTACGGTGAT TTCCACATGC GGATGTGCCA GTGACTGCGG CGTGCATGGC CATGACCATG CCGGACCCTG GACATCCAAA CTTCCGATCG CCGACCTAAA GGGATTCTTC GGCGCTCTCG GGTGCTCCTG CTGGGCGGTA TAA
|
Protein sequence | MTTRRSFLGA ASSLAFSNLF SPAKAADPNQ TGVDTMHPDL ILHNGRVTTL DRSNPNATAI AVKDGLFVEV GSDSEIMALA GSSTRVVDLK GKRVLPGLID NHTHVVRGGL NYNMELRWDG VRSLADAMDM LKRQVAVTPA PQWVRVVGGF SEHQFAEKRL PTIEEINAVA PDTPVFLLHL YDRALLNGAA LRAVGYTRDT PNPPGGEITR DANGNPTGML LAKPNAGVLY STLAKGPKLP LDYQVNSTRH FMRELNRLGV TGVIDAGGGF QNYPDDYEVI QKLSDANQMT VRLAYNLFTQ KPKEEKQDFL KWTQSVKYKQ GNDYFRHNGA GEMLVFSAAD FEDFRQPRPE MAPEMEGELE EVVRVLAENR WPWRLHATYD ETISRALDVF EKVDKDIPLE GLNWFFDHAE TISERSIDRI AALGGGIATQ HRMAYQGEYF VERYGHGVAE ATPPIRRMLE KGVNVSAGTD ATRVASYNPW VSLSWMVTGK TVGGMQLYPR ANCLDRETAL RMWTEKVTWF SNEEGKKGRI EKGQFADLVV PDKDFFSCAE DEISFLTSEL TMVGGKIVYG AGDFKTLDEN DVPPAMPDWS PVRRFGGYAA WGEPEGAGSR SLRRTVISTC GCASDCGVHG HDHAGPWTSK LPIADLKGFF GALGCSCWAV
|
| |