Gene Information |
Locus tag | Rleg2_1155 |
Symbol | |
ID | 6979875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1167739 |
End bp | 1169988 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643395868 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002280675 |
Protein GI | 209548758 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
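The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1167739
END_BP = 1169988
GENE_LENGTH_BP = 2250
PROTEIN_LENGTH_AA = 749


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2250
print(expected_protein_length(GENE_LENGTH_BP))  # 749
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length rows.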
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.675797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCAGC CGTTCGTCCA AACCCTGAAA ACGTCCGCAA GCACGGGCGT TCGCTGGGAT CTGTGGTACG CGAACAGCAC CGGCTTTTAC ATGAATGAGG GGGCGGAAAT CGCCCTCAAG ATCACGGTTG CCAACCCCAC GGCGGGCTGT AGCATCCGCT GCTACTGCAC TGGAGCAGCT GCGACCGAGA ACAATAACAA CTGGTCGAAG AGCTGGACCA ATATGTGCAC GGAAGAGGCG GCCCGGCACA CAGGCGTCAC CTATACGCCC CAAGGCGACA GCACGCACCC GTCCAGCCGC ACCGGGATAT TCACCTTCGC GGCGAACTAC GATGGTGAGC CCATCATATT AGCCCGCACG GCGCAATATG ACCTGACCAC GGAAGGCACC GTCCAGCTCG ATACGTGGAT TGATAACCCC AGCAGCGGCT CGATCCTGCA CGGTGTTGTC AGCCTCCAGA TAGCCGATGT GTCGGTAACG CCTCCCGGAA CTCCCACTTT CCGTGTCGTG TCGCCGGGTG GGTTCATCAA TGAGGGAGAT ATGGTCGATA TCACCCTCCA GACGGAAAAC ATCACACCCG GCACGACCTG CACTGTCCAG ATCGTCAACA CAGCGGCTAA CTCGGCGGAC TGGATTGAGG ACAATCACAC CCAGTGGACG AACGCCTGCG CCGCAGTGGG ATGCACCTAC ACCATCCGCA CCAGCTCGTC CGGGCTCATC ACCTTCGGCT CTGGCTACAG TGACGCCAAT CCCATCCATT ACACCCGCAC CTCGAAGGCG GATAATTTGA CGGAAGCGGC TACGGAGCAG GTGGACTGGA TCTTCGCCAA CTTCAGCGAT TCCTCTATGC GGATTTATTC CAGCGCCGTG TCCTACTGGC TTATCGACAC CAGCCAGAAC CCGACGGCCC CTGCGTTCCA GATCAAGATG TCGCCGAATA ACCCGAACCC CGGCGATACC ATCACGTACA CGCTGAAATC CGAGACGGGG GCGACGGGCG GGGGGACGGT GGTGTTTGCC CAAGGCGGAG ACGCCAGCGA TGCTGACTTC AATATCAGCC TGGACGATCT GCTGACGAAT CTCGCGGCAA CCCAGCCAGC CAACATGTCC TATGACACCT CGACCAACAC CCTGACCGTG ACGAGCAGCT GGACAGGCAC GGCCAGCACG ACCCGCATTC TGAACCCGGC CACGACGGCA ACAAAGCACA CCATCAGCCT TGCATCTGCC GGGGGAGGGG CGGTTTTGGT CGTTGCCGAT GATGTTTGTT ACATCGGCGG TGCGGGAGCC GTCGCACCAT TCACTTATGC TTCGGGCATC AACCTGTCCG GCGCGGATTT CGGCTCTTTC CTCATCAACT CGAATACGGT GCTGGACTAC TACGCCACGA AGGACTTCCA CGTCGCCCGT TTTCCGGTGA AGTGGGAGTA TCTGCAAAGC GCTGCCTTCG GCTCGCTCAA CACCAGCAAT ACCAACGCCT TTGTTGCGAT GGTCAAATAC TGGACCAACA CCAAGGGCCG GTTCGCGATC GTTGACCTGC ACAGCTATTC AAAGCTCAAC AACGTCCAGA TCGGCATCAC CGGCAGTCTG GTACGTCCGC AGGCGCTGTG TGATCTCTGG GTGAAGCTGG TCGGTGCCTT GCAAGCCGCC AGCGTGGACA TGAGTAAGGT GATCCTCGAC TTCATGAACG AGCCGACGAA CCTTGCGGCC GATTGGCGGC GGTATGCGCA GGCAACAGCG AATGCTGTCC GCGCCCGCAC GACCTTTACC GGCATGATCA TGATTGAGGG CGTCAGCGGT 
TCGAGCGCGA TGAACTGGAG CGCCAACAAG AACGACAGCG AGCTGGTGAA GTTCTACGAC CCGGCCAACA ACTACGCATT CCAGCCGCAC CAGTACCTCG ATTCCAACGG CAGCGGGACG TTGGGTAGCT GCGCGGTCAA CAGTAACTCC CGGATCTCCT CGATCACCAA CTGGGCACGG ACGAACGGCA AGAAGCTGTT CCTCGGGGAA ATTGCATGGG GCGACGACAG TATTGCCGGG AATGAGCAGT GCGCGGTGGA ATACCAGCAG ATCATGACCC GCCTGACCGT CTCTGATGAC GATGTGTGGA TTGGCTACAC CTACTGGGGT GCAGGCCAGT TCTGGGCAGC AGGCTATCCC TTCAAGCTTG ACCCCTCCGC ATATGACGGC AGTGTTCCAG ACACGCAGCA GATGTATGAA CTGTTGCTGT ACAACCGATT TACAGCTTAG
|
Protein sequence | MFQPFVQTLK TSASTGVRWD LWYANSTGFY MNEGAEIALK ITVANPTAGC SIRCYCTGAA ATENNNNWSK SWTNMCTEEA ARHTGVTYTP QGDSTHPSSR TGIFTFAANY DGEPIILART AQYDLTTEGT VQLDTWIDNP SSGSILHGVV SLQIADVSVT PPGTPTFRVV SPGGFINEGD MVDITLQTEN ITPGTTCTVQ IVNTAANSAD WIEDNHTQWT NACAAVGCTY TIRTSSSGLI TFGSGYSDAN PIHYTRTSKA DNLTEAATEQ VDWIFANFSD SSMRIYSSAV SYWLIDTSQN PTAPAFQIKM SPNNPNPGDT ITYTLKSETG ATGGGTVVFA QGGDASDADF NISLDDLLTN LAATQPANMS YDTSTNTLTV TSSWTGTAST TRILNPATTA TKHTISLASA GGGAVLVVAD DVCYIGGAGA VAPFTYASGI NLSGADFGSF LINSNTVLDY YATKDFHVAR FPVKWEYLQS AAFGSLNTSN TNAFVAMVKY WTNTKGRFAI VDLHSYSKLN NVQIGITGSL VRPQALCDLW VKLVGALQAA SVDMSKVILD FMNEPTNLAA DWRRYAQATA NAVRARTTFT GMIMIEGVSG SSAMNWSANK NDSELVKFYD PANNYAFQPH QYLDSNGSGT LGSCAVNSNS RISSITNWAR TNGKKLFLGE IAWGDDSIAG NEQCAVEYQQ IMTRLTVSDD DVWIGYTYWG AGQFWAAGYP FKLDPSAYDG SVPDTQQMYE LLLYNRFTA
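The sequences above are shown in space-separated blocks of 10 residues, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the grouping, compute GC content, and run a loose check that a sequence looks like a coding sequence. The helper names are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
# Stop codons in translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and newlines used to group residues into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Loose check: whole codons only, and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First three blocks of the gene sequence above, used only as sample input.
sample = clean_sequence("ATGTTTCAGC CGTTCGTCCA AACCCTGAAA")
print(len(sample))                    # 30
print(round(gc_fraction(sample), 2))  # 0.47
```

Applying `gc_fraction` to the full cleaned gene sequence would be one way to compare against the GC content row; the result is not stated here because it has not been computed for this record.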