Gene Rleg2_1155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1155 
Symbol 
ID6979875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1167739 
End bp1169988 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content59% 
IMG OID643395868 
Productglycoside hydrolase family 5 
Protein accessionYP_002280675 
Protein GI209548758 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.675797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCAGC CGTTCGTCCA AACCCTGAAA ACGTCCGCAA GCACGGGCGT TCGCTGGGAT 
CTGTGGTACG CGAACAGCAC CGGCTTTTAC ATGAATGAGG GGGCGGAAAT CGCCCTCAAG
ATCACGGTTG CCAACCCCAC GGCGGGCTGT AGCATCCGCT GCTACTGCAC TGGAGCAGCT
GCGACCGAGA ACAATAACAA CTGGTCGAAG AGCTGGACCA ATATGTGCAC GGAAGAGGCG
GCCCGGCACA CAGGCGTCAC CTATACGCCC CAAGGCGACA GCACGCACCC GTCCAGCCGC
ACCGGGATAT TCACCTTCGC GGCGAACTAC GATGGTGAGC CCATCATATT AGCCCGCACG
GCGCAATATG ACCTGACCAC GGAAGGCACC GTCCAGCTCG ATACGTGGAT TGATAACCCC
AGCAGCGGCT CGATCCTGCA CGGTGTTGTC AGCCTCCAGA TAGCCGATGT GTCGGTAACG
CCTCCCGGAA CTCCCACTTT CCGTGTCGTG TCGCCGGGTG GGTTCATCAA TGAGGGAGAT
ATGGTCGATA TCACCCTCCA GACGGAAAAC ATCACACCCG GCACGACCTG CACTGTCCAG
ATCGTCAACA CAGCGGCTAA CTCGGCGGAC TGGATTGAGG ACAATCACAC CCAGTGGACG
AACGCCTGCG CCGCAGTGGG ATGCACCTAC ACCATCCGCA CCAGCTCGTC CGGGCTCATC
ACCTTCGGCT CTGGCTACAG TGACGCCAAT CCCATCCATT ACACCCGCAC CTCGAAGGCG
GATAATTTGA CGGAAGCGGC TACGGAGCAG GTGGACTGGA TCTTCGCCAA CTTCAGCGAT
TCCTCTATGC GGATTTATTC CAGCGCCGTG TCCTACTGGC TTATCGACAC CAGCCAGAAC
CCGACGGCCC CTGCGTTCCA GATCAAGATG TCGCCGAATA ACCCGAACCC CGGCGATACC
ATCACGTACA CGCTGAAATC CGAGACGGGG GCGACGGGCG GGGGGACGGT GGTGTTTGCC
CAAGGCGGAG ACGCCAGCGA TGCTGACTTC AATATCAGCC TGGACGATCT GCTGACGAAT
CTCGCGGCAA CCCAGCCAGC CAACATGTCC TATGACACCT CGACCAACAC CCTGACCGTG
ACGAGCAGCT GGACAGGCAC GGCCAGCACG ACCCGCATTC TGAACCCGGC CACGACGGCA
ACAAAGCACA CCATCAGCCT TGCATCTGCC GGGGGAGGGG CGGTTTTGGT CGTTGCCGAT
GATGTTTGTT ACATCGGCGG TGCGGGAGCC GTCGCACCAT TCACTTATGC TTCGGGCATC
AACCTGTCCG GCGCGGATTT CGGCTCTTTC CTCATCAACT CGAATACGGT GCTGGACTAC
TACGCCACGA AGGACTTCCA CGTCGCCCGT TTTCCGGTGA AGTGGGAGTA TCTGCAAAGC
GCTGCCTTCG GCTCGCTCAA CACCAGCAAT ACCAACGCCT TTGTTGCGAT GGTCAAATAC
TGGACCAACA CCAAGGGCCG GTTCGCGATC GTTGACCTGC ACAGCTATTC AAAGCTCAAC
AACGTCCAGA TCGGCATCAC CGGCAGTCTG GTACGTCCGC AGGCGCTGTG TGATCTCTGG
GTGAAGCTGG TCGGTGCCTT GCAAGCCGCC AGCGTGGACA TGAGTAAGGT GATCCTCGAC
TTCATGAACG AGCCGACGAA CCTTGCGGCC GATTGGCGGC GGTATGCGCA GGCAACAGCG
AATGCTGTCC GCGCCCGCAC GACCTTTACC GGCATGATCA TGATTGAGGG CGTCAGCGGT
TCGAGCGCGA TGAACTGGAG CGCCAACAAG AACGACAGCG AGCTGGTGAA GTTCTACGAC
CCGGCCAACA ACTACGCATT CCAGCCGCAC CAGTACCTCG ATTCCAACGG CAGCGGGACG
TTGGGTAGCT GCGCGGTCAA CAGTAACTCC CGGATCTCCT CGATCACCAA CTGGGCACGG
ACGAACGGCA AGAAGCTGTT CCTCGGGGAA ATTGCATGGG GCGACGACAG TATTGCCGGG
AATGAGCAGT GCGCGGTGGA ATACCAGCAG ATCATGACCC GCCTGACCGT CTCTGATGAC
GATGTGTGGA TTGGCTACAC CTACTGGGGT GCAGGCCAGT TCTGGGCAGC AGGCTATCCC
TTCAAGCTTG ACCCCTCCGC ATATGACGGC AGTGTTCCAG ACACGCAGCA GATGTATGAA
CTGTTGCTGT ACAACCGATT TACAGCTTAG
 
Protein sequence
MFQPFVQTLK TSASTGVRWD LWYANSTGFY MNEGAEIALK ITVANPTAGC SIRCYCTGAA 
ATENNNNWSK SWTNMCTEEA ARHTGVTYTP QGDSTHPSSR TGIFTFAANY DGEPIILART
AQYDLTTEGT VQLDTWIDNP SSGSILHGVV SLQIADVSVT PPGTPTFRVV SPGGFINEGD
MVDITLQTEN ITPGTTCTVQ IVNTAANSAD WIEDNHTQWT NACAAVGCTY TIRTSSSGLI
TFGSGYSDAN PIHYTRTSKA DNLTEAATEQ VDWIFANFSD SSMRIYSSAV SYWLIDTSQN
PTAPAFQIKM SPNNPNPGDT ITYTLKSETG ATGGGTVVFA QGGDASDADF NISLDDLLTN
LAATQPANMS YDTSTNTLTV TSSWTGTAST TRILNPATTA TKHTISLASA GGGAVLVVAD
DVCYIGGAGA VAPFTYASGI NLSGADFGSF LINSNTVLDY YATKDFHVAR FPVKWEYLQS
AAFGSLNTSN TNAFVAMVKY WTNTKGRFAI VDLHSYSKLN NVQIGITGSL VRPQALCDLW
VKLVGALQAA SVDMSKVILD FMNEPTNLAA DWRRYAQATA NAVRARTTFT GMIMIEGVSG
SSAMNWSANK NDSELVKFYD PANNYAFQPH QYLDSNGSGT LGSCAVNSNS RISSITNWAR
TNGKKLFLGE IAWGDDSIAG NEQCAVEYQQ IMTRLTVSDD DVWIGYTYWG AGQFWAAGYP
FKLDPSAYDG SVPDTQQMYE LLLYNRFTA