Gene Rleg_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0445 
Symbol 
ID8011645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp462121 
End bp463842 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content55% 
IMG OID644823039 
ProductCapsule polysaccharide biosynthesis protein 
Protein accessionYP_002974293 
Protein GI241203197 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGC ATTTTATTCG CAGGGCGATC AGATTTGCCA AGAGAGTTGT GAGTTCGAAG 
CTGTCGCTGC AACAAAGTTC TGCGCCCGCG GCGGCTCAGC AGTTTTCGGA CTGGCGCCGC
CATCTCGGCA CGTCCATCGA TTGGAAAAGG GCCTTATCCT CTGTTTCGAA AGGCCAGGAG
GTTCTTATCG CGACGAGCGT CGGCGGTCTG TCGGCCGCGA CGATCCTCGA AGGAATGCTT
GGCGTCGCTC TTACCTTGCG CGGTGCGCGG GTTCGCTTTC TCTTATGTGA CGCAGTTCTT
CCGGCCTGTC TTCACATACA TGCCGGAAAA ATCAGGGATC CGGCCGTCAT CACAGAGTAC
AGGCTCAACA AGGAAATCTG TCCAGGTTGC ATCGGAAGGG GTAGATCGCA TTACGGCTCT
CTTGGACTGC CGGTCTCCTA CTACAGCGAC TTCATTTCTG AAGAAGAGAG ACGTGCATTG
CGCAAGACAG CCCGCGAGAT GCCGGTTTCG GAAATTCGCG GCTTTCGGCT GAAAGATATG
AATCTGGGCG AGCACGCGAT GGCAGGCACC TTGCGCTTCT TCGCTTCGGG AAACCTCCCG
GCAACCCAGG AAGCCGAAGA TGTATTGCGG CGCTATTTTG AGGCGGCTCT GATCACGGAG
ACGGTCATTC AGAGATACCA TGAGCAGTTT TCGCCGGAAG TCGCCGTGTT TCACCATGGT
ATCTACGTCC CTCAGGGGGT AATCGGGGAA GTCTGTCGTG CCCATGGCAC CCGAGTCGCC
AACTGGCAGG TCGGCTATCG CAAGAAGACC TTCATTTTCT CGCACAAGGA AACCTACCAC
CATACTCTGA TAAACGAGTC TACCGACTGC TGGACAGACG TTCCTTGGAG CGAGGCCACG
GAAAACGAGA TCATGTCCTA TCTCAAGAGC CGCTGGTACG GCAGCAATGA CTGGATATGG
TTTCATGATC AGCCGAAGCA TGACGCAGAA CTTATCGCCA AGGAGACCGG CATCGACTTT
TCGAAGCCCA CCATCAGTCT CCTGACCAAT GTCTTTTGGG ATGCGCAACT CCATTTCAAG
GCCAATGCCT TCAGGGACAT GCTCGACTGG GTGCTGCAGA GTATTGAGTA TTTTAAGGGG
CGTCCCGATC TTCAACTGGC TATTCGGATC CATCCTGCCG AAGTCCGTGG CGCCATCCCC
TCACGGCAGC CGCTTGTGGA TGAAATCCGC AAGGTTTATC CGACCCTGCC GGACAATGTT
TACGTCATAC CGCCGGACAG CCAGGTCAGT ACCTATGTTC TCTGTGAGAA CAGTGACACT
GTAGTTATCT ATGGGACAAA AACCGGCGTG GAGCTGACCG CCATGGGAAT TCCCGTGGTC
GTTGCAGGTG AGGCGTGGAT ACGTAATAAG GGCCTCACCA TGGATGCGAC CTCGCCGGAG
AATTATTTTG ACTTGCTCGA CCGGCTGCCG GTCGGCAAGC GGTTGGACGC CGATACGATC
AATCGGGCTA GAAAATACGC ATTCCATTTC TTTTTTCGAC GCTTCATCCC TATCGAGTTC
ATGGAACCAT CGAGCAATGA CGCTCCCTAC GAAATTAGGA TCAACGACCT GCAGGATCTG
CTTCCGGGCA GGGATGCGGG CCTTGATGTC CTATGCAACG GCATTCTTGA TGGAAGCGAG
TTCGTGTATC CGGCGGAAAA GTATATCGGG AGAACGCAGT GA
 
Protein sequence
MSMHFIRRAI RFAKRVVSSK LSLQQSSAPA AAQQFSDWRR HLGTSIDWKR ALSSVSKGQE 
VLIATSVGGL SAATILEGML GVALTLRGAR VRFLLCDAVL PACLHIHAGK IRDPAVITEY
RLNKEICPGC IGRGRSHYGS LGLPVSYYSD FISEEERRAL RKTAREMPVS EIRGFRLKDM
NLGEHAMAGT LRFFASGNLP ATQEAEDVLR RYFEAALITE TVIQRYHEQF SPEVAVFHHG
IYVPQGVIGE VCRAHGTRVA NWQVGYRKKT FIFSHKETYH HTLINESTDC WTDVPWSEAT
ENEIMSYLKS RWYGSNDWIW FHDQPKHDAE LIAKETGIDF SKPTISLLTN VFWDAQLHFK
ANAFRDMLDW VLQSIEYFKG RPDLQLAIRI HPAEVRGAIP SRQPLVDEIR KVYPTLPDNV
YVIPPDSQVS TYVLCENSDT VVIYGTKTGV ELTAMGIPVV VAGEAWIRNK GLTMDATSPE
NYFDLLDRLP VGKRLDADTI NRARKYAFHF FFRRFIPIEF MEPSSNDAPY EIRINDLQDL
LPGRDAGLDV LCNGILDGSE FVYPAEKYIG RTQ