Gene Rleg_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4500 
Symbol 
ID8015261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4634489 
End bp4636102 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content64% 
IMG OID644827076 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_002978277 
Protein GI241207181 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.177123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGT ATGACAGGAA CAGGATAAGC CGGCTCCCTG GCTGGCGCAG TTTTGAGCCG 
TCTCAGACTG CGCCGGAAGG CGTGGGCATG CGCAGTCCCG TCGTCCGCCC GGATGATTTC
GTCCGGCCAT CGCCCGAACC TGCCCCGCCC CCCTTCGTGC CGCCGGCAAG CATTGCGGAA
ACCCGCCAAT ATGAGCGTCC GGCACCTCAG CCTCAGCCTG CAAGACAGCC TGTCGCGGAC
GCGCCGCCGA ATGCAGAGTC CGCGCCGGCC GCACCGCTTC TCGACCTCCG CTCCAGTATC
GCCGCGATCT GGAGCCGGCG ACTGATCGTG CTGGGTTTGG CGCTTCTCGG AGCCCTCGCC
GGCGGGGCGG TGGCGCCGCG CATCGCCCAG AAATTCACCG CCATCAGCAG CCTCTATTTC
GATCCGCGCC AGATCGGCCT TGCCGATGCC GGCGCGCAAT CGTCGGGTCC CTCGCCGGAA
ATGATCTCGG CGCTGATCGA CAGCCAGGTG CAGATCCTGA CCTCGGGCAA TGTGCTGCGC
CGCGTCGCCG AAGCCATGAA GCTCGACCAA GATCCCGAAT TCACCGGCGG CCGCACGGAT
GGCGCCGCCG TGATCGGCAC TCTGCAGAAG GCGCTGGTCA TCACCCGTGA GGCCAGCACC
TATGTCGTCT CGCTTGCCGC CACGACCAAC GATCCCGAAA AATCCGCAAG GCTTGCCAAC
CAGGTCGTCA CCTCCTTCAC CGAGGAGGAG AACAGTGCCT CGAACGGCAT CTACGAAAAT
ACCTCCTCAA CGCTCGACGG TCGCCTCAAC GACCTGCGCC AGAAGGTGCT GGAGGCCGAG
CAGGCTGTCG AAACCTTCCG CGCCGACAAC GACATGGCCG CGACCGAGGG CAACCTGATT
TCCGATCAGC GGCTCGTTTC GCTGAACACG ATGCTGGTGA CGGCACAGGA AAAGACCATC
CAGGCAAAGG CGCGCGCCGA TGCCGTCGCC AACCTCCGCG TCGAGGACAT CGTCGCCGGC
AACCAGGCGG AGGGCGGCGT CACCTCGCCG CTCGTCAGCC TGCGTCAGCA ATATGCCACC
CAGGCCGCCG CCGTCGGCAG CCTCGAAAGC CAGATGGGCA CGCGTCATCC GCGCCTGCAG
GCCGCCCGCT CCTCGCTGCA GAGCATTGGC GGCGAAATCA AGGGCGAATT GCAGCGTCTC
GTCACCTCGG CAAGGGGCGA ATACGAGCAG GCAAAGGCCG CCGAGGACAG CATCGCCAAG
GAACTTGCCG TGCAGAAGGC GTTGCAGGCG AGTACCTCCG ACAAGCAGGT GGAATTGAAC
GAATTGCAGC GCAAGGCGAC GGCGGCGCGC GATATTTACG AGACGGTGCT GAAACGCTCT
AGCCAGACGA GCGAGGAGCA AAACTTCAAC CGGAGTAACA TTCGCGTCAT TTCGCCGGCC
GAGCCGCCGG TCAAGGGAGA CGGTCCCGGA AAGACGATTC TATTGGTTGC CGGCGTCATC
GGCGGTTTTC TCGCCGGTTT CGTCGTCGGC GCGGGCTTTG CGATTCTCGC CGGCCTCTTC
AGCCATCCCG TCATCAGAAG TTATTTCAGG AAGTCTCCCG CTGCGGCCGC TTGA
 
Protein sequence
MNQYDRNRIS RLPGWRSFEP SQTAPEGVGM RSPVVRPDDF VRPSPEPAPP PFVPPASIAE 
TRQYERPAPQ PQPARQPVAD APPNAESAPA APLLDLRSSI AAIWSRRLIV LGLALLGALA
GGAVAPRIAQ KFTAISSLYF DPRQIGLADA GAQSSGPSPE MISALIDSQV QILTSGNVLR
RVAEAMKLDQ DPEFTGGRTD GAAVIGTLQK ALVITREAST YVVSLAATTN DPEKSARLAN
QVVTSFTEEE NSASNGIYEN TSSTLDGRLN DLRQKVLEAE QAVETFRADN DMAATEGNLI
SDQRLVSLNT MLVTAQEKTI QAKARADAVA NLRVEDIVAG NQAEGGVTSP LVSLRQQYAT
QAAAVGSLES QMGTRHPRLQ AARSSLQSIG GEIKGELQRL VTSARGEYEQ AKAAEDSIAK
ELAVQKALQA STSDKQVELN ELQRKATAAR DIYETVLKRS SQTSEEQNFN RSNIRVISPA
EPPVKGDGPG KTILLVAGVI GGFLAGFVVG AGFAILAGLF SHPVIRSYFR KSPAAAA