Gene Rleg2_4210 details

Gene Information

Locus tag: Rleg2_4210
Symbol:
ID: 6982983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4387611
End bp: 4389257
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 65%
IMG OID: 643398941
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002283698
Protein GI: 209551781
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGT ATGACAGGAA CAGGGTAAGC CGGCTCCCTG GCTGGCGCAG TTATGAGTCG 
TCTCAGACTG CGCCGGAAGG CGTGCGCATG CGCAGTCCCG TGGTCCGTCC GGACGATTTC
GTCCCTCCGC CGCCCGAACC CGCCCCTCCT GCCTTTGTGC CGCCGCCGGC AAGTGTTGCG
CAACCCCGCC AAGCCGAACG CCCGGCGCCC CCGTCCTCGG CTGTGCCCCC GACGGCGCCC
ACGACGGCAC CTAGGAAACA GCCGGTCGTG GACGCGCCGC CGAACGCAGT GCCCGCCTCT
GCCGCACCGC TTCTCGACCT CCGCTCAAGC GTCGCCGCGA TCTGGAGCCG GCGGTTGGTC
GTGCTTGTTC TTGCCCTTCT CGGCGCCGTC GCCGGCGGGG TGGTGGCGCC CACCATCGGG
CAAAAATTCA CCGCCGTCAG CAGCCTCTAT TTCGATCCGC GCCAGATCGG TCTTGCCGAT
GCGGGCGGTC AGTCTTCGGG GCCTTCGCCG GAAATGATCT CGACTTTGAT CGACAGCCAG
GTGCAGATCC TGACCTCCGG CAATGTACTG CGCCGCGTCG TCGAGACCAT GAAGCTCGAC
CAGGATCCGG AATTCACCGG CGGCCGCACC GATGGCGCCG CCGTGATCGG CACTCTGCAG
AAGGCGCTGG TCATTACCCG GCAGGCCAGC ACCTATGTCG TTTCGCTTGC CGCGACGACC
AATGATCCCG AGAAATCGGC AAGACTGGCC AACCAGGTCG TCACCTCCTT CACCGAGGAG
GAAAACAGCG CCTCGAACGG CATCTACGAA AACACCTCCT CGACGCTGGA CGGACGCCTC
GACGATTTGC GGCAGAAGGT GCTGGAGGCT GAGCAGGCTG TCGAAACCTT CCGCGCCGAC
AACGACATGG CCGCGACCGA GGGCAATCTG ATTTCCGATC AGCGGCTCGT CTCGCTGAAC
ACGATGCTGG TGACGGCGCA GGAAAAAACC ATCCAGGCCA AGGCCCGCGC CGATGCCGTC
GCCAATCTCC GCGTCGAGGA TATCGTTGCC GGCAACCAGG CGGAGGGCGG CGTCACTTCG
CCGCTGGTCA GCCTGCGCCA GCAATATGCC ACCCAGGCCG CCGCCGTCGG CAGCCTCGAA
AGCCAGATGG GTACGCGCCA TCCGCGCCTG CAGGCGGCCC GCTCCTCGCT GCAGAGCATA
TCAGGCGAAA TCAAGGGCGA ACTGCAGCGT CTCGCTACCT CGGCAAGAGG CGAATACGAG
CAGGCCAAGG CCGCCGAGGA CAGCATCGCC AAGGAGCTTG CCGTGCAGAA GGCGCTGCAG
GCGAGCACCT CGGACAAGCA GGTGGAACTG AACGAATTGC AGCGCAAGGC GACGGCGGCG
CGCGATATTT ACGAGACGGT GCTGAAGCGC TCCAGCCAGA CGAGCGAGGA GCAGAACCTC
AACCAGAGCA ACATTCGCGT CATCTCGCCG GCCGAGCCGC CTGTGAAGGC CGACGGCCCG
GGAAAGAAGA TCCTGCTCAT CGCCGGTATC ATCGGCGGTC TTCTCGCCGG TTTCGTCGTC
GGCGCTGGTT TTGCGATCCT CGCCGCCCTC TTCAGCCACC CTGTCGTCAG AAGTTATTTC
AGCAGGTCGC CCGCGACCAC CGCTTGA
 
Protein sequence
MNQYDRNRVS RLPGWRSYES SQTAPEGVRM RSPVVRPDDF VPPPPEPAPP AFVPPPASVA 
QPRQAERPAP PSSAVPPTAP TTAPRKQPVV DAPPNAVPAS AAPLLDLRSS VAAIWSRRLV
VLVLALLGAV AGGVVAPTIG QKFTAVSSLY FDPRQIGLAD AGGQSSGPSP EMISTLIDSQ
VQILTSGNVL RRVVETMKLD QDPEFTGGRT DGAAVIGTLQ KALVITRQAS TYVVSLAATT
NDPEKSARLA NQVVTSFTEE ENSASNGIYE NTSSTLDGRL DDLRQKVLEA EQAVETFRAD
NDMAATEGNL ISDQRLVSLN TMLVTAQEKT IQAKARADAV ANLRVEDIVA GNQAEGGVTS
PLVSLRQQYA TQAAAVGSLE SQMGTRHPRL QAARSSLQSI SGEIKGELQR LATSARGEYE
QAKAAEDSIA KELAVQKALQ ASTSDKQVEL NELQRKATAA RDIYETVLKR SSQTSEEQNL
NQSNIRVISP AEPPVKADGP GKKILLIAGI IGGLLAGFVV GAGFAILAAL FSHPVVRSYF
SRSPATTA
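
The numeric fields in this record can be cross-checked against each other and against the sequence blocks. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to display a sequence in blocks of 10."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information fields above.
    length = gene_length(4387611, 4389257)
    print(length, expected_protein_length(length))
```

Under those assumptions the coordinates give 1647 bp and 548 residues, matching the Gene Length and Protein Length fields. Passing the full text of the gene sequence block to `gc_percent` would allow the same kind of check against the reported GC content.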