Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4210 |
Symbol | |
ID | 6982983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4387611 |
End bp | 4389257 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398941 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002283698 |
Protein GI | 209551781 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGT ATGACAGGAA CAGGGTAAGC CGGCTCCCTG GCTGGCGCAG TTATGAGTCG TCTCAGACTG CGCCGGAAGG CGTGCGCATG CGCAGTCCCG TGGTCCGTCC GGACGATTTC GTCCCTCCGC CGCCCGAACC CGCCCCTCCT GCCTTTGTGC CGCCGCCGGC AAGTGTTGCG CAACCCCGCC AAGCCGAACG CCCGGCGCCC CCGTCCTCGG CTGTGCCCCC GACGGCGCCC ACGACGGCAC CTAGGAAACA GCCGGTCGTG GACGCGCCGC CGAACGCAGT GCCCGCCTCT GCCGCACCGC TTCTCGACCT CCGCTCAAGC GTCGCCGCGA TCTGGAGCCG GCGGTTGGTC GTGCTTGTTC TTGCCCTTCT CGGCGCCGTC GCCGGCGGGG TGGTGGCGCC CACCATCGGG CAAAAATTCA CCGCCGTCAG CAGCCTCTAT TTCGATCCGC GCCAGATCGG TCTTGCCGAT GCGGGCGGTC AGTCTTCGGG GCCTTCGCCG GAAATGATCT CGACTTTGAT CGACAGCCAG GTGCAGATCC TGACCTCCGG CAATGTACTG CGCCGCGTCG TCGAGACCAT GAAGCTCGAC CAGGATCCGG AATTCACCGG CGGCCGCACC GATGGCGCCG CCGTGATCGG CACTCTGCAG AAGGCGCTGG TCATTACCCG GCAGGCCAGC ACCTATGTCG TTTCGCTTGC CGCGACGACC AATGATCCCG AGAAATCGGC AAGACTGGCC AACCAGGTCG TCACCTCCTT CACCGAGGAG GAAAACAGCG CCTCGAACGG CATCTACGAA AACACCTCCT CGACGCTGGA CGGACGCCTC GACGATTTGC GGCAGAAGGT GCTGGAGGCT GAGCAGGCTG TCGAAACCTT CCGCGCCGAC AACGACATGG CCGCGACCGA GGGCAATCTG ATTTCCGATC AGCGGCTCGT CTCGCTGAAC ACGATGCTGG TGACGGCGCA GGAAAAAACC ATCCAGGCCA AGGCCCGCGC CGATGCCGTC GCCAATCTCC GCGTCGAGGA TATCGTTGCC GGCAACCAGG CGGAGGGCGG CGTCACTTCG CCGCTGGTCA GCCTGCGCCA GCAATATGCC ACCCAGGCCG CCGCCGTCGG CAGCCTCGAA AGCCAGATGG GTACGCGCCA TCCGCGCCTG CAGGCGGCCC GCTCCTCGCT GCAGAGCATA TCAGGCGAAA TCAAGGGCGA ACTGCAGCGT CTCGCTACCT CGGCAAGAGG CGAATACGAG CAGGCCAAGG CCGCCGAGGA CAGCATCGCC AAGGAGCTTG CCGTGCAGAA GGCGCTGCAG GCGAGCACCT CGGACAAGCA GGTGGAACTG AACGAATTGC AGCGCAAGGC GACGGCGGCG CGCGATATTT ACGAGACGGT GCTGAAGCGC TCCAGCCAGA CGAGCGAGGA GCAGAACCTC AACCAGAGCA ACATTCGCGT CATCTCGCCG GCCGAGCCGC CTGTGAAGGC CGACGGCCCG GGAAAGAAGA TCCTGCTCAT CGCCGGTATC ATCGGCGGTC TTCTCGCCGG TTTCGTCGTC GGCGCTGGTT TTGCGATCCT CGCCGCCCTC TTCAGCCACC CTGTCGTCAG AAGTTATTTC AGCAGGTCGC CCGCGACCAC CGCTTGA
|
Protein sequence | MNQYDRNRVS RLPGWRSYES SQTAPEGVRM RSPVVRPDDF VPPPPEPAPP AFVPPPASVA QPRQAERPAP PSSAVPPTAP TTAPRKQPVV DAPPNAVPAS AAPLLDLRSS VAAIWSRRLV VLVLALLGAV AGGVVAPTIG QKFTAVSSLY FDPRQIGLAD AGGQSSGPSP EMISTLIDSQ VQILTSGNVL RRVVETMKLD QDPEFTGGRT DGAAVIGTLQ KALVITRQAS TYVVSLAATT NDPEKSARLA NQVVTSFTEE ENSASNGIYE NTSSTLDGRL DDLRQKVLEA EQAVETFRAD NDMAATEGNL ISDQRLVSLN TMLVTAQEKT IQAKARADAV ANLRVEDIVA GNQAEGGVTS PLVSLRQQYA TQAAAVGSLE SQMGTRHPRL QAARSSLQSI SGEIKGELQR LATSARGEYE QAKAAEDSIA KELAVQKALQ ASTSDKQVEL NELQRKATAA RDIYETVLKR SSQTSEEQNL NQSNIRVISP AEPPVKADGP GKKILLIAGI IGGLLAGFVV GAGFAILAAL FSHPVVRSYF SRSPATTA
|
| |