Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3542 |
Symbol | |
ID | 6982302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3671712 |
End bp | 3672989 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398266 |
Product | hypothetical protein |
Protein accession | YP_002283035 |
Protein GI | 209551118 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCGTA TTCCTCCCGT TACCGAAAGC ACCGACAGCA TTGCCAACCG CATGGTGCAT GATCTGGCCG CGCTCCATTT CGAAGCGCCG CGGGCCGAGG CGCGCGCCGA GATCGGCCGG CCGGGGCGCG AACTCTGCCT CTATCCCGGC AAGCTCGGCT ATGAACTTCA GGAAGAGCTC GACTTTCTCT CCAACCGGGC GATGGAACCG AACGTCTTTT TCTCCGGCCG CTTCCTCGCG CCCGCCATGC CGCGGCTCGA AGACCGGCAG GTGAATTTCG CCCTGATCCG CGACCACAAT GCCGGCCGCA GCCGCATGCG CTTTCTGATG CCGTTTTCGG TCGACAAGCC GGGTTTTGCC GTCGGCCCAT CGATCATCCG CGGCTGGTCG AACAGCTTCG GCCCGCTCGG CACGCCGCTC GTCGACAGCG AGGATGCCGC CGAGACCCTC GACAATCTTT TCGAGGGACT GACCGCCCGT GACCTCAATC TGCCGGGAAT CGTGGTCCTG CCGGATCTGA GGCTGAACGG TATCTTCGTG CGCATGATCA AGGCCGTGGC GCTCAGCCGC AATCTTCCCG TCACCGTGAC CAATCCCTAT TTGCGCCCGA TGCTGCAGAG CGACGAAGAG GCGCTGACCT ATCTTGGCCG CACCATCTCT TCCTCGCACA TGCGCGAGAT GCGCCGCCAG TGGCGTCTGC TGGAGGAACA GGGAACGGCG GTTTATGCCG TCGCCCGCCA GCCCCGCGAA ATTCACATCC GCTTCGAGGA ATTCCTGGCG ATGGAAGCCG GCGGCTGGAA GGGCAAGCGG CGAAGCGCGC TGGTCACCGA TCGCTATCAT ACCGCCTTTG CCCGCGAGGC GGTCTCGAAC CTTGCCGCCG TCGACGCCGT GCGTATTCAC ACGATCGATC TCAACGGCAA GGCGATCGCC GCCATCGTCG TGCTGATGAT GGGTGGCGAG GCCTATACGT GGAAGACGGC CTATGACGAG AATTATGCCC GCTATTCGCC GGGCAAGCTG CTGATGAGCG AACTCACCGA ATGGCATCTC GACGACGCCA ATATCGTCCG CTCCGATTCC TGCGCCGTCT CGGATCATCC GATCATGAGC CGCTTCTGGC AGGAACGCGA GGAGATGGGA ACGCTGGTAA TCGGCCTGAC GCAGAACAGC GACCGCGATA TGCGCCAAGT CTCAGCCCAG CTCCACATGT ACCGCAGCAC CCGCAACATG GCGAAGATGC TGCGCGAGAA GATCATGTCG CTCGCCGGCC GGGGCTGA
|
Protein sequence | MVRIPPVTES TDSIANRMVH DLAALHFEAP RAEARAEIGR PGRELCLYPG KLGYELQEEL DFLSNRAMEP NVFFSGRFLA PAMPRLEDRQ VNFALIRDHN AGRSRMRFLM PFSVDKPGFA VGPSIIRGWS NSFGPLGTPL VDSEDAAETL DNLFEGLTAR DLNLPGIVVL PDLRLNGIFV RMIKAVALSR NLPVTVTNPY LRPMLQSDEE ALTYLGRTIS SSHMREMRRQ WRLLEEQGTA VYAVARQPRE IHIRFEEFLA MEAGGWKGKR RSALVTDRYH TAFAREAVSN LAAVDAVRIH TIDLNGKAIA AIVVLMMGGE AYTWKTAYDE NYARYSPGKL LMSELTEWHL DDANIVRSDS CAVSDHPIMS RFWQEREEMG TLVIGLTQNS DRDMRQVSAQ LHMYRSTRNM AKMLREKIMS LAGRG
|
| |