Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3833 |
Symbol | |
ID | 8014660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3906864 |
End bp | 3908141 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826402 |
Product | hypothetical protein |
Protein accession | YP_002977615 |
Protein GI | 241206519 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCGTA TCCCCCCCGT TACCGAAAGC ACCGACAGCG CTGCGAACCG CATGGTGCAT GATCTCGCAG CACTGCATTT CGAAGCGCCG CAAGCCGAGG CTCGCGCCGA GATCGGCCGG CCGGGGCGCG AGCTCTGCCT CTATCCGGGT AAGCTCGGCT ATGAGCTTCA GGATGAGCTC GATTTCCTTT CCAACCGGGC GATGGAACCG AACGTCTTTT TCTCCGGCCG CTTCCTCGCC CCCGCCATGC CGCGGCTCGA AGACCGGCAG GTGAACTTCG CCCTGATCCG CGACCACAGT GCCGGCCGCA GCCGCATGCG CTTCCTTTTG CCGTTTTCGG TCGACAAGCC GGGTTTTGCC GTCGGCCCGT CGATCATCCG CGGCTGGTCG AACAGTTTCG GTCCGCTCGG CACGCCGCTC GTCGATGGCG AGGATGCCGC CGAGACCCTC GACAATCTCT TCGAGGGACT GACCGCCCGT GATCTCAATC TGCCCGGCAT ACTGGTTCTG CCGGATCTCA GGCTGAACGG TATCTTCGTG CGCATGGTCA AGGCCGTGGC GCTCAGCCGC AATCTTCCCG TCACCGTCAC CAATCCCTAC CTGCGCCCGA TGCTGCAGAG CGAGGAAGAG GCGCCGGCCT ATCTCAGCAA AACCATCGCC TCCTCGCATA TGCGCGAGAT GCGCCGCCAG TGGCGGCTGC TGGAGGAACG GGGAACGACG GTCTATACCG TCGCCCGCCA ACCGCGCGAA ATCCATATCC GCTTCGAGGA ATTCCTGGCG ATGGAAGCCG GTGGCTGGAA GGGCAAGCGG CGAAGCGCTC TCGTCACCGA TCGTTATCAC ACGGCCTTCG CCCGCGAGGC GGTATCGAAC CTTGCCGCCG TCGATGCCGT GCGTATTCAC ACGATCGATC TCAACGGCAA GGCGATTGCC GCCATCGTCG TGCTGATGAT GGGCGGCGAG GCCTATACCT GGAAGACTGC CTACGACGAA AACTATGCCC GCTATTCGCC TGGCAAGCTG CTGATGAGCG AACTCACCGA ATGGCATCTC GACGACGCCA ATATCATCCG CTCCGATTCC TGCGCGGTCT CGGATCATCC GATCATGAGC CGCTTCTGGC AGGAGCGCGA GGAGATGGGA ACGCTGGTAA TCGGCTTGAC GCAGAACAGC GACCGCGACA TGCGCCAGGT CACCGCCCAG CTGCACATGT ACCGCAGTAC CCGCAATATG GCGAAGATGC TGCGCGAAAA GATCATGTCG CTTGCCGGCC GGGGCTAA
|
Protein sequence | MVRIPPVTES TDSAANRMVH DLAALHFEAP QAEARAEIGR PGRELCLYPG KLGYELQDEL DFLSNRAMEP NVFFSGRFLA PAMPRLEDRQ VNFALIRDHS AGRSRMRFLL PFSVDKPGFA VGPSIIRGWS NSFGPLGTPL VDGEDAAETL DNLFEGLTAR DLNLPGILVL PDLRLNGIFV RMVKAVALSR NLPVTVTNPY LRPMLQSEEE APAYLSKTIA SSHMREMRRQ WRLLEERGTT VYTVARQPRE IHIRFEEFLA MEAGGWKGKR RSALVTDRYH TAFAREAVSN LAAVDAVRIH TIDLNGKAIA AIVVLMMGGE AYTWKTAYDE NYARYSPGKL LMSELTEWHL DDANIIRSDS CAVSDHPIMS RFWQEREEMG TLVIGLTQNS DRDMRQVTAQ LHMYRSTRNM AKMLREKIMS LAGRG
|
| |