Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3791 |
Symbol | |
ID | 6982554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3921118 |
End bp | 3922296 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398513 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002283279 |
Protein GI | 209551362 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.242824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.21889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGAACGA CATACCGCTT TTTGCGTGCC CATGTCCTGC AGCGGCTGAT CCCGCGGTCG CGTCTTGCCT TCAACCCGCG CCGGCCGGTG GAAATCGTCG GCTATCTCTC GATGGCTGTC GGCGTCGGCG AATCGGCAAG GCTTTGCGCC GGTGCTTTGT CGGAGGCGGG AAGGGCAATT TCGCTCTCCG ACGTCAGCAC GCATCCGGAC GAGAATTCCT TCCCCGGATG GACGTCGTCG CGTCTTTCCA CCGAGCCGCC GGGAAGCCGG ATCTGGCACC TCAATCCGCC GATGCTGCCG CGCGCCATCC TGAAGAAGGG TGTCGCCAAT TTCACCCGCG CCTTCAATAT CGGCTATTTC GCCTGGGAGC TGGAAGTGGT GCCGGCGGAA TGGCGCAATG CCATGCATTA CATGAACGCG GTCTTCGTGC CGTCGGAATT CACCCGGCGG GCGATTGCGC CGCTCACCAA GGCACCGGTC ATCGTCGTTC CGCATCCGGT GATCGAGGCG CCGGCGACCG AAGGCATGCG GGAGAGGTTC GGCATCGCGA AGGACGCCTT TCTCGTCAGC TTCATCTTCA GCGCCGGCTC GTCGATCAAC AGGAAAAATC CGCAGGCCGT CATCGAGGCT TTCAGGATAT TTGCCGCCGA AGCCCCAAGC GCCTTCCTGT TGATGAAGGC CAGCGGCGAT GTGAACAAGG ATGAAGCCCT GAGCGCGCTG GTCGCTTCGG TTGCGGGCGA CGCCCGGATC AGGATCGTCA CCGACAGGCT GTCGAACGCC GATATCAACG GCATCATCCG CTCTTCCGAT GCCTATCTTT CGCTGCATCG TTCCGAGGGT TTCGGGCTGA CGGTTGCCGA GGCGATCATG CAGGGCACAC CCGTTATCTC CACGGCCTGG TCGGGCACGG CGGATTTCTG CGACCCCTGC AATAGCTGGC TAGTCGCCTC TCCTCTGATC CCCGTCGTCG ATACCCATCC CGAATTTGTC GGGCTTGAGG GGGCGGTCTG GGCGGATCCC TCTCCGGAAG CCGCGGCCGC TCATTTGAGC GACATCTTCA GGGCGCCCGA GCTTGCGCTG GGGAAGGCCG AGAAAGCGCG GGAGTTCCTA CGGCACTATC TCGCTGAGAA CAGCTATGAG AGGGCGCTCC AGACGCTGGC GGCGATGCAG GCGAGCTAA
|
Protein sequence | MRTTYRFLRA HVLQRLIPRS RLAFNPRRPV EIVGYLSMAV GVGESARLCA GALSEAGRAI SLSDVSTHPD ENSFPGWTSS RLSTEPPGSR IWHLNPPMLP RAILKKGVAN FTRAFNIGYF AWELEVVPAE WRNAMHYMNA VFVPSEFTRR AIAPLTKAPV IVVPHPVIEA PATEGMRERF GIAKDAFLVS FIFSAGSSIN RKNPQAVIEA FRIFAAEAPS AFLLMKASGD VNKDEALSAL VASVAGDARI RIVTDRLSNA DINGIIRSSD AYLSLHRSEG FGLTVAEAIM QGTPVISTAW SGTADFCDPC NSWLVASPLI PVVDTHPEFV GLEGAVWADP SPEAAAAHLS DIFRAPELAL GKAEKAREFL RHYLAENSYE RALQTLAAMQ AS
|
| |