Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1105 |
Symbol | |
ID | 8012227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1087793 |
End bp | 1089292 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823688 |
Product | putative glycosyltransferase protein |
Protein accession | YP_002974939 |
Protein GI | 241203843 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00589779 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.39875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAGC GCGCGACGAG GACGATCAGA ACCGCGGGCC TGCTGCTTGC AGCCTATTTC GTGCTCAACA TCGCGCTGCG CATCGCGCTT CCGCATTCGC TGGAGCTCGA CGAGGCCGAG CAATCCTTCT TCTCGCAATA TCTGCTGGCC GGCTACGGCC CGCAGCCGCC CTTCTACAAC TGGATGCAAT ATGCCGTCGT TTCGGTGACG GGCATGTCGA TCGGCGCGCT GATCGTTCCG AAAAACATCC TGCTGTTCCT ATCCTATCTC TTCTATGGCC TTGCCGGGCG GCGCGCGCTG AAGGACGAGG CGCTTGCCGC CGTCGGCATG CTGGCGCTGA TCACCCTGCC CCAGGTCTCC TACATGGCCC AGCAGGATCT GACCCACACG ACGGCGCTGC TCTTTGCCAG TTCGCTGTTT CTCTACGGCT TCTTCCGCAC GCTCGACCGG CCCGATATGG CGAGCTACCT GCTGCTCGGG CTTGCGACCG GCATCGGGCT GATCTCGAAA TATAATTTCG CCCTGATGCC TGTCGTCGCC TTGATCGCCA TCCTGCCCGA TGCCGAATGG CGGCGCCGGG CGCTCGACTG GCGCATGCTG GCAGCGATCG CTGTTGCGCT CGTCATCATC CTGCCGCACG CGATCTGGCT GCAGGGCAAT CTCGCATTCG CCTCCTCCGA CACTCTGGTC AAGATGGCCG CCGGCAGCGA ACCCGCCGGC GCAGTGCGGA TCGGCAAAGG CCTTCTCGCC TTTCTCGTCG CCATCATCGC CTTTGCGGCG CTGCCGGTCA CTATCTTCGC CGCCACCTTC CGCCGGGATT TTGTCCGGGC GCTTTCGGCG GGCAACCGCT GGACTGCGAT GATGGAGCGG ATGATGCTCG CAAGTCTTGC CGGCATCGTT CTGATCGTGC TGTTCACCGG CTCTACCACG GTGCGCGAGC GCTGGCTCGA CCCCTTCCTG CTAGTGCTGC CGATCTATTT CCTGGCGAAG ATGCAGGCGG CCGGGCTCGA CCTTTCCGCC GGGCTGCGCC GCTTCCGGCC GGTGTTGCCG GTGCTGATGG CCTGCGTGCT GATCGCACTC GGCTTCCGCG TCGTCGGCGC CGGGCTGATC GGCACTTACA GCCGGCCGAA TGTGCCGATG GCGGGTTTTT CGCGCGAGAT GACGCAACAG GCACAACCGG CGCTGGTGAT CGCTTCCGAC ACCTATATCG GCGGAAACAT GCGGCTGCAA TTTCCCGATG TGCCGGTGGT GATCCCGGAT TTTCCGGCAC CGGGCATTCC GGCCTATGCC GAGGCCAAGG GGCCGGTGCT GATCGTCTGG CGCGGCAAGA AGACGGCGAC GGCTGCCGAT GCGGTGATGC CGGAGCGTTT TTCTTCGGCG CTGACGGCGG CTGATATCGC GCTGCAGGAG ATCGGCTCGC TGTCGCTTCC CTATTACTTC GGCCGCCAGG GCGACAATTT CGCGCTCGGC TACGCCTGGG TTCGGCCGGA GAGCAAATAG
|
Protein sequence | MLERATRTIR TAGLLLAAYF VLNIALRIAL PHSLELDEAE QSFFSQYLLA GYGPQPPFYN WMQYAVVSVT GMSIGALIVP KNILLFLSYL FYGLAGRRAL KDEALAAVGM LALITLPQVS YMAQQDLTHT TALLFASSLF LYGFFRTLDR PDMASYLLLG LATGIGLISK YNFALMPVVA LIAILPDAEW RRRALDWRML AAIAVALVII LPHAIWLQGN LAFASSDTLV KMAAGSEPAG AVRIGKGLLA FLVAIIAFAA LPVTIFAATF RRDFVRALSA GNRWTAMMER MMLASLAGIV LIVLFTGSTT VRERWLDPFL LVLPIYFLAK MQAAGLDLSA GLRRFRPVLP VLMACVLIAL GFRVVGAGLI GTYSRPNVPM AGFSREMTQQ AQPALVIASD TYIGGNMRLQ FPDVPVVIPD FPAPGIPAYA EAKGPVLIVW RGKKTATAAD AVMPERFSSA LTAADIALQE IGSLSLPYYF GRQGDNFALG YAWVRPESK
|
| |