Gene Information |
Locus tag | Rleg_3645 |
Symbol | |
ID | 8014494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3681852 |
End bp | 3685178 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826208 |
Product | transglutaminase domain protein |
Protein accession | YP_002977427 |
Protein GI | 241206331 |
COG category | [S] Function unknown |
COG ID | [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
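The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3681852
END_BP = 3685178
REPORTED_GENE_LENGTH = 3327      # bp
REPORTED_PROTEIN_LENGTH = 1108   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, both computed values agree with the reported Gene Length and Protein Length fields.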
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.701627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCA AAGCCAGCAT CTATCATCTG ACGCATTACA TCTATGACAA GCCGGTTCGC CTCGGCCCAC AGATCATCAG ACTGAAACCT GCCTCGCATT CGAAGACACG GGTGCTCAGC CACTCGCTGA AGGTCACGCC TTCCAACCAT TTCGTCAATC TGCAGCAGGA CCCCTACGGC AACTACCTTG CCCGCTACGT CTTTCCGGAT CCGGTGACCG AGTTCAAGAT CGAGGTCGAT CTCGTCGCCG ACATGACGGT CTACAATCCG TTCGACTTCT TCGTCGAGGA AGAGGCAACC AAGTGGCCGT TCGGATACCC CGAAACGATC CAGGAGGATC TGTCGATCTA CATGACGCCG GAGCCGGCCG GTCCGCGCCT CAAGGCGCTG CTGCCGACGC TCGACTGGTC GCCCGACCAG CCGACGGTGG ACATGGTCGT CGGCCTGAAT GCCCGCCTGC AGCGGCAGAT CGGCTATGTC ATCCGCATGG AAACCGGCGT GCAGACGCCG GAGGAGACGC TGGAAAGCGC CAAGGGTTCC TGCCGCGACA CCAGCTGGCT GCTCGTCCAG ATCCTTCGCC ATCTCGGCCT CGCCGCCCGC TTCGTCTCCG GCTACCTCAT CCAGCTGACG CCGGACCTGA AGGCGCTTGA CGGACCCTCC GGCACCGAAG TCGATTTCAC CGACCTGCAT GCCTGGGCTG AGGTCTATCT GCCTGGTGCC GGCTGGGTCG GTCTCGATCC GACCTCCGGC CTGCTGACCG GGGAAAGCCA CATCCCGCTT GCCGCCACAC CGCATTTCCG AAATGCCGCT CCGATTTCCG GCGGTTATTT CGGCGAGGCC GAAACCGAAT TCGCCTTCGA CATGAAGGTA TCGCGCGTCG CCGAGCACCC GCGCATCACC AAGCCCTTCT CCGATGAGAG CTGGGACGAA CTCAACGAAC TCGGCGAGAA GGTCGATCGT GTCCTCGAAG CCGAGGACGT GCATCTGACG ATGGGCGGCG AACCGACCTT CGTTTCGATC GACGATTTCC AGTCCGAGGA GTGGAACACC GCCGCCGTCG GCCCGACCAA GCGCGAAAAG GCCGACGTCC TGATCCGCAA GCTGCAAGAA CGCTTCGCCC CCGGTGGTTT CCTGCATTAC GGCCAGGGCA AATGGTATCC GGGCGAAAGC CTGCCGCGCT GGACCTTCTC GCTCTACTGG CGCCGCGACG GCAAGCCGGT CTGGCAGAAC CTCGATCTGG TCGCCGCCGA AGGCAAGGAT ACCGGCGTTA CCGCCGAGGA TGCCGAAAAG CTCCTGACGG CGATCGCCAA GGAGCTGGCG ATCAAGCCTG ACATGGTGCA GCCGGCCTAT GAAGATCCGG CCGACTGGAT CATCAAGGAA GGCAACCTTC CCGAAAATGT CGATCCGGCG AATTCGAAGC TGAAGGATCC TGAGGAACGC AACCGCATCG CCCGCGTCTT CGCCCGCGGC CTGACCGTCG CGACCGGCTA CATTCTTCCG GTCCAGGCAT GGAACGCCAA GGCAGCGGAA AGCCGAGTCT GGGTCAGCGA GAAATGGCGC ACCCGCCGCG GCAAGATCTT CCTCGTCCCG GGCGACAGTC CGATCGGTTA CCGCCTGCCG CTCGGCACGT TGCCCTATGT TGCCCCGGCG CGTTACCCCT ATATCCACCC CGCCGATCCG ACGATCCCGC GCGAACCGCT GCCTGATGTC TTTGTACCGG CCGGCCGCGC CATGCCGGAA GCCTCTTTCC ATGCCGATGA GAGCAATCGC CGCCGTGTCG AACAGACGCT CGGCGAACTC 
CGCGGTGCCG TGCGCACCGC CATGTCGGTC GAGCCGCGCG ACGGCAGGCT CTGCATCTTC ATGCCGCCGG TCGAGCGCAT CGAGGATTAT CTCGAGCTGA TCGCCGCAGC CGAAAACGCC GCAGCCGAAC TCAAGCTTCC CGTCCATATC GAAGGTTATC CGCCGCCGCA GGACGAGCGC ATCAACGTCA TCCGCGTCGC CCCGGATCCC GGCGTCATTG AGGTCAACAT TCATCCGGCC TCGAACTGGA AGGACTGCGT CGACATTACG ACCGCGATCT ACGAGGAAGC GCGCGCCACC CGGCTCGGCG CCGACAAGTT CATGATCGAC GGTCGCCACA CCGGCACCGG CGGCGGCAAC CATGTCGTCG TAGGCGCGGC CAATCCGAAC AACAGCCCCT TCCTGCGCCG TCCCGATCTG TTGAAGAGCG TCGTCCTCCA CTGGCAGCGT CATCCCTCTC TCTCCTATCT CTTCTCCGGC ATGTTCATCG GCCCGACCAG CCAGGCGCCG CGCATCGACG AGGCTCGTCA CGACAGCCTC TATGAGCTGG AGATCGCCAT GGCGCAGATC GCTGCCCCCG GCAGCGGCCA GCAACCGCTG CCCTGGCTCG TTGACCGCCT GTTCCGTAAC CTTTTGACCG ACGTCACCGG CAATACCCAC CGTGCCGAGA TCTGCATCGA TAAGCTGTTC TCGCCCGACG GCCCGACCGG ACGGCTCGGC CTCATCGAGT TCCGCGGCTT CGAGATGCCG CCGAATGCGC GCATGTCGCT TGCCCAGCAA TTGCTGGTGC GCGCGCTGAT CGCTCGTTTT TGGGTCAACC CGGTCGACGG CAACTTCGTC CGCTGGGGCA CGACCCTGCA CGACCGTTTC ATGCTGCCGC ATTTCGTCTG GACCGATTTC CTCGACGTGC TCGCCGACCT TCGCCAAAAC GGCTTCGACG TCAGCCCGGA ATGGTTCAAG GCCCAGCTCG AATTCCGCTT CCCCTTCTGC GGCGAAGTCG AATACGAGGG CTCGAAGCTC GAACTGCGCC AGGCGCTGGA GCCTTGGCAT GTCATGGGCG AGGAAGGCGC GCCGGGCGGC ACCGTCCGAT ACGTCGACAG CTCCGTCGAA CGCCTTCAGG TGCGGCTCGA AACCAGCAAT ACCGCGCGTT ATACCGTCAC CTGCAATGGC CGCACCCTGC CGCTGACGCC GACCGGCACC GCTGGCGTGT CGGTCGCCGG CGTCCGTTAC AAGGCGTGGC AGCCATCCTC CGGTCTGCAT CCTGTCCTGC CGATCAACAC GCCTTTGACG TTTGACATTT ATGATACATG GTCGAAGCGT TCGATCGGCG GCTGCATCTA TCATGTTGCC CATCCGGGCG GCCGGACCTA TGACACCTTC CCGGTCAACG GCAACGAAGC GGAAGCAAGG CGGCTCGCCC GCTTCGAGCC GTGGGGACAT ACGGCGGGCG GTTATATCCC GCACGCCGAG ACGGGTTCGC TTGAATTCCC GCTGACGCTG GATCTGAGGC GGCCCGCCGG AATTTAA
|
Protein sequence | MSIKASIYHL THYIYDKPVR LGPQIIRLKP ASHSKTRVLS HSLKVTPSNH FVNLQQDPYG NYLARYVFPD PVTEFKIEVD LVADMTVYNP FDFFVEEEAT KWPFGYPETI QEDLSIYMTP EPAGPRLKAL LPTLDWSPDQ PTVDMVVGLN ARLQRQIGYV IRMETGVQTP EETLESAKGS CRDTSWLLVQ ILRHLGLAAR FVSGYLIQLT PDLKALDGPS GTEVDFTDLH AWAEVYLPGA GWVGLDPTSG LLTGESHIPL AATPHFRNAA PISGGYFGEA ETEFAFDMKV SRVAEHPRIT KPFSDESWDE LNELGEKVDR VLEAEDVHLT MGGEPTFVSI DDFQSEEWNT AAVGPTKREK ADVLIRKLQE RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RRDGKPVWQN LDLVAAEGKD TGVTAEDAEK LLTAIAKELA IKPDMVQPAY EDPADWIIKE GNLPENVDPA NSKLKDPEER NRIARVFARG LTVATGYILP VQAWNAKAAE SRVWVSEKWR TRRGKIFLVP GDSPIGYRLP LGTLPYVAPA RYPYIHPADP TIPREPLPDV FVPAGRAMPE ASFHADESNR RRVEQTLGEL RGAVRTAMSV EPRDGRLCIF MPPVERIEDY LELIAAAENA AAELKLPVHI EGYPPPQDER INVIRVAPDP GVIEVNIHPA SNWKDCVDIT TAIYEEARAT RLGADKFMID GRHTGTGGGN HVVVGAANPN NSPFLRRPDL LKSVVLHWQR HPSLSYLFSG MFIGPTSQAP RIDEARHDSL YELEIAMAQI AAPGSGQQPL PWLVDRLFRN LLTDVTGNTH RAEICIDKLF SPDGPTGRLG LIEFRGFEMP PNARMSLAQQ LLVRALIARF WVNPVDGNFV RWGTTLHDRF MLPHFVWTDF LDVLADLRQN GFDVSPEWFK AQLEFRFPFC GEVEYEGSKL ELRQALEPWH VMGEEGAPGG TVRYVDSSVE RLQVRLETSN TARYTVTCNG RTLPLTPTGT AGVSVAGVRY KAWQPSSGLH PVLPINTPLT FDIYDTWSKR SIGGCIYHVA HPGGRTYDTF PVNGNEAEAR RLARFEPWGH TAGGYIPHAE TGSLEFPLTL DLRRPAGI
|
| |
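The relationship between the gene sequence and the protein sequence can be illustrated by translating short fragments of the record. This is a minimal sketch rather than the database's own method: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper `translate` is hypothetical. The two fragments are copied from the first 30 and last 27 bases of the gene sequence above.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":            # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence in this record.
head = "ATGTCGATCA AAGCCAGCAT CTATCATCTG"
tail = "GATCTGAGGC GGCCCGCCGG AATTTAA"

head_protein = translate(head)
tail_protein = translate(tail)
print(head_protein)
print(tail_protein)
```

The two translated fragments can be compared with the first ten and last eight residues of the protein sequence listed above.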