Gene Information |
Locus tag | Rleg2_3343 |
Symbol | |
ID | 6982097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3442290 |
End bp | 3445616 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398061 |
Product | transglutaminase domain protein |
Protein accession | YP_002282836 |
Protein GI | 209550919 |
COG category | [E] Amino acid transport and metabolism; [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
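The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3442290, 3445616)
print(length_bp)                  # 3327
print(protein_length(length_bp))  # 1108
```

Both results agree with the Gene Length (3327 bp) and Protein Length (1108 aa) fields listed above.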
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.171972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCA AAGCCAGTAT CTATCATCTG ACACATTACA CTTATGACAA GCCGGTCCGC CTCGGCCCCC AGATCATCCG GCTGAAACCT GCCTCGCATT CGAAGACAAG GGTGCTCAGC CACTCGCTGA AGGTCACGCC TTCAAACCAC TTCGTCAATC TGCAGCAGGA TCCCTACGGC AATTACCTCG CCCGCTACGT TTTCCCGGAC CCGGTGACGG AGTTCAAGAT CGAGGTCGAT CTCGTCGCCG ACATGACGGT CTATAATCCC TTCGACTTCT TCGTCGAGGA GGAGGCGACC AAATGGCCCT TCGGCTATCC TGAAACGATC CAGGAAGACC TGTCGATCTA TATGACGCCG GAACCGGCCG GCCCGCGGTT GAAGGCGCTG CTGCCGACGC TCGACTGGTC GCCCGGCCAG CCGACGGTCG ATATGGTCGT CGGGTTGAAT GCCCGTCTGC AGCGGCAGAT CGGCTATGTC ATCCGCATGG AAACCGGCGT GCAGACGCCG GAGGAGACGC TCGAAAGCGC CAAGGGCTCC TGCCGCGACA CCAGCTGGCT GCTCGTCCAG ATCCTTCGCC ATCTCGGCCT TGCCGCCCGC TTCGTCTCCG GTTACCTCAT CCAGCTGACG CCGGATCTGA AGGCGCTCGA TGGGCCCTCC GGCACCGAAG TCGATTTCAC CGACCTGCAT GCCTGGGCCG AGGTCTATCT GCCCGGCGCC GGCTGGGTCG GCCTCGATCC GACCTCCGGC TTGCTGACTG GCGAAAGCCA TATTCCGCTT GCCGCTACGC CGCACTACCG CAATGCCGCC CCGATCTCCG GCGGTTATTT CGGCGAGGCC GAAACCGAAT TCGCCTTCGA CATGAAGGTG TCGCGCGTCG CCGAACATCC GCGCATCACC AAACCCTTCT CCGACGAAAG CTGGGACGAA CTCAACGAAC TCGGCGAGAA GGTCGACCGT GTCCTCGAGG CCGAGGACTT GCACCTGACG ATGGGCGGCG AGCCGACCTT CGTTTCGATC GATGATTTCC AGTCCGAGGA ATGGAACACC GCCGCCGTCG GCCCGACCAA GCGCGAAAAG GCCGACAGCC TGATCCGCCG GCTGCAGGAA CGCTTCGCCC CCGGCGGCTT CCTGCATTAC GGCCAGGGTA AATGGTATCC GGGCGAAAGC CTGCCGCGCT GGACCTTCTC GCTCTACTGG CGCCGCGACG GCAAGCCGGT CTGGCAGAAC CTCGATCTGG TCGCCACCGA AGGCAAGGAT ACCGGCGTCA CTTCAGAGGA TGCCGAGAAG CTGCTGACGG CGATCGCAAG AGAACTGGCG ATCAAGCCCG ACATGGTGCA GCCGGCCTAT GAGGATCCGG CCGACTGGAT CATCAAGGAA GGCAATCTTC CCGAAAACGT CGATCCGGCG AATTCGAAGC TGAAGGATCC CGAGGAGCGC AACCGCATCG CCCGGGTCTT TGCCCGCGGC CTCACCGTCG CCACCGGTTA CGTCCTCCCG GTTCAGGCAT GGAACGCCAA GGCGGCCGAA AGCCGCGTCT GGATCAGCGA GAAATGGCAG AGCCGCCGCG GCCGGATCTT CCTCGTCCCG GGCGACAGCC CGATCGGTTA CCGTCTGCCG CTCGGCACCC TGCCCTATGT TCCCCCGGCA CGTTATCCCT ATATCCACCC TGCCGATCCG ACGATCCCGC ACGGACCGCT GCCCGATGTC TTCGTGCCGG CCGGCCGCGC CATGCCGGAA GCCTCCTTCC AGGCTGGCGA GAGCAATCGC GGCCGGATCG AACAGACGCT CGGTGAGATC 
GGCGGCGCCG TGCGCACCGC CATGTCGGTC GAGCCGCGAG ACGGCAGGCT CTGCGTCTTC ATGCCGCCGG TCGAGCGCAT CGAAGATTAT CTCGAGCTGA TCGCCGCGGC CGAAAACGCC GCCGCCGAAC TCAAGCTTCC CGTCCATATC GAAGGCTACC CGCCGCCGCA GGACGAGCGT ATCAACGTCA TCCGCGTTGC CCCGGATCCC GGCGTCATCG AGGTCAACAT TCATCCGGCA TCGAATTGGA AGGAATGCGT CGACATCACC ACGGCGGTCT ATGAAGAGGC GCGAGCCACA CGGCTCGGCG CCGACAAGTT CATGATCGAC GGCCGTCATA CCGGCACCGG CGGCGGCAAC CATGTCGTCG TCGGCAGCGC CAATCCGAAC AACAGTCCCT TCCTGCGCCG TCCCGATCTC TTGAAGAGCG TCGTCCTCCA CTGGCAGCGC CACCCCTCGC TCTCCTACCT CTTCTCAGGC ATGTTCATCG GCCCGACCAG CCAGGCGCCA CGCATCGACG AGGCCCGTCA CGACAGCCTG TACGAGCTGG AGATCGCCAT GGCGCAGATC GCTGCCCCCG GCAGCAGTCA GCCGCCGCTG CCGTGGCTGG TCGACCGGCT GTTCCGCAAC CTTCTGACCG ATGTCACCGG CAACACCCAC CGCGCCGAGA TCTGCATCGA CAAGCTGTTT TCGCCCGACG GCCCGACTGG AAGGCTCGGC CTCATCGAGT TCCGCGGCTT CGAGATGCCG CCGAACGCGC GCATGTCGCT TGCCCAGCAA TTGCTGGTCC GCGCCCTGAT CGCCCGCTTC TGGGTCAACC CGGTCGACGG CAAATTCGTC CGCTGGGGCA CAACCCTGCA CGACCGTTTC ATGCTGCCGC ACTTCGTCTG GGCCGACTTC CTCGACGTGC TGGCCGACCT TCGCGAAAAC GGCTTCGACG TCAGCCCGGA ATGGTTCAAG GCCCAGCTCG AATTCCGCTT CCCCTTCTGC GGCGAAGTCG AATACGAGGG CTCGAAGCTG GAACTGCGCC AAGCGCTGGA GCCTTGGCAC GTCATGGGCG AAGAGGGCGC GCCGGGCGGC ACGGTCCGCT ACGTCGACTC GTCCGTCGAG CGCCTTCAAG TGCGGCTCGA AACCAGCAAT ACAGCGCGTT ATACCGTCAC CTGCAATGGC CGCACCCTGC CGCTGACGCC GACCGGCACG GCGGGCGTTT CGGTCGCCGG CGTCCGTTAC AAGGCCTGGC AGCCGGCCTC CGGCCTGCAT CCTGTGCTGC CGATCAACAC GCCTTTGACT TTTGACATTT ATGATACATG GTCGAAGCGT TCGATCGGCG GCTGCATCTA TCATGTTGCC CATCCGGGCG GCCGGACCTA TGACACCTTC CCGGTCAACG GCAACGAGGC GGAAGCAAGG CGGCTCGCCC GCTTCGAGCC GTGGGGACAT ACGGCGGGCG GTTACATTCC GCGGGCCGAG ACGGGCTCGC TGGAATTCCC GCTGACGCTG GATCTGCGGC GGCCCGCTGG AATTTGA
|
Protein sequence | MSIKASIYHL THYTYDKPVR LGPQIIRLKP ASHSKTRVLS HSLKVTPSNH FVNLQQDPYG NYLARYVFPD PVTEFKIEVD LVADMTVYNP FDFFVEEEAT KWPFGYPETI QEDLSIYMTP EPAGPRLKAL LPTLDWSPGQ PTVDMVVGLN ARLQRQIGYV IRMETGVQTP EETLESAKGS CRDTSWLLVQ ILRHLGLAAR FVSGYLIQLT PDLKALDGPS GTEVDFTDLH AWAEVYLPGA GWVGLDPTSG LLTGESHIPL AATPHYRNAA PISGGYFGEA ETEFAFDMKV SRVAEHPRIT KPFSDESWDE LNELGEKVDR VLEAEDLHLT MGGEPTFVSI DDFQSEEWNT AAVGPTKREK ADSLIRRLQE RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RRDGKPVWQN LDLVATEGKD TGVTSEDAEK LLTAIARELA IKPDMVQPAY EDPADWIIKE GNLPENVDPA NSKLKDPEER NRIARVFARG LTVATGYVLP VQAWNAKAAE SRVWISEKWQ SRRGRIFLVP GDSPIGYRLP LGTLPYVPPA RYPYIHPADP TIPHGPLPDV FVPAGRAMPE ASFQAGESNR GRIEQTLGEI GGAVRTAMSV EPRDGRLCVF MPPVERIEDY LELIAAAENA AAELKLPVHI EGYPPPQDER INVIRVAPDP GVIEVNIHPA SNWKECVDIT TAVYEEARAT RLGADKFMID GRHTGTGGGN HVVVGSANPN NSPFLRRPDL LKSVVLHWQR HPSLSYLFSG MFIGPTSQAP RIDEARHDSL YELEIAMAQI AAPGSSQPPL PWLVDRLFRN LLTDVTGNTH RAEICIDKLF SPDGPTGRLG LIEFRGFEMP PNARMSLAQQ LLVRALIARF WVNPVDGKFV RWGTTLHDRF MLPHFVWADF LDVLADLREN GFDVSPEWFK AQLEFRFPFC GEVEYEGSKL ELRQALEPWH VMGEEGAPGG TVRYVDSSVE RLQVRLETSN TARYTVTCNG RTLPLTPTGT AGVSVAGVRY KAWQPASGLH PVLPINTPLT FDIYDTWSKR SIGGCIYHVA HPGGRTYDTF PVNGNEAEAR RLARFEPWGH TAGGYIPRAE TGSLEFPLTL DLRRPAGI
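The sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of that clean-up plus a GC-content calculation; the helper names are hypothetical, and the example runs on only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three 10-base blocks of the gene sequence, used as a short illustration.
fragment = clean_sequence("ATGTCGATCA AAGCCAGTAT CTATCATCTG")
print(len(fragment))                   # 30
print(round(gc_percent(fragment), 1))  # 40.0
```

Applying the same two functions to the full gene sequence would allow a comparison with the GC content field of the record.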