Gene Rleg2_3343 details

Gene Information

Locus tag: Rleg2_3343
Symbol:
ID: 6982097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3442290
End bp: 3445616
Gene Length: 3327 bp
Protein Length: 1108 aa
Translation table: 11
GC content: 64%
IMG OID: 643398061
Product: transglutaminase domain protein
Protein accession: YP_002282836
Protein GI: 209550919
COG category: [E] Amino acid transport and metabolism
              [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
        [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
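
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a short Python sketch. It assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either point explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3442290, 3445616)
print(length_bp)                  # 3327
print(protein_length(length_bp))  # 1108
```

Both results match the Gene Length and Protein Length fields of the record.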


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.171972
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATCA AAGCCAGTAT CTATCATCTG ACACATTACA CTTATGACAA GCCGGTCCGC 
CTCGGCCCCC AGATCATCCG GCTGAAACCT GCCTCGCATT CGAAGACAAG GGTGCTCAGC
CACTCGCTGA AGGTCACGCC TTCAAACCAC TTCGTCAATC TGCAGCAGGA TCCCTACGGC
AATTACCTCG CCCGCTACGT TTTCCCGGAC CCGGTGACGG AGTTCAAGAT CGAGGTCGAT
CTCGTCGCCG ACATGACGGT CTATAATCCC TTCGACTTCT TCGTCGAGGA GGAGGCGACC
AAATGGCCCT TCGGCTATCC TGAAACGATC CAGGAAGACC TGTCGATCTA TATGACGCCG
GAACCGGCCG GCCCGCGGTT GAAGGCGCTG CTGCCGACGC TCGACTGGTC GCCCGGCCAG
CCGACGGTCG ATATGGTCGT CGGGTTGAAT GCCCGTCTGC AGCGGCAGAT CGGCTATGTC
ATCCGCATGG AAACCGGCGT GCAGACGCCG GAGGAGACGC TCGAAAGCGC CAAGGGCTCC
TGCCGCGACA CCAGCTGGCT GCTCGTCCAG ATCCTTCGCC ATCTCGGCCT TGCCGCCCGC
TTCGTCTCCG GTTACCTCAT CCAGCTGACG CCGGATCTGA AGGCGCTCGA TGGGCCCTCC
GGCACCGAAG TCGATTTCAC CGACCTGCAT GCCTGGGCCG AGGTCTATCT GCCCGGCGCC
GGCTGGGTCG GCCTCGATCC GACCTCCGGC TTGCTGACTG GCGAAAGCCA TATTCCGCTT
GCCGCTACGC CGCACTACCG CAATGCCGCC CCGATCTCCG GCGGTTATTT CGGCGAGGCC
GAAACCGAAT TCGCCTTCGA CATGAAGGTG TCGCGCGTCG CCGAACATCC GCGCATCACC
AAACCCTTCT CCGACGAAAG CTGGGACGAA CTCAACGAAC TCGGCGAGAA GGTCGACCGT
GTCCTCGAGG CCGAGGACTT GCACCTGACG ATGGGCGGCG AGCCGACCTT CGTTTCGATC
GATGATTTCC AGTCCGAGGA ATGGAACACC GCCGCCGTCG GCCCGACCAA GCGCGAAAAG
GCCGACAGCC TGATCCGCCG GCTGCAGGAA CGCTTCGCCC CCGGCGGCTT CCTGCATTAC
GGCCAGGGTA AATGGTATCC GGGCGAAAGC CTGCCGCGCT GGACCTTCTC GCTCTACTGG
CGCCGCGACG GCAAGCCGGT CTGGCAGAAC CTCGATCTGG TCGCCACCGA AGGCAAGGAT
ACCGGCGTCA CTTCAGAGGA TGCCGAGAAG CTGCTGACGG CGATCGCAAG AGAACTGGCG
ATCAAGCCCG ACATGGTGCA GCCGGCCTAT GAGGATCCGG CCGACTGGAT CATCAAGGAA
GGCAATCTTC CCGAAAACGT CGATCCGGCG AATTCGAAGC TGAAGGATCC CGAGGAGCGC
AACCGCATCG CCCGGGTCTT TGCCCGCGGC CTCACCGTCG CCACCGGTTA CGTCCTCCCG
GTTCAGGCAT GGAACGCCAA GGCGGCCGAA AGCCGCGTCT GGATCAGCGA GAAATGGCAG
AGCCGCCGCG GCCGGATCTT CCTCGTCCCG GGCGACAGCC CGATCGGTTA CCGTCTGCCG
CTCGGCACCC TGCCCTATGT TCCCCCGGCA CGTTATCCCT ATATCCACCC TGCCGATCCG
ACGATCCCGC ACGGACCGCT GCCCGATGTC TTCGTGCCGG CCGGCCGCGC CATGCCGGAA
GCCTCCTTCC AGGCTGGCGA GAGCAATCGC GGCCGGATCG AACAGACGCT CGGTGAGATC
GGCGGCGCCG TGCGCACCGC CATGTCGGTC GAGCCGCGAG ACGGCAGGCT CTGCGTCTTC
ATGCCGCCGG TCGAGCGCAT CGAAGATTAT CTCGAGCTGA TCGCCGCGGC CGAAAACGCC
GCCGCCGAAC TCAAGCTTCC CGTCCATATC GAAGGCTACC CGCCGCCGCA GGACGAGCGT
ATCAACGTCA TCCGCGTTGC CCCGGATCCC GGCGTCATCG AGGTCAACAT TCATCCGGCA
TCGAATTGGA AGGAATGCGT CGACATCACC ACGGCGGTCT ATGAAGAGGC GCGAGCCACA
CGGCTCGGCG CCGACAAGTT CATGATCGAC GGCCGTCATA CCGGCACCGG CGGCGGCAAC
CATGTCGTCG TCGGCAGCGC CAATCCGAAC AACAGTCCCT TCCTGCGCCG TCCCGATCTC
TTGAAGAGCG TCGTCCTCCA CTGGCAGCGC CACCCCTCGC TCTCCTACCT CTTCTCAGGC
ATGTTCATCG GCCCGACCAG CCAGGCGCCA CGCATCGACG AGGCCCGTCA CGACAGCCTG
TACGAGCTGG AGATCGCCAT GGCGCAGATC GCTGCCCCCG GCAGCAGTCA GCCGCCGCTG
CCGTGGCTGG TCGACCGGCT GTTCCGCAAC CTTCTGACCG ATGTCACCGG CAACACCCAC
CGCGCCGAGA TCTGCATCGA CAAGCTGTTT TCGCCCGACG GCCCGACTGG AAGGCTCGGC
CTCATCGAGT TCCGCGGCTT CGAGATGCCG CCGAACGCGC GCATGTCGCT TGCCCAGCAA
TTGCTGGTCC GCGCCCTGAT CGCCCGCTTC TGGGTCAACC CGGTCGACGG CAAATTCGTC
CGCTGGGGCA CAACCCTGCA CGACCGTTTC ATGCTGCCGC ACTTCGTCTG GGCCGACTTC
CTCGACGTGC TGGCCGACCT TCGCGAAAAC GGCTTCGACG TCAGCCCGGA ATGGTTCAAG
GCCCAGCTCG AATTCCGCTT CCCCTTCTGC GGCGAAGTCG AATACGAGGG CTCGAAGCTG
GAACTGCGCC AAGCGCTGGA GCCTTGGCAC GTCATGGGCG AAGAGGGCGC GCCGGGCGGC
ACGGTCCGCT ACGTCGACTC GTCCGTCGAG CGCCTTCAAG TGCGGCTCGA AACCAGCAAT
ACAGCGCGTT ATACCGTCAC CTGCAATGGC CGCACCCTGC CGCTGACGCC GACCGGCACG
GCGGGCGTTT CGGTCGCCGG CGTCCGTTAC AAGGCCTGGC AGCCGGCCTC CGGCCTGCAT
CCTGTGCTGC CGATCAACAC GCCTTTGACT TTTGACATTT ATGATACATG GTCGAAGCGT
TCGATCGGCG GCTGCATCTA TCATGTTGCC CATCCGGGCG GCCGGACCTA TGACACCTTC
CCGGTCAACG GCAACGAGGC GGAAGCAAGG CGGCTCGCCC GCTTCGAGCC GTGGGGACAT
ACGGCGGGCG GTTACATTCC GCGGGCCGAG ACGGGCTCGC TGGAATTCCC GCTGACGCTG
GATCTGCGGC GGCCCGCTGG AATTTGA
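
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any analysis. The following Python sketch shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and the example runs only on the first 10-base block rather than on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_block = clean_sequence("ATGTCGATCA")
print(gc_percent(first_block))  # 40.0
```

Passing the whole gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the 64% listed in the record.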
 
Protein sequence
MSIKASIYHL THYTYDKPVR LGPQIIRLKP ASHSKTRVLS HSLKVTPSNH FVNLQQDPYG 
NYLARYVFPD PVTEFKIEVD LVADMTVYNP FDFFVEEEAT KWPFGYPETI QEDLSIYMTP
EPAGPRLKAL LPTLDWSPGQ PTVDMVVGLN ARLQRQIGYV IRMETGVQTP EETLESAKGS
CRDTSWLLVQ ILRHLGLAAR FVSGYLIQLT PDLKALDGPS GTEVDFTDLH AWAEVYLPGA
GWVGLDPTSG LLTGESHIPL AATPHYRNAA PISGGYFGEA ETEFAFDMKV SRVAEHPRIT
KPFSDESWDE LNELGEKVDR VLEAEDLHLT MGGEPTFVSI DDFQSEEWNT AAVGPTKREK
ADSLIRRLQE RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RRDGKPVWQN LDLVATEGKD
TGVTSEDAEK LLTAIARELA IKPDMVQPAY EDPADWIIKE GNLPENVDPA NSKLKDPEER
NRIARVFARG LTVATGYVLP VQAWNAKAAE SRVWISEKWQ SRRGRIFLVP GDSPIGYRLP
LGTLPYVPPA RYPYIHPADP TIPHGPLPDV FVPAGRAMPE ASFQAGESNR GRIEQTLGEI
GGAVRTAMSV EPRDGRLCVF MPPVERIEDY LELIAAAENA AAELKLPVHI EGYPPPQDER
INVIRVAPDP GVIEVNIHPA SNWKECVDIT TAVYEEARAT RLGADKFMID GRHTGTGGGN
HVVVGSANPN NSPFLRRPDL LKSVVLHWQR HPSLSYLFSG MFIGPTSQAP RIDEARHDSL
YELEIAMAQI AAPGSSQPPL PWLVDRLFRN LLTDVTGNTH RAEICIDKLF SPDGPTGRLG
LIEFRGFEMP PNARMSLAQQ LLVRALIARF WVNPVDGKFV RWGTTLHDRF MLPHFVWADF
LDVLADLREN GFDVSPEWFK AQLEFRFPFC GEVEYEGSKL ELRQALEPWH VMGEEGAPGG
TVRYVDSSVE RLQVRLETSN TARYTVTCNG RTLPLTPTGT AGVSVAGVRY KAWQPASGLH
PVLPINTPLT FDIYDTWSKR SIGGCIYHVA HPGGRTYDTF PVNGNEAEAR RLARFEPWGH
TAGGYIPRAE TGSLEFPLTL DLRRPAGI