Gene Rleg_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3645 
Symbol 
ID8014494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3681852 
End bp3685178 
Gene Length3327 bp 
Protein Length1108 aa 
Translation table11 
GC content64% 
IMG OID644826208 
Producttransglutaminase domain protein 
Protein accessionYP_002977427 
Protein GI241206331 
COG category[S] Function unknown 
COG ID[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.701627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA AAGCCAGCAT CTATCATCTG ACGCATTACA TCTATGACAA GCCGGTTCGC 
CTCGGCCCAC AGATCATCAG ACTGAAACCT GCCTCGCATT CGAAGACACG GGTGCTCAGC
CACTCGCTGA AGGTCACGCC TTCCAACCAT TTCGTCAATC TGCAGCAGGA CCCCTACGGC
AACTACCTTG CCCGCTACGT CTTTCCGGAT CCGGTGACCG AGTTCAAGAT CGAGGTCGAT
CTCGTCGCCG ACATGACGGT CTACAATCCG TTCGACTTCT TCGTCGAGGA AGAGGCAACC
AAGTGGCCGT TCGGATACCC CGAAACGATC CAGGAGGATC TGTCGATCTA CATGACGCCG
GAGCCGGCCG GTCCGCGCCT CAAGGCGCTG CTGCCGACGC TCGACTGGTC GCCCGACCAG
CCGACGGTGG ACATGGTCGT CGGCCTGAAT GCCCGCCTGC AGCGGCAGAT CGGCTATGTC
ATCCGCATGG AAACCGGCGT GCAGACGCCG GAGGAGACGC TGGAAAGCGC CAAGGGTTCC
TGCCGCGACA CCAGCTGGCT GCTCGTCCAG ATCCTTCGCC ATCTCGGCCT CGCCGCCCGC
TTCGTCTCCG GCTACCTCAT CCAGCTGACG CCGGACCTGA AGGCGCTTGA CGGACCCTCC
GGCACCGAAG TCGATTTCAC CGACCTGCAT GCCTGGGCTG AGGTCTATCT GCCTGGTGCC
GGCTGGGTCG GTCTCGATCC GACCTCCGGC CTGCTGACCG GGGAAAGCCA CATCCCGCTT
GCCGCCACAC CGCATTTCCG AAATGCCGCT CCGATTTCCG GCGGTTATTT CGGCGAGGCC
GAAACCGAAT TCGCCTTCGA CATGAAGGTA TCGCGCGTCG CCGAGCACCC GCGCATCACC
AAGCCCTTCT CCGATGAGAG CTGGGACGAA CTCAACGAAC TCGGCGAGAA GGTCGATCGT
GTCCTCGAAG CCGAGGACGT GCATCTGACG ATGGGCGGCG AACCGACCTT CGTTTCGATC
GACGATTTCC AGTCCGAGGA GTGGAACACC GCCGCCGTCG GCCCGACCAA GCGCGAAAAG
GCCGACGTCC TGATCCGCAA GCTGCAAGAA CGCTTCGCCC CCGGTGGTTT CCTGCATTAC
GGCCAGGGCA AATGGTATCC GGGCGAAAGC CTGCCGCGCT GGACCTTCTC GCTCTACTGG
CGCCGCGACG GCAAGCCGGT CTGGCAGAAC CTCGATCTGG TCGCCGCCGA AGGCAAGGAT
ACCGGCGTTA CCGCCGAGGA TGCCGAAAAG CTCCTGACGG CGATCGCCAA GGAGCTGGCG
ATCAAGCCTG ACATGGTGCA GCCGGCCTAT GAAGATCCGG CCGACTGGAT CATCAAGGAA
GGCAACCTTC CCGAAAATGT CGATCCGGCG AATTCGAAGC TGAAGGATCC TGAGGAACGC
AACCGCATCG CCCGCGTCTT CGCCCGCGGC CTGACCGTCG CGACCGGCTA CATTCTTCCG
GTCCAGGCAT GGAACGCCAA GGCAGCGGAA AGCCGAGTCT GGGTCAGCGA GAAATGGCGC
ACCCGCCGCG GCAAGATCTT CCTCGTCCCG GGCGACAGTC CGATCGGTTA CCGCCTGCCG
CTCGGCACGT TGCCCTATGT TGCCCCGGCG CGTTACCCCT ATATCCACCC CGCCGATCCG
ACGATCCCGC GCGAACCGCT GCCTGATGTC TTTGTACCGG CCGGCCGCGC CATGCCGGAA
GCCTCTTTCC ATGCCGATGA GAGCAATCGC CGCCGTGTCG AACAGACGCT CGGCGAACTC
CGCGGTGCCG TGCGCACCGC CATGTCGGTC GAGCCGCGCG ACGGCAGGCT CTGCATCTTC
ATGCCGCCGG TCGAGCGCAT CGAGGATTAT CTCGAGCTGA TCGCCGCAGC CGAAAACGCC
GCAGCCGAAC TCAAGCTTCC CGTCCATATC GAAGGTTATC CGCCGCCGCA GGACGAGCGC
ATCAACGTCA TCCGCGTCGC CCCGGATCCC GGCGTCATTG AGGTCAACAT TCATCCGGCC
TCGAACTGGA AGGACTGCGT CGACATTACG ACCGCGATCT ACGAGGAAGC GCGCGCCACC
CGGCTCGGCG CCGACAAGTT CATGATCGAC GGTCGCCACA CCGGCACCGG CGGCGGCAAC
CATGTCGTCG TAGGCGCGGC CAATCCGAAC AACAGCCCCT TCCTGCGCCG TCCCGATCTG
TTGAAGAGCG TCGTCCTCCA CTGGCAGCGT CATCCCTCTC TCTCCTATCT CTTCTCCGGC
ATGTTCATCG GCCCGACCAG CCAGGCGCCG CGCATCGACG AGGCTCGTCA CGACAGCCTC
TATGAGCTGG AGATCGCCAT GGCGCAGATC GCTGCCCCCG GCAGCGGCCA GCAACCGCTG
CCCTGGCTCG TTGACCGCCT GTTCCGTAAC CTTTTGACCG ACGTCACCGG CAATACCCAC
CGTGCCGAGA TCTGCATCGA TAAGCTGTTC TCGCCCGACG GCCCGACCGG ACGGCTCGGC
CTCATCGAGT TCCGCGGCTT CGAGATGCCG CCGAATGCGC GCATGTCGCT TGCCCAGCAA
TTGCTGGTGC GCGCGCTGAT CGCTCGTTTT TGGGTCAACC CGGTCGACGG CAACTTCGTC
CGCTGGGGCA CGACCCTGCA CGACCGTTTC ATGCTGCCGC ATTTCGTCTG GACCGATTTC
CTCGACGTGC TCGCCGACCT TCGCCAAAAC GGCTTCGACG TCAGCCCGGA ATGGTTCAAG
GCCCAGCTCG AATTCCGCTT CCCCTTCTGC GGCGAAGTCG AATACGAGGG CTCGAAGCTC
GAACTGCGCC AGGCGCTGGA GCCTTGGCAT GTCATGGGCG AGGAAGGCGC GCCGGGCGGC
ACCGTCCGAT ACGTCGACAG CTCCGTCGAA CGCCTTCAGG TGCGGCTCGA AACCAGCAAT
ACCGCGCGTT ATACCGTCAC CTGCAATGGC CGCACCCTGC CGCTGACGCC GACCGGCACC
GCTGGCGTGT CGGTCGCCGG CGTCCGTTAC AAGGCGTGGC AGCCATCCTC CGGTCTGCAT
CCTGTCCTGC CGATCAACAC GCCTTTGACG TTTGACATTT ATGATACATG GTCGAAGCGT
TCGATCGGCG GCTGCATCTA TCATGTTGCC CATCCGGGCG GCCGGACCTA TGACACCTTC
CCGGTCAACG GCAACGAAGC GGAAGCAAGG CGGCTCGCCC GCTTCGAGCC GTGGGGACAT
ACGGCGGGCG GTTATATCCC GCACGCCGAG ACGGGTTCGC TTGAATTCCC GCTGACGCTG
GATCTGAGGC GGCCCGCCGG AATTTAA
 
Protein sequence
MSIKASIYHL THYIYDKPVR LGPQIIRLKP ASHSKTRVLS HSLKVTPSNH FVNLQQDPYG 
NYLARYVFPD PVTEFKIEVD LVADMTVYNP FDFFVEEEAT KWPFGYPETI QEDLSIYMTP
EPAGPRLKAL LPTLDWSPDQ PTVDMVVGLN ARLQRQIGYV IRMETGVQTP EETLESAKGS
CRDTSWLLVQ ILRHLGLAAR FVSGYLIQLT PDLKALDGPS GTEVDFTDLH AWAEVYLPGA
GWVGLDPTSG LLTGESHIPL AATPHFRNAA PISGGYFGEA ETEFAFDMKV SRVAEHPRIT
KPFSDESWDE LNELGEKVDR VLEAEDVHLT MGGEPTFVSI DDFQSEEWNT AAVGPTKREK
ADVLIRKLQE RFAPGGFLHY GQGKWYPGES LPRWTFSLYW RRDGKPVWQN LDLVAAEGKD
TGVTAEDAEK LLTAIAKELA IKPDMVQPAY EDPADWIIKE GNLPENVDPA NSKLKDPEER
NRIARVFARG LTVATGYILP VQAWNAKAAE SRVWVSEKWR TRRGKIFLVP GDSPIGYRLP
LGTLPYVAPA RYPYIHPADP TIPREPLPDV FVPAGRAMPE ASFHADESNR RRVEQTLGEL
RGAVRTAMSV EPRDGRLCIF MPPVERIEDY LELIAAAENA AAELKLPVHI EGYPPPQDER
INVIRVAPDP GVIEVNIHPA SNWKDCVDIT TAIYEEARAT RLGADKFMID GRHTGTGGGN
HVVVGAANPN NSPFLRRPDL LKSVVLHWQR HPSLSYLFSG MFIGPTSQAP RIDEARHDSL
YELEIAMAQI AAPGSGQQPL PWLVDRLFRN LLTDVTGNTH RAEICIDKLF SPDGPTGRLG
LIEFRGFEMP PNARMSLAQQ LLVRALIARF WVNPVDGNFV RWGTTLHDRF MLPHFVWTDF
LDVLADLRQN GFDVSPEWFK AQLEFRFPFC GEVEYEGSKL ELRQALEPWH VMGEEGAPGG
TVRYVDSSVE RLQVRLETSN TARYTVTCNG RTLPLTPTGT AGVSVAGVRY KAWQPSSGLH
PVLPINTPLT FDIYDTWSKR SIGGCIYHVA HPGGRTYDTF PVNGNEAEAR RLARFEPWGH
TAGGYIPHAE TGSLEFPLTL DLRRPAGI