Gene Rleg2_4808 details

Gene Information

Locus tag: Rleg2_4808
Symbol:
ID: 6977902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 448375
End bp: 452364
Gene Length: 3990 bp
Protein Length: 1329 aa
Translation table: 11
GC content: 64%
IMG OID: 643393970
Product: amino acid adenylation domain protein
Protein accession: YP_002278788
Protein GI: 209546870
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
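
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(448375, 452364)
print(length)                  # 3990
print(protein_length(length))  # 1329
```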


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC AAATCACCAA TTTCGCCGAT GCTTCCCCGA CCGATGCCGC AATGGACGCG 
GTCTTCGAGA GCTTTCCGCT GACGATCGCG CAGAAGCGCA TCTGGTCGCT GGAGCAGATC
GGCAACTATA CCGTCTTTCC CGACCAGGTC ATCGGGCTGC GATTGAGCCC GGCGGTCGAC
GTCGAGACGA TTGCCGGCGC CTGCCGGGCG CTCGTGGCGC AAAATCCGTC GCTTGCCAGC
CGCTTCCGCC GGCTGGCGGG CGGCCGCATC GAGCAATATC GCGACGCATC GAACGCGGTG
CCGATCGAGA TCCTCGGCAA AGGCGGCGCG GCCCTTTCAG AGCCGGATGC CCTGGCCGCA
AGGAAAGCCT TCCGCGACAA GCGCTTCGAT CTCCTGGAGG AGCCCGGCGC GCGCATCCAG
ATCATCCGGC TTGCCCAGAG GCAGTCGCTG CTGACGATCG TGCTGCATCC GATCGTCTGC
GACGACCACG AAAAATCCGT TCTTGCCAGG TCGCTGGCGC GACTTCTCGA CGGCGAGCCC
GTTGGCGACA TGCATCCGGT AGAAATTTCG GCCGGAACGC GGGAGGCCGA TTGGCTGGAG
ACGGACGAGG CGCGCGAAGC GGTTGCCTAT TGGCGCGACA CCATCGGTTT CGACTATGCG
GCCTCGACCT TTCCCACGCG GTTCAACAGC GGCGGGCTGG CAGGCGTGGC GCGCGCGCAG
CATCGATTTT CGATCGAGCC CGACCTCTGG AGCAAACTCG AACGGCATGC GGGGGATAAG
GGAGTTCATG TCGAGCGCGT GCTGCACGCC GCCTTTTGCG CGCTGCTGGC ACGCTACAGC
GGCAACTACG CTTTGCTGAC CGGCCTTCTC GTCGCGCGGC CCGATAAGGA GCATTTTGCC
AGCCGCGGCC GTGCCGAGCA GGTGCTCCCG CTCATCCTAT CGCTGACGTC GAGACATTCC
CTCGACGATG TCATCGCGGC GATCTCGTCG GCGAGCGAGG AGGGGCAACG CCGGCTTCTG
CCGCTGGAGC GCATCACCCA GGAACTGGTC GTCGACGATG CCGCCGCCCA GGAAGCCGTG
GTCAAGGCGC TGTTCGAATT CCGCGGACCC TATGCGCTAC CCAGCCAGGC CACAAATCTG
GAGCCGCCGG GTGCCCGTGC CGACAGCGAG CTTTCGCTGG TGGTCGAGGC GCATTCGGCT
GGCGCATCTG CGCTGATCGA CTACGCGCAG GACCTCTATG ACGGCGCCCT GGTTGCCCGC
CTGGCCAGGC ATTTCGGCAA CGTGCTCGAA CAGATGGTCG CGCGGCCGAA TGTGCGGATC
AAGGATATCG AACTGGTGGG CAGCGAAGAG CTCGACTGGC TGTCGGCGCC TTACGAGGAT
GATGCCGTCA ACGACGACAG GCCGGTGCAT GAGCTGATCG CTGCCCATTC GCGCCGTACG
CCTGAGAAGA CCGCGATCGT TTACGGTGAC GAGGAATGGA GCCATGGCTG GCTGGAGACG
AACACCAACC GCCTCGGCCA TCGGCTGCGG CAGCTCGGCG TTCGCGCCGA AGTAACGGTC
GCAATCTTCA TCAAGCGCTC GCCGGAGGCG ATCGTCGGCA TTCTCGCGAC GCTGAAGGCC
GGCGGTGCCT ATATTCCCGT CGAGCCCGAC CATCCGCCGG TGCGCAACCA CCACATCTTG
CGCGACGGCG GCGTCAAGAT CGTCCTGACC CACAGCTGGC TGCGCCATCG GCTGCCTGAA
GAACTCGACG GCATTATCCT CGAACTCGAC AAGATCGACC TCGACGGAGA GCCGGATACG
CCGCTCTATG TTCCCACGCA CAAGGATCAG CTCGCCTATG TCATGTACAC GTCGGGCTCG
ACCGGATTGC CGAAGGGCGT CGCCGTCGAA CACGGGCCGC TGACGCACCA TCTGCAAAAC
ACCTCGCGTG TCTATGGCAT GAGCTCGGAA TCCCGTGAGC TGCCGTTCCT GCCCTTCAGC
TCAGATGGCG GCCATGAGCG GTGGATGAAC CCGCTGATGG AAGGCGGCAG CATCATCCTG
CCCGATCAGC CGCTCTGGAC GCCTGAGGAG ACATTGACGG CGATGCGCAA ACATGGCGCC
AACAATGCCA GCATTCCCAC AACCTATCTG CAGCAACTGG CGGAATGGGC CGATATCACC
GACGGCGCGC CGCCGATGCG ACTTTACTCC TTCGGCGGCG AGGGGCTTGC ACAACCGACC
TTCGATCTGC TGTCGCGGGC ACTGAAATCG GAATGGCTGA TCAACGGTTA CGGTCCCACA
GAAACCATCA TGACGCCGAT GGTCTGGAAG GTGAGGGCCG GTACGAAGTT CCAGGGCGTC
TATGCGCCGC TCGGCCGCGC CGTCGGGCTG CGGCGCGTCT ATGTGCTCGA CCCCGATCTC
AACCTGTGCC CGATCGGCGT CACCGGCGAA CTCTATATCG GGGGCGAGGG CATCGCGCGC
GGCTATCTCG GCAAGCCGGA TACGACGGCG GATCGTTTTA TTCCCGACCC ATTCTCCAAG
GAGGGCGGCC GGCTTTATCG CTCCGGCGAC CTGACGCGTT GGCGTGAGGA CGGAACCGTC
GAATTCGTCG GCCGCGTCGA CCATCAGGTG AAGCTGCGGG GGTACCGCAT CGAGCTCGGC
GAGATCGAAG CAGCCCTCCT CCAGCAGCCC GGCGTCGGTG AAGCGCTGGT CGTGCTGCGC
GATGACGATG CCGGCGAGAA GGCGCTGGTC GCCTATGTCG TTCCGAAGAA GGACGAGACG
TTAAACGTCG AGACCGTCCG CGTTGGCCTC GAACGCAGCC TGCCCTCCTA CATGGTGCCG
GCGGCGGTGG TCGAACTGGA AAAGATGCCG ACCAATCCGA ACAGCAAGCT CGATCGCTTC
GCGCTGCCCG CACCCCAGCC GGTCCGGCGC GAAATCGTCG AGCCGGCGAC CACACTCGAA
GAAGAGGTGC TGGATGTCTG GCGCCAGGTT CTCAAACTGG AAGCGATCAG TGTCGAGGAT
AATTTCTTCA CGATCGGCGG CAATTCGCTG GGCGCGATCC GCATTCTTTC GCAGCTGAGG
CAGCGCTGGC CGAAGACACC GCTCACCGTC GCCGATATCT TCAACAACCC GACCATCCGC
GCCTTTGCCG GCGTGATGGA GCAGGGCGCC GAACGGGAAC TGTCGGAGGT GATCGTGCTG
CGCGCTTCCG GCGCCAAGCC GCGGCTCTAT TGCTTCCCGG GGCTTCTCGT CAGCACCCGC
GAATATGTGA AGCTCGTCGA TTATCTCGGC GCAGACCAGC CGGCGACCGG CTTCATTTGC
CATTCGCTGT CCGAAAAGAA GGAAGTCGGC GCGCCGATCG AAGAGATCAT CGAGAGCTAT
GTCGATCATA TCAGGACCCA TAGCAAAGGC GCACCCTGCA CCTTCCTCGG CTGGTCCTGG
GGCGGGCTTT TGGCCTACGA GGCGGCCCGC ACGCTCGGCA ACGAGGTCGA CGTCAGGATG
ATGGCGATGG TCGATGTCTG CGATCTCGGC TCGGAATTCG CGATCGGCGC CAAGCCGAAG
TTCCGGCCGG GCGAGCGCGA CATGCTTCAC CGCGATGTTG AGGCATGGCT GCAAAAGACG
GAGATGCGTC CCGAATGGGA CCGGCTGCTC TCGACGATGG ATGCCGATAC CTACGATCAG
TTCCTGCGCT TCGTCGGCGA CGAGAAAGAC CCGCTTCCGA CCGACGGGCC GGATATCAGC
AGCCGCGAGC ATACGTTCTG GGTGCTGATC GACAATGCCC TGATCTTCCG CAAGCACCGG
CTCGTCCCTC ACGATGTGCC GATCTATCCC TGGGCAGCCG ATGACAGCCT CAACCGCGGC
CTCAACCTGA TCGATTGGCG CCGCCTATCG CCGCGGGCGC ATGCGGCCGA AATTATCACC
GGCACCAACC ATCTGCACAT GATCGGGTCT CCCGCCTTCC ATTCAAGGCT TGCCCTGCGT
CTTAAGGAAA CAGAGAAGGA TTTCGCATGA
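
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. A minimal sketch of how a GC fraction like the one in the GC content field could be recomputed from the text follows. The helper names are hypothetical, and the database's exact rounding rule is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Toy input in the same block layout as the record (not the full gene):
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence text to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the reported value.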
 
Protein sequence
MNKQITNFAD ASPTDAAMDA VFESFPLTIA QKRIWSLEQI GNYTVFPDQV IGLRLSPAVD 
VETIAGACRA LVAQNPSLAS RFRRLAGGRI EQYRDASNAV PIEILGKGGA ALSEPDALAA
RKAFRDKRFD LLEEPGARIQ IIRLAQRQSL LTIVLHPIVC DDHEKSVLAR SLARLLDGEP
VGDMHPVEIS AGTREADWLE TDEAREAVAY WRDTIGFDYA ASTFPTRFNS GGLAGVARAQ
HRFSIEPDLW SKLERHAGDK GVHVERVLHA AFCALLARYS GNYALLTGLL VARPDKEHFA
SRGRAEQVLP LILSLTSRHS LDDVIAAISS ASEEGQRRLL PLERITQELV VDDAAAQEAV
VKALFEFRGP YALPSQATNL EPPGARADSE LSLVVEAHSA GASALIDYAQ DLYDGALVAR
LARHFGNVLE QMVARPNVRI KDIELVGSEE LDWLSAPYED DAVNDDRPVH ELIAAHSRRT
PEKTAIVYGD EEWSHGWLET NTNRLGHRLR QLGVRAEVTV AIFIKRSPEA IVGILATLKA
GGAYIPVEPD HPPVRNHHIL RDGGVKIVLT HSWLRHRLPE ELDGIILELD KIDLDGEPDT
PLYVPTHKDQ LAYVMYTSGS TGLPKGVAVE HGPLTHHLQN TSRVYGMSSE SRELPFLPFS
SDGGHERWMN PLMEGGSIIL PDQPLWTPEE TLTAMRKHGA NNASIPTTYL QQLAEWADIT
DGAPPMRLYS FGGEGLAQPT FDLLSRALKS EWLINGYGPT ETIMTPMVWK VRAGTKFQGV
YAPLGRAVGL RRVYVLDPDL NLCPIGVTGE LYIGGEGIAR GYLGKPDTTA DRFIPDPFSK
EGGRLYRSGD LTRWREDGTV EFVGRVDHQV KLRGYRIELG EIEAALLQQP GVGEALVVLR
DDDAGEKALV AYVVPKKDET LNVETVRVGL ERSLPSYMVP AAVVELEKMP TNPNSKLDRF
ALPAPQPVRR EIVEPATTLE EEVLDVWRQV LKLEAISVED NFFTIGGNSL GAIRILSQLR
QRWPKTPLTV ADIFNNPTIR AFAGVMEQGA ERELSEVIVL RASGAKPRLY CFPGLLVSTR
EYVKLVDYLG ADQPATGFIC HSLSEKKEVG APIEEIIESY VDHIRTHSKG APCTFLGWSW
GGLLAYEAAR TLGNEVDVRM MAMVDVCDLG SEFAIGAKPK FRPGERDMLH RDVEAWLQKT
EMRPEWDRLL STMDADTYDQ FLRFVGDEKD PLPTDGPDIS SREHTFWVLI DNALIFRKHR
LVPHDVPIYP WAADDSLNRG LNLIDWRRLS PRAHAAEIIT GTNHLHMIGS PAFHSRLALR
LKETEKDFA
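
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the database's own pipeline. It builds the codon map from the usual compact TCAG-ordered amino acid string, which translation table 11 shares with the standard code for elongation codons. Table 11 also permits alternative start codons that are read as methionine, but this gene begins with ATG, so that case is not handled here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first protein block:
print(translate_cds("ATGAACAAGC AAATCACCAA TTTCGCCGAT"))  # MNKQITNFAD
# Last five codons of the gene, including the TGA stop:
print(translate_cds("AAGGATTTCGCATGA"))  # KDFA
```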