Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4808 |
Symbol | |
ID | 6977902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 448375 |
End bp | 452364 |
Gene Length | 3990 bp |
Protein Length | 1329 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393970 |
Product | amino acid adenylation domain protein |
Protein accession | YP_002278788 |
Protein GI | 209546870 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC AAATCACCAA TTTCGCCGAT GCTTCCCCGA CCGATGCCGC AATGGACGCG GTCTTCGAGA GCTTTCCGCT GACGATCGCG CAGAAGCGCA TCTGGTCGCT GGAGCAGATC GGCAACTATA CCGTCTTTCC CGACCAGGTC ATCGGGCTGC GATTGAGCCC GGCGGTCGAC GTCGAGACGA TTGCCGGCGC CTGCCGGGCG CTCGTGGCGC AAAATCCGTC GCTTGCCAGC CGCTTCCGCC GGCTGGCGGG CGGCCGCATC GAGCAATATC GCGACGCATC GAACGCGGTG CCGATCGAGA TCCTCGGCAA AGGCGGCGCG GCCCTTTCAG AGCCGGATGC CCTGGCCGCA AGGAAAGCCT TCCGCGACAA GCGCTTCGAT CTCCTGGAGG AGCCCGGCGC GCGCATCCAG ATCATCCGGC TTGCCCAGAG GCAGTCGCTG CTGACGATCG TGCTGCATCC GATCGTCTGC GACGACCACG AAAAATCCGT TCTTGCCAGG TCGCTGGCGC GACTTCTCGA CGGCGAGCCC GTTGGCGACA TGCATCCGGT AGAAATTTCG GCCGGAACGC GGGAGGCCGA TTGGCTGGAG ACGGACGAGG CGCGCGAAGC GGTTGCCTAT TGGCGCGACA CCATCGGTTT CGACTATGCG GCCTCGACCT TTCCCACGCG GTTCAACAGC GGCGGGCTGG CAGGCGTGGC GCGCGCGCAG CATCGATTTT CGATCGAGCC CGACCTCTGG AGCAAACTCG AACGGCATGC GGGGGATAAG GGAGTTCATG TCGAGCGCGT GCTGCACGCC GCCTTTTGCG CGCTGCTGGC ACGCTACAGC GGCAACTACG CTTTGCTGAC CGGCCTTCTC GTCGCGCGGC CCGATAAGGA GCATTTTGCC AGCCGCGGCC GTGCCGAGCA GGTGCTCCCG CTCATCCTAT CGCTGACGTC GAGACATTCC CTCGACGATG TCATCGCGGC GATCTCGTCG GCGAGCGAGG AGGGGCAACG CCGGCTTCTG CCGCTGGAGC GCATCACCCA GGAACTGGTC GTCGACGATG CCGCCGCCCA GGAAGCCGTG GTCAAGGCGC TGTTCGAATT CCGCGGACCC TATGCGCTAC CCAGCCAGGC CACAAATCTG GAGCCGCCGG GTGCCCGTGC CGACAGCGAG CTTTCGCTGG TGGTCGAGGC GCATTCGGCT GGCGCATCTG CGCTGATCGA CTACGCGCAG GACCTCTATG ACGGCGCCCT GGTTGCCCGC CTGGCCAGGC ATTTCGGCAA CGTGCTCGAA CAGATGGTCG CGCGGCCGAA TGTGCGGATC AAGGATATCG AACTGGTGGG CAGCGAAGAG CTCGACTGGC TGTCGGCGCC TTACGAGGAT GATGCCGTCA ACGACGACAG GCCGGTGCAT GAGCTGATCG CTGCCCATTC GCGCCGTACG CCTGAGAAGA CCGCGATCGT TTACGGTGAC GAGGAATGGA GCCATGGCTG GCTGGAGACG AACACCAACC GCCTCGGCCA TCGGCTGCGG CAGCTCGGCG TTCGCGCCGA AGTAACGGTC GCAATCTTCA TCAAGCGCTC GCCGGAGGCG ATCGTCGGCA TTCTCGCGAC GCTGAAGGCC GGCGGTGCCT ATATTCCCGT CGAGCCCGAC CATCCGCCGG TGCGCAACCA CCACATCTTG CGCGACGGCG GCGTCAAGAT CGTCCTGACC CACAGCTGGC TGCGCCATCG GCTGCCTGAA GAACTCGACG GCATTATCCT CGAACTCGAC AAGATCGACC TCGACGGAGA GCCGGATACG CCGCTCTATG TTCCCACGCA CAAGGATCAG CTCGCCTATG TCATGTACAC GTCGGGCTCG ACCGGATTGC CGAAGGGCGT CGCCGTCGAA CACGGGCCGC TGACGCACCA TCTGCAAAAC ACCTCGCGTG TCTATGGCAT GAGCTCGGAA TCCCGTGAGC TGCCGTTCCT GCCCTTCAGC TCAGATGGCG GCCATGAGCG GTGGATGAAC CCGCTGATGG AAGGCGGCAG CATCATCCTG CCCGATCAGC CGCTCTGGAC GCCTGAGGAG ACATTGACGG CGATGCGCAA ACATGGCGCC AACAATGCCA GCATTCCCAC AACCTATCTG CAGCAACTGG CGGAATGGGC CGATATCACC GACGGCGCGC CGCCGATGCG ACTTTACTCC TTCGGCGGCG AGGGGCTTGC ACAACCGACC TTCGATCTGC TGTCGCGGGC ACTGAAATCG GAATGGCTGA TCAACGGTTA CGGTCCCACA GAAACCATCA TGACGCCGAT GGTCTGGAAG GTGAGGGCCG GTACGAAGTT CCAGGGCGTC TATGCGCCGC TCGGCCGCGC CGTCGGGCTG CGGCGCGTCT ATGTGCTCGA CCCCGATCTC AACCTGTGCC CGATCGGCGT CACCGGCGAA CTCTATATCG GGGGCGAGGG CATCGCGCGC GGCTATCTCG GCAAGCCGGA TACGACGGCG GATCGTTTTA TTCCCGACCC ATTCTCCAAG GAGGGCGGCC GGCTTTATCG CTCCGGCGAC CTGACGCGTT GGCGTGAGGA CGGAACCGTC GAATTCGTCG GCCGCGTCGA CCATCAGGTG AAGCTGCGGG GGTACCGCAT CGAGCTCGGC GAGATCGAAG CAGCCCTCCT CCAGCAGCCC GGCGTCGGTG AAGCGCTGGT CGTGCTGCGC GATGACGATG CCGGCGAGAA GGCGCTGGTC GCCTATGTCG TTCCGAAGAA GGACGAGACG TTAAACGTCG AGACCGTCCG CGTTGGCCTC GAACGCAGCC TGCCCTCCTA CATGGTGCCG GCGGCGGTGG TCGAACTGGA AAAGATGCCG ACCAATCCGA ACAGCAAGCT CGATCGCTTC GCGCTGCCCG CACCCCAGCC GGTCCGGCGC GAAATCGTCG AGCCGGCGAC CACACTCGAA GAAGAGGTGC TGGATGTCTG GCGCCAGGTT CTCAAACTGG AAGCGATCAG TGTCGAGGAT AATTTCTTCA CGATCGGCGG CAATTCGCTG GGCGCGATCC GCATTCTTTC GCAGCTGAGG CAGCGCTGGC CGAAGACACC GCTCACCGTC GCCGATATCT TCAACAACCC GACCATCCGC GCCTTTGCCG GCGTGATGGA GCAGGGCGCC GAACGGGAAC TGTCGGAGGT GATCGTGCTG CGCGCTTCCG GCGCCAAGCC GCGGCTCTAT TGCTTCCCGG GGCTTCTCGT CAGCACCCGC GAATATGTGA AGCTCGTCGA TTATCTCGGC GCAGACCAGC CGGCGACCGG CTTCATTTGC CATTCGCTGT CCGAAAAGAA GGAAGTCGGC GCGCCGATCG AAGAGATCAT CGAGAGCTAT GTCGATCATA TCAGGACCCA TAGCAAAGGC GCACCCTGCA CCTTCCTCGG CTGGTCCTGG GGCGGGCTTT TGGCCTACGA GGCGGCCCGC ACGCTCGGCA ACGAGGTCGA CGTCAGGATG ATGGCGATGG TCGATGTCTG CGATCTCGGC TCGGAATTCG CGATCGGCGC CAAGCCGAAG TTCCGGCCGG GCGAGCGCGA CATGCTTCAC CGCGATGTTG AGGCATGGCT GCAAAAGACG GAGATGCGTC CCGAATGGGA CCGGCTGCTC TCGACGATGG ATGCCGATAC CTACGATCAG TTCCTGCGCT TCGTCGGCGA CGAGAAAGAC CCGCTTCCGA CCGACGGGCC GGATATCAGC AGCCGCGAGC ATACGTTCTG GGTGCTGATC GACAATGCCC TGATCTTCCG CAAGCACCGG CTCGTCCCTC ACGATGTGCC GATCTATCCC TGGGCAGCCG ATGACAGCCT CAACCGCGGC CTCAACCTGA TCGATTGGCG CCGCCTATCG CCGCGGGCGC ATGCGGCCGA AATTATCACC GGCACCAACC ATCTGCACAT GATCGGGTCT CCCGCCTTCC ATTCAAGGCT TGCCCTGCGT CTTAAGGAAA CAGAGAAGGA TTTCGCATGA
|
Protein sequence | MNKQITNFAD ASPTDAAMDA VFESFPLTIA QKRIWSLEQI GNYTVFPDQV IGLRLSPAVD VETIAGACRA LVAQNPSLAS RFRRLAGGRI EQYRDASNAV PIEILGKGGA ALSEPDALAA RKAFRDKRFD LLEEPGARIQ IIRLAQRQSL LTIVLHPIVC DDHEKSVLAR SLARLLDGEP VGDMHPVEIS AGTREADWLE TDEAREAVAY WRDTIGFDYA ASTFPTRFNS GGLAGVARAQ HRFSIEPDLW SKLERHAGDK GVHVERVLHA AFCALLARYS GNYALLTGLL VARPDKEHFA SRGRAEQVLP LILSLTSRHS LDDVIAAISS ASEEGQRRLL PLERITQELV VDDAAAQEAV VKALFEFRGP YALPSQATNL EPPGARADSE LSLVVEAHSA GASALIDYAQ DLYDGALVAR LARHFGNVLE QMVARPNVRI KDIELVGSEE LDWLSAPYED DAVNDDRPVH ELIAAHSRRT PEKTAIVYGD EEWSHGWLET NTNRLGHRLR QLGVRAEVTV AIFIKRSPEA IVGILATLKA GGAYIPVEPD HPPVRNHHIL RDGGVKIVLT HSWLRHRLPE ELDGIILELD KIDLDGEPDT PLYVPTHKDQ LAYVMYTSGS TGLPKGVAVE HGPLTHHLQN TSRVYGMSSE SRELPFLPFS SDGGHERWMN PLMEGGSIIL PDQPLWTPEE TLTAMRKHGA NNASIPTTYL QQLAEWADIT DGAPPMRLYS FGGEGLAQPT FDLLSRALKS EWLINGYGPT ETIMTPMVWK VRAGTKFQGV YAPLGRAVGL RRVYVLDPDL NLCPIGVTGE LYIGGEGIAR GYLGKPDTTA DRFIPDPFSK EGGRLYRSGD LTRWREDGTV EFVGRVDHQV KLRGYRIELG EIEAALLQQP GVGEALVVLR DDDAGEKALV AYVVPKKDET LNVETVRVGL ERSLPSYMVP AAVVELEKMP TNPNSKLDRF ALPAPQPVRR EIVEPATTLE EEVLDVWRQV LKLEAISVED NFFTIGGNSL GAIRILSQLR QRWPKTPLTV ADIFNNPTIR AFAGVMEQGA ERELSEVIVL RASGAKPRLY CFPGLLVSTR EYVKLVDYLG ADQPATGFIC HSLSEKKEVG APIEEIIESY VDHIRTHSKG APCTFLGWSW GGLLAYEAAR TLGNEVDVRM MAMVDVCDLG SEFAIGAKPK FRPGERDMLH RDVEAWLQKT EMRPEWDRLL STMDADTYDQ FLRFVGDEKD PLPTDGPDIS SREHTFWVLI DNALIFRKHR LVPHDVPIYP WAADDSLNRG LNLIDWRRLS PRAHAAEIIT GTNHLHMIGS PAFHSRLALR LKETEKDFA
|
| |