Gene Information |
Locus tag | Rleg_4321 |
Symbol | |
ID | 8015101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4435599 |
End bp | 4438979 |
Gene Length | 3381 bp |
Protein Length | 1126 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644826897 |
Product | hypothetical protein |
Protein accession | YP_002978100 |
Protein GI | 241207004 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | |
|
|
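The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table for Rleg_4321.
START_BP = 4435599
END_BP = 4438979
GENE_LENGTH_BP = 3381
PROTEIN_LENGTH_AA = 1126


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed values are mutually consistent: the coordinate span gives 3381 bp, and 3381 / 3 = 1127 codons, one of which is the stop.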
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACGA ATGCAGACGG CGCCGAAAGG GAACGGCTTC TGGCCATTCT CGATTTCTGG CACAAAATCG AGTTCTTCAT TCCTTACGAT CTCTCCAGTC GTATCGTCTC CGGTGAAGGC CGAAGTGTTT TCTGGTTGCA TGCGAAAACG CTCGGCGATG ACGGTGCCGC ACTCAGTCGT CCAGCGATCC CCGAGGAGAA GCAGATCACG GGTTTTACCC TGTTCCTCGG CGTTTTCAGC AAATCCGAAA TTGCCGATAT TCGCAGACAC TTCGATTGCG TTGCGACCGA TACCGCCGAA TATGACGACG CCGAACGCAG CGACCTCGAT GGCGATACCT GCTTTGCCAG CTTGCAGCTC TCTCCCTTGG GACAGCCGCT GTTCGAAACA TTTTCAGTTT CCACGCTGCC ATGGGCTCTG GGACGGGTGC GGAAATCCGG CCTCTCCTCG CTCGGCTACC AGGCATTTGC CGACAGCAAG CGGCAACTCT CGGAATTGCT TCAAAACTTT CGGGCGCAGC GGCATCTGAG ATCTTCATCC TCCGAGGACA CGACCGATCA GCCGATCGAT GCTGCCGAAA TCCTGACGCT GCATGAATTG CTCTGCGATT GGGCCGGCTT TGCCTCGAAG CAGGAGAAAC CCATTGCCGC TGTTGAAATG CGTTACCGGG ACCGGGTGGA AAAATCAGAG TTCATCTCCT TGCCACCACC GCCGGAAAGC AATGCGCTCG ATGCCGACGA CGAGGAGGAT GAGAGTACCA GTGCTGAAGA CGATATCGGT ATCTTGAACA GCTTCTTCAT CGAGGACATC GAGCGGGCGA TGATACGCGT CACGCATGGC GATATCCCCG CGCCGCTCAG GCAATATCTC ACCCCGCTGG TGCATGAGAA ACGGATCGAC CTCTACTCGG AGGATGGACG CCGGGCGATC GTTCGGGCCT TGCATCCTGG AAAACTCAAC CGCGGGCGCT GGCTCAGCGA ACCTCACCAC GCCATGAGCC TGATGCAACA GTTCGCGATC AATTCGGCAA TCGACGAGCT GTCGGAAACG GGACTGTTCT CGGTCAACGG CCCGCCAGGG ACGGGAAAGA CGACGCTGCT TCGCGACATG TTCGCAGATA ATATCGTTCG GCGCGCTCGC ATCCTGGCCT CCTTGAAGAC GGCGCGCGAA GCCTTCGATG GCGCGCCGCG CCGCATCATT TTCGCGGACC GGAGCACTGC CACGATATCG GCGCTGATCC CGGCCCTGGC AGGATTCGAA ATGGTTGTCG CCTCCTCCAA CAACGCCGCC GTGGAGATTA TTTCGCGCGA CCTTCCGAAG CGAAGCTCGG TCGCGAGAAC GTCTTCCTTC CAATATCTGC AGACCATCGC GCACAAAGTC GCTTGCCAGA AGGACAATGG GGCGGTCGTC AGGCTCTCGG ACGGCGACCG CCCATGGGGG CTAATCGCCT GTGCGCTCGG CAATTCCCGG AATCGCCGAG CTTTCAAGGA ACGTTTCGCC TTCATGGAGA TCGCCGAAAG GCCGAAGCCC GGCTGGTCCG GTGCCGACAA ACCACAGACC ATCTGGGAGT GGCTAAAGGG TTATAAGGGA CCAAACTTCG CCGAAGCATC GGCGGCATTT CAGGCCGCCG ACAAGGTGGT TCGCGACAAG ATCGGCGAAT ATGCACGCTA TGCCGATCTC CATGACGAAA TCGCCCTGGT TTCGCAGGAT GGTTTCTGCC GCGAGGCGCT TGAAAAGGTC AGAGCAGGTG CGGCCGAACT TCGACACGCA CAGGACCGAT GCGATATGGT TGCCGCCGAC 
ATGCTCCGCC TCAGGGAAGG GTTGTCTCCT CTGAAGGAGG AGGAACTGCT GCTCGACCGA GGCGCTCCTG CGTGGTGGGA GAAAGTGCTG TCGACCAACC CTGCCCGGCA GCATCGGCAC AATGTCGTCG CCAATGCACG AAGACAGCTG GAACTGCGCA AGGCGCTGGC GGAATGCGAA CATCACCTTA CGAAAACCCT CAGACCTGCG CTGGAACAGT CTTTGCGGAG ACATCGACGT GCCGAACAGG CACTGCGGTC GCAACGAGAA ATCTGGTCCA GAAAAAGGGA GGAGTTTGGC CGCCTAGGGG AAATTCTCAA TCACCCCACC CTACCTGGTC GCCTGACCGA TCTCGAGACC GATCAGTTCC AGATTGACGG TCTGTGGCAT CAGGACGAAC TGGCAGGTCT GCGTTCGGCA TTGTTGGAAG CAGCGTTGAC GCTGCATGAA GCGTGGCTGG CGGATGTGGG CAAGAAAGGC GGAGGTTTTG GAGGGAACAT CGTCGCCATC AACAAACTGC TTTCCAACAA CGGTCCCGCC GACGGCGAGC ACATTGCGTT GATCTGGCAG AGCCTTTTCA TGATCGTTCC CATCGTCTCG ACAACCTTCG CCTCCTTTGC CCGGCAGTTC CACGGTCTCG ACACCGGCTC GGTCGGCTGG GTCTTCATCG ATGAAGCTGG TCAGGCGGTG CCGCAGGCGG CCGTCGGCGC CCTGTTGCGG GCGCGGCGCG TCATGGTGAT CGGTGATCCC CAGCAGATCG AACCGGTCTT CACGCTGCCC AGCGCGCTGA TCACCGCCAC CTCGGCTCTC TCGCCGCATA CGGCGGCAGG ACAATATTCG CCGAACAGTG CGTCGGTGCA GATGTTGGCC GATGCCAACA ACCGCTATGG GACGACGGTC TCGGGCGAGG AGGCCGATGG GCTCTGGATC GGCAGTCCCC TCAGGGTACA TCGGCGCTGC ATCGATCCGA TGTTTGGCCT TGCCAACCAG ATCGCCTATC AGAACAAGAT GGTCTTCGGG CTGGAAGAGC GCCGGCCGGC CGGTGACGCT CCGCCGTTTT ATGGCGACAG CGCCTGGATC GACGTCAGGG GAAGGGTGTC AGGCAAACAG GCCGTTCCAG AACAAACGGG CTTCATCGTC GATCTCCTCA CCGCCACTTA TCGCCGGGAC GGCGGATTGC CAGACCTCTA TATCATTTCG CCGTTCAAGG AGATCAAGAA CAGTCTGAAG CAGGCTCTGG CCCATGCAAC ATGGGTTGAT TGGAACGGAA ATACGCGCTC GGCTCCGCCA AGACTGTCGA AGTGGCTGAA GGAAAGGATC GGCACGGTGC ATACCTTCCA GGGCAAGGAA GAAGACGTGG TTTTCATGGT GCTCGGCGCC GATGCCGCTC ATAGCGGGGC TGCAGCCTGG GCCGCATCGA AGCCGAACCT GTTGAACGTT GCCCTGACGC GCGCCAAACG CCGCTTCTAT ATCGTCGGCG ATCGCACCCT CTGGGAAACC CTGCCTTACT TCAGGGAGAC AGCGAGTGCC TTGAAGACAA TACAGGCAGC CGAATTCCTG GCCCGGAATG AATTGAACTG A
|
Protein sequence | MQTNADGAER ERLLAILDFW HKIEFFIPYD LSSRIVSGEG RSVFWLHAKT LGDDGAALSR PAIPEEKQIT GFTLFLGVFS KSEIADIRRH FDCVATDTAE YDDAERSDLD GDTCFASLQL SPLGQPLFET FSVSTLPWAL GRVRKSGLSS LGYQAFADSK RQLSELLQNF RAQRHLRSSS SEDTTDQPID AAEILTLHEL LCDWAGFASK QEKPIAAVEM RYRDRVEKSE FISLPPPPES NALDADDEED ESTSAEDDIG ILNSFFIEDI ERAMIRVTHG DIPAPLRQYL TPLVHEKRID LYSEDGRRAI VRALHPGKLN RGRWLSEPHH AMSLMQQFAI NSAIDELSET GLFSVNGPPG TGKTTLLRDM FADNIVRRAR ILASLKTARE AFDGAPRRII FADRSTATIS ALIPALAGFE MVVASSNNAA VEIISRDLPK RSSVARTSSF QYLQTIAHKV ACQKDNGAVV RLSDGDRPWG LIACALGNSR NRRAFKERFA FMEIAERPKP GWSGADKPQT IWEWLKGYKG PNFAEASAAF QAADKVVRDK IGEYARYADL HDEIALVSQD GFCREALEKV RAGAAELRHA QDRCDMVAAD MLRLREGLSP LKEEELLLDR GAPAWWEKVL STNPARQHRH NVVANARRQL ELRKALAECE HHLTKTLRPA LEQSLRRHRR AEQALRSQRE IWSRKREEFG RLGEILNHPT LPGRLTDLET DQFQIDGLWH QDELAGLRSA LLEAALTLHE AWLADVGKKG GGFGGNIVAI NKLLSNNGPA DGEHIALIWQ SLFMIVPIVS TTFASFARQF HGLDTGSVGW VFIDEAGQAV PQAAVGALLR ARRVMVIGDP QQIEPVFTLP SALITATSAL SPHTAAGQYS PNSASVQMLA DANNRYGTTV SGEEADGLWI GSPLRVHRRC IDPMFGLANQ IAYQNKMVFG LEERRPAGDA PPFYGDSAWI DVRGRVSGKQ AVPEQTGFIV DLLTATYRRD GGLPDLYIIS PFKEIKNSLK QALAHATWVD WNGNTRSAPP RLSKWLKERI GTVHTFQGKE EDVVFMVLGA DAAHSGAAAW AASKPNLLNV ALTRAKRRFY IVGDRTLWET LPYFRETASA LKTIQAAEFL ARNELN
|
| |
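The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short translation sketch. This is a minimal example under stated assumptions, not the pipeline used to produce the record. It assumes the Gene sequence is given in coding orientation (5' to 3', already reverse-complemented for this minus-strand gene), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
import itertools

# Codon-to-amino-acid mapping in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in this record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field.
first_bases = "ATGCAGACGA ATGCAGACGG CGCCGAAAGG"
print(translate(first_bases))  # MQTNADGAER
```

The ten residues produced from the first 30 bases match the first group of the Protein sequence field. Passing the full Gene sequence to `translate` should, under the same assumptions, reproduce the full protein.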