Gene Information |
Locus tag | Rleg2_0509 |
Symbol | |
ID | 6979225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 525036 |
End bp | 526181 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643395221 |
Product | transposase IS4 family protein |
Protein accession | YP_002280032 |
Protein GI | 209548115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
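The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 525036
end_bp = 526181

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1146
print(protein_length)  # 381
```

Both results match the Gene Length and Protein Length rows of the record.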
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0772109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCATG ACAATAGCGT CTTTCATGAT CTGTTGAAGC GGATTCCGTG GACGGCTTTT GAAAGACTTG TGGAGGAGCA TCAGGCCGAT AAGCATGTGC GGCGGCTGTC GACCAAGAGC CAGTTGATCG CCCTGCTTTA TGGCCAGCTT GCCGGTGCCA CCAGCCTGCG TGAGATCGTT GGCTCCCTGC AGAGCCACGG CACCCGCCTT TACCATCTCG GCGCTCGTCC GGTGTCACGC TCGACATTCG CCGATGCCAA TGGCCTGCGT CCGAGCGCCG TTTTTGCCGA GTTGTTCGCG CAGATGGTGG CCCGCGCCGG GCGCGGGCTC AAGCGGGCCG TCGGCGAGGC GGTCTATCTA ATCGACGGCA GCAGTCTGAG CCTTGCCGGG GCGGGATCGC AGTGGGCCCG CTTTTCCGAT CAGGCCTGCG GCGCCAAGAT GCACGTCGTC TACGATGCCA ACGCCGAACG TCCGATCTAT GCGGCCGTCA CCCCGGCCAA TGTCAATGAC ATCACCGCGG CCAAGGAGAT GCCGATCGAG GCAGGCGCCA CCTATGTCTT CGATCTCGGC TACTACGACT TCGGCTGGTG GGCGAAGCTC AATGCCGCCG GCTGCCGCAT CGTCACCCGC CTCAAATCCC ACACGAGACT GACGGTGAGC GCCGAGCAGG CGGTGAACGA GGATGCCGGC ATCCTGTTCG ACCGTATCGG CCTGCTGCCG CAGCGCCAGG CCAAGAGTCG CCGCAACCCA ATGAACCGGC CGGTGCGCGA GATCGGCGTG CGGATCGAAA CCGGCAAGGT GTTGCGCATC TTCTCCAACG ATCTTACCGC CCCGGCCGAG GAGATCGCCG CGCTTTACAA GCGTCGCTGG GCGATCGAAC TGTTCTTCCG CTGGGTCAAG CAGACGCTGA AGATCCGCCA TTTCCTCGGC AACAACGAAA ATGCCGTGCG CATCCAGGTC GCCGTCGCCC TGATTGCCTA TCTGCTGCTG CAGATGGCAA AGGCCGACCA GACCAGCGTC ACAAGCCCGC TGGCATTCGC CCGCCTGGTG CGCGCCAACC TCATGCACCG CAAAAGGATC GACCGCCTGC TGAAACCAAG CCACAGCCCT CCCATCAGTC CCGCCCAGAT GAGCCTCCAA TGGTGA
|
Protein sequence | MRHDNSVFHD LLKRIPWTAF ERLVEEHQAD KHVRRLSTKS QLIALLYGQL AGATSLREIV GSLQSHGTRL YHLGARPVSR STFADANGLR PSAVFAELFA QMVARAGRGL KRAVGEAVYL IDGSSLSLAG AGSQWARFSD QACGAKMHVV YDANAERPIY AAVTPANVND ITAAKEMPIE AGATYVFDLG YYDFGWWAKL NAAGCRIVTR LKSHTRLTVS AEQAVNEDAG ILFDRIGLLP QRQAKSRRNP MNRPVREIGV RIETGKVLRI FSNDLTAPAE EIAALYKRRW AIELFFRWVK QTLKIRHFLG NNENAVRIQV AVALIAYLLL QMAKADQTSV TSPLAFARLV RANLMHRKRI DRLLKPSHSP PISPAQMSLQ W
|
| |
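The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 and last 9 bases of the gene sequence. It is a minimal example, not the database's own method: the `translate` helper and the `START_CODONS` set are hypothetical names, and `START_CODONS` lists only a commonly used subset of the table 11 start codons. The sequence is taken as printed, since it already reads in the coding direction even though the gene lies on the minus strand.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = seq.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("GTGCGGCATG ACAATAGCGT CTTTCATGAT"))  # MRHDNSVFHD
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAATGGTGA"))  # QW
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final two residues (Q, W) followed by the stop.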