Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2900 |
Symbol | |
ID | 6981644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2953140 |
End bp | 2954285 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643397610 |
Product | transposase IS4 family protein |
Protein accession | YP_002282394 |
Protein GI | 209550477 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00460334 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.33461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCATG ACAATAGCGT CTTTCATGAT GTGCTGAAGC GGATTCCGTG GGCGGTCTTC GAAAGACTTG TGGATGAGCA TCAGGCCGAC AAGCATGTTC GGCGGCTGTC GACGAAGAGC CAGTTGATCG CCCTGCTTTA CGGCCAGCTT GCCGGTGCCG TCAGCCTGCG TGAGATCGTC GGCTCCCTGG AAAGCCATAG CGCCCGCCTT TACCATCTCG GCGCTCGTCC GGTGTCGCGC TCGACGTTCG CCGATGCTAA CGGCCTGCGT CCGAGCACGG TTTTTGCCGA GTTGTTCGCG CAGATGGTGG CCCGCGCCGG GCGCGGCCTC AAGCGGGCCA TCGGCGAGGC GGTCTATCTG ATCGACGGCA GCAGTCTGAG CCTTGCCGGG GCGGGATCGC AGTGGGCCCG CTTTTCCGAT CAGGCCTGTG GTGCCAAGAT GCACGTCGTC TACGATGCCA ATGCCGAGCG ACCGATCTAT GCGGCCGTCA CCCCGGCCAA TGTCAACGAC ATCACCGCCG CCAAGGAGAT GCCGATCGAG GCGGGCGCCA CCTATGTCTT CGATCTCGGC TATTACGACT TCGGCTGGTG GGCGAAGCTC AATGCCGCCG GCTGCCGCAT CGTCAGCCGC CTCAAATCCC ACACGAAACT GACGGTGAGC GCCGAGCAGG CGGCAAATGC GGATGCCGGC ATCCTGTTCG ACCGCATCGG CCTGTTGCCG CAGCGCCAGG CCAAGAGCCG CCGCAACCCG ATGAATCGGC CGGTGCGCGA GATCGGCGTG CGGATCGAAA CCGGCAAGGT GCTGCGCATC TTCTCCAACG ATCTTACCGC CCCGGCCGAG GAGATCGCCG CGCTTTACAA GCGCCGCTGG GCGATCGAGC TGTTCTTCCG CTGGGTCAAA CAGACGCTGA AGATCCGCCA TTTCCTCGGC AATAGCGAAA ATGCCGTGCG CATCCAGGTC GCTGTCGCCC TGATCGCCTA TTTGCTGCTG CAGATGGCAA AGGCTGACCA GGCCACCGTC ACGAGCCCGC TGGCCTTCGC CCGCCTGGTG CGCACCAACC TGATGCACCG CAAAAGGATC GACCGCCTCC TAAAACCACG CCACAGCCCT CCCGGAAATC CCGGCCAGAT GAGCCTCCAA TGGTGA
|
Protein sequence | MRHDNSVFHD VLKRIPWAVF ERLVDEHQAD KHVRRLSTKS QLIALLYGQL AGAVSLREIV GSLESHSARL YHLGARPVSR STFADANGLR PSTVFAELFA QMVARAGRGL KRAIGEAVYL IDGSSLSLAG AGSQWARFSD QACGAKMHVV YDANAERPIY AAVTPANVND ITAAKEMPIE AGATYVFDLG YYDFGWWAKL NAAGCRIVSR LKSHTKLTVS AEQAANADAG ILFDRIGLLP QRQAKSRRNP MNRPVREIGV RIETGKVLRI FSNDLTAPAE EIAALYKRRW AIELFFRWVK QTLKIRHFLG NSENAVRIQV AVALIAYLLL QMAKADQATV TSPLAFARLV RTNLMHRKRI DRLLKPRHSP PGNPGQMSLQ W
|
| |