Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4953 |
Symbol | |
ID | 8007546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 333587 |
End bp | 335143 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644821870 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_002973130 |
Protein GI | 241113295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.12169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCA AGTTTCGTCT GTCGTCCCTG ATCCCGGCCG GATTGATTGT TGAGCGTTCC GACGAGAGCA ACGGGGTTAT CATTGTTTCA GCGCGGGCTG CCGCCGATCG ACGTTCCTGC CCTTTATGCA ACCGAATGTC AGATCGTGTT CATAGCCGCT ACGTTCGGAA AATTGCTGAT TTGCCTTGTG CTGGCACCAG GGTCCAACTG CGACTATCGG CAAGGCGCTT TATCTGTGAG ATGACATTCT GCCGTCGCCG GATCTTCGTC GAAAGGTTTG GAGAGCTTGT CGTTCCGGAG CGCAGCCGTC GGACCGCTCG GCTTGATACC GTCGTACATC ATCTCGGGTT GGCTCTGGGA GGGCGGCCCG CAGCAGCCTT TGCCAAACGC TTGATGATCC CAGTCAGCAA TGACACGCTG ATCCGCGCGG TGAGAAGAAG ATCCGCTGCG TCGGATGACG CGCTAAGCGT CGTCGGCGTC GACGATTGGG CTTTCCGCCG CAATCACCGC TATGGCACCG TCGTATGTGA TCTTGAGAAG CGAAAGATTA TAAAGCTTCT GCCCGATCGA GAGATCGCGA CAGTTTCCAC CTTCCTTGCT CAGCACCCCG AAATTGCGAT TGTTTCCCGC GACCGAGGCG GTGGCTATCG TGAAGCTGCC GCCAAGGCTT TACCCCATGC CATGCAGGTC GCAGATCGTT GGCATTTGAT GGAGAACGCC AGCGCGGCGT TCCTCGACGT CGTGCGCAAA TCCATGCGAG CGATCCGTAC CGCAATTGGT GCTACGACGA TCAATCCTGC GCTACTCACC TGCGCTGAAC GACTGCAATA TGACAGCTAT CTGCGGCGTG AGGACGAGAA TTCGACGATC ACCAAGCTGT CCTCCGATGG CGTTCCAATA AAGGAAATCG TGCGACAAAC CGGCTATAGC CGCGGCACAG TTCGCCAGAT CGTCCGCGGT CACAGAACTG ATGTGTTCCG TGTCAGACAA AGTTCCCTTG AGGCCCATCT GCCGCTGTTA GATCAACTCT GGAGATCTGG GCAGCACAAC GGCGCTGAAC TCTGGCGACA GCTAAAGTGC AAAGGCTTCC GTGGTTGTTC TCGTGTCGTC GGCGAGTGGG CCGCAAGGCG ACGGCGGTCT GAGCGGATCT GCGACCAGCA ACTTCAAAAA GTGCCATCCG CCAGAACGAT TGCCCGATTG ATGACAACTG CTCGCGATCA ACTGAGTAAA GCTGACACCA TCACCATTGC GGCCATAGAA GCCGGCGTTC CCGCCTTGAT CCAAGCCCGC AATCTAATTG ATCGGTTCCA GACAATGATC CGAAGGAAGG CCAGGACAGA ACTCGACCCA TGGATCGCTG ATGCGCGCGA CAGTCTGTTC GCCCCCTTCG CCAACGGGAT ACTAAAAGAC AAGGCAGCGG TGTCCGCCGC CATCACAGAA CCTTGGTCGA ACGGCCAGGT CGAAGGACAG ATCAACAAGC TGAAGCTCGT TAAAAGGCAA ATGTATGGGC GTGCCAAGCT GGATCTTCTT CAGGCACGAT TGATCGGGGC AATGTGA
|
Protein sequence | MATKFRLSSL IPAGLIVERS DESNGVIIVS ARAAADRRSC PLCNRMSDRV HSRYVRKIAD LPCAGTRVQL RLSARRFICE MTFCRRRIFV ERFGELVVPE RSRRTARLDT VVHHLGLALG GRPAAAFAKR LMIPVSNDTL IRAVRRRSAA SDDALSVVGV DDWAFRRNHR YGTVVCDLEK RKIIKLLPDR EIATVSTFLA QHPEIAIVSR DRGGGYREAA AKALPHAMQV ADRWHLMENA SAAFLDVVRK SMRAIRTAIG ATTINPALLT CAERLQYDSY LRREDENSTI TKLSSDGVPI KEIVRQTGYS RGTVRQIVRG HRTDVFRVRQ SSLEAHLPLL DQLWRSGQHN GAELWRQLKC KGFRGCSRVV GEWAARRRRS ERICDQQLQK VPSARTIARL MTTARDQLSK ADTITIAAIE AGVPALIQAR NLIDRFQTMI RRKARTELDP WIADARDSLF APFANGILKD KAAVSAAITE PWSNGQVEGQ INKLKLVKRQ MYGRAKLDLL QARLIGAM
|
| |