Gene Information |
Locus tag | Rleg_4903 |
Symbol | |
ID | 8007629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 281995 |
End bp | 283584 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821824 |
Product | transposase IS66 |
Protein accession | YP_002973084 |
Protein GI | 241113249 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.683483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCGAA CCGGCGAACC TGATGTTGCA GAGCTGATGG CGCAGTTGGC GGCAAGTGCT GCCGAAATCG CTGCGCTCAA AGCCGAGAAG GAAGCGCTGT CACGGCGGGT CGTCAAGCTG GAAGAGGAAC TGGCACTCGC AAGGCTGCAT CGGTTTGCAC CCCGCAGCGA AAAGCACGTC GATCGCCTCT TCAACGAAGC CGAAGAGGCC GCAGACGAGG ATGATCCTGA TCATGGCGAC GACGTCGCCG ATCTCCCGGA TACAGGCCTG CCTGCAGTCG AAAGTGCCGC GGGTAAAAAG CGGGGCCGCA GACCTCTGCC AGAAGACCTG CCCCGCGAGC GTGTCGAATA CGACCTTCCC GACGATCGGA AAGTTTGCCC TTGCTGCGAC AGTCAAATGC ATCGCATGGG CGAGGCCATT ACCGAGCAGC TTCATATCGA GGTCAAGGCT AAGGTTCTGC AGAATGTGCG GTTCAAATAC GCCTGCCGCC ATTGCGACCG CACCGGCATC AACACACCTG TCGTGATCGC ACCGATGCCG CCGCAACCAT TGCCGGGCAG CATCGCCACC GCCTCCACGC TGGCCTTCGC ACTCGTCCAC AAATATGTCG ACGGCACGCC GCTCTACCGC GTGGCGCAAA CGTTCGAACG GGCCGGCGTA CCGATCAGCC GAGGTGCTCT CGCGCACTGG GTGATCGGTT CGAGCGAGAG GCATTTGCAT CGCATCTACG ATGCGCTGAG ATTGCGGCTT CAGTCGCAGC CTCTCATTCA TGGTGACGAG ACGACAGTTC AGGTCTTGAA GGAAAAAGAC AAGGAAGCCA CCAGCACATC TTACATGTGG GCGTATCGCA GCAGCGACGA TAGTGAGGAG CCAATCGTGC TTCTCGACTA TCAACAGGGC CGCGGCCAGG TCCACCCGCA GACCTTTCTC GGTAACTATA GCGGCATATT GGTGACCGAT GGATACACCG CATGGCGCAC ATTGCATGGC GCAACCCATG TCGGATGCAT GGCCCATTCC CGGCGGCGCT TCGTCGAGGC TCTCAAAACC CGGAAGAATG GAGGCGGACC GCCGGAACAG GCGCTCCGGT TCTTCGAGCA GCTCTACCGT ATCGAAAAGC AGGCAAGAGA CCAAACGCCC GACGCCGGTG AAACGCAGGC CGATTGCAGT CGTCGTTTCC GGCAACAGCA CAGCTTGCCT GTCCTCATCG CCCTAAAGAC GTGGCTCGAC AATATCGCGC CGAAGGTCGT GCCGGATACC AAGCTAGGCG ATGCCGTGTC CTACACCCTG AACCAATGGG ATTACCTGAC GCGCTACATC AGCGACGGCA GGATCCCGAT CGATAACAAC ATTCTGGAAC GCGACATCAG AGTTTTTGCG ACCGGAAGAA AATCGTGGCT GTTCAGCGAT ACCGCTGACG GAGCCAGGGC CAGCGCCGTG ATCTACAGTC TGATGCTGAC CTGCCGCGCC TGTGGCGTCG ACCCACTCAC CTGGCTACGC CACGTGCTTG CTGAGTTGCC TCAGCGCGAA GAAGCAGCCG ACATCGGCGA CCTGCTGCCG TTCAACTTCT CCAAAGCCTC CGCTGCCTGA
Protein sequence | MTRTGEPDVA ELMAQLAASA AEIAALKAEK EALSRRVVKL EEELALARLH RFAPRSEKHV DRLFNEAEEA ADEDDPDHGD DVADLPDTGL PAVESAAGKK RGRRPLPEDL PRERVEYDLP DDRKVCPCCD SQMHRMGEAI TEQLHIEVKA KVLQNVRFKY ACRHCDRTGI NTPVVIAPMP PQPLPGSIAT ASTLAFALVH KYVDGTPLYR VAQTFERAGV PISRGALAHW VIGSSERHLH RIYDALRLRL QSQPLIHGDE TTVQVLKEKD KEATSTSYMW AYRSSDDSEE PIVLLDYQQG RGQVHPQTFL GNYSGILVTD GYTAWRTLHG ATHVGCMAHS RRRFVEALKT RKNGGGPPEQ ALRFFEQLYR IEKQARDQTP DAGETQADCS RRFRQQHSLP VLIALKTWLD NIAPKVVPDT KLGDAVSYTL NQWDYLTRYI SDGRIPIDNN ILERDIRVFA TGRKSWLFSD TADGARASAV IYSLMLTCRA CGVDPLTWLR HVLAELPQRE EAADIGDLLP FNFSKASAA
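
The coordinate and length fields of this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 281995
END_BP = 283584
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks compare a derived value with the field stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First two 10-base blocks of the gene sequence, spacing kept as in the record.
    print(gc_fraction("ATGACCCGAA CCGGCGAACC"))
```

Under these assumptions the record is internally consistent: 283584 - 281995 + 1 gives 1590 bp, and 1590 / 3 - 1 gives 529 aa. Passing the full Gene sequence field to `gc_fraction` would give a value to compare with the GC content field.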