Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5451 |
Symbol | |
ID | 5319753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 416697 |
End bp | 418307 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777213 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_001314145 |
Protein GI | 150377550 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.268297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACGA AAACGAAATG GTCGCCCGGC CCGGGGGCAA AAGTTCTGGG TATCTCGCTC GATGATGATG ACGGTTGGGT TGTTTCCGCC GCCGGACCAG TTTTCGGCAT TTGCCCTGAC TGCGGACGGC GGACCCGACA TCGGCATGGC TGGTCCAACC GTAGCCTCCA AGATCTGCCG GTCCAGGGCA AAACCGTAAC GGTGAAGCTT CGGTTGAGCC GCTGGCGGTG CGCGCATCAG AAATGTGAAC GACAAACGTT CACCGACCGA CTGCCGACGA TTGCTTCCCC TTATGCGCGC CGGACAAGGA GGGTCTCCGA GATTGTCGGT CTGCTCGGCC ATAGCGCAGG CGGCCGGCCC GGCGAGCGCT TGATCCGACG GCTCGGCATG CCGGTCAGCG ACGACACGAT CCTGCGGCAG CTGAAGCGGG ATGCCGCGCT CGTTTCCTGC GATGCCACGA TCCGGGTCGT CGGCATAGAT GATTGGAGCT GGCGGCGATC GTGGCGCTAC GGCACGATGA TCGTCGACCT GGAGCGCCGT TCGGTTGTCG ATATTCTTGA GGACCGGAGC GTAGCGAGTG TCGCGCGATG GCTGGAGCAG CATCCTTCTG TCGAGATCGT GAGCCGAGAC CGAGGCGGGC TGTACGCACA AGCTGCTCGC GAGGGTGCGC CGCAAGCTCG TCAAGTCGCT GATCGGTTCC ATCTCATGCA GAATCTGCGA GTCGCAATCG AGGAGCAGAT GAGCCTTGGC GGCCGCGCCA CCGGACGAGC ATTGCTGCCG GATAAATGGA TCGGGAGCGC GCAAATCGAT CTGCTTCAGG ATGATCCGCA CGTTGACGCA AGGCACCGGC GCCGGGGGCG TCACGCTCAT CGAGAATCAC GGCAGGCGGT GTTCGATACA GTGCACGCTT TGAATGAGGA AGGTTTGTCC TGTTCGGAGA TCGCACGTCG CACCGGCTAC GGGCGGCGCA GCATCGCGAA ATGGCTGACT TTCGAAACGC CACCCGACCG ACAGAAGGCG GCGTTGAAGC CGACATCGCC CCTGTACTTT GAGGCGTTTC TCGCCGTGTG CTGGAAAGAT GGCAATCGCT GCGGACGGCA TCTGTTCTAC GATATCAAAC AGCGCGGCTA CACGGGCAGT TTCTCGAATC TCGAGCGGCT TCTCGCAAGC TGGCGCCGCT CGGAGAGATC GGTCGAGGGC AGCGCGTCGT CGGCTCCGAT CATCTCGCAT CAACGGGGCC GCGGTGTTGT CCCCATACGC GATCCGGAGA CCGGCCATGT GATCTCACCG GTGGTCGCAG CTGCCCTCTG CATCAAGCCG CGAAGCATGC TGACGATCAC TCAGGCGAGA AAGGTCGATG CCTTGAAACA GGGCGCGCCT GAGTTCGCTT TGATGCGCAG CCTTGGCATG CGCTTTCGCG GAATCTTTCG CAGCGGCGAT CCGGGCAAGC TTAACAGCTG GATTGACGAT GCCGTTAATT CCGGTTTGGT CGCAATTGAG CGGTTCGCAC GCGTCCTGCA CCGTGACATC GGCGCCGTCC GCAACGCCGT GGAACTCCCC TGGAGCAACG GCCAGGCGGA AGGCCAGATC AACCGTCTGA AGACAATTTA G
|
Protein sequence | MQTKTKWSPG PGAKVLGISL DDDDGWVVSA AGPVFGICPD CGRRTRHRHG WSNRSLQDLP VQGKTVTVKL RLSRWRCAHQ KCERQTFTDR LPTIASPYAR RTRRVSEIVG LLGHSAGGRP GERLIRRLGM PVSDDTILRQ LKRDAALVSC DATIRVVGID DWSWRRSWRY GTMIVDLERR SVVDILEDRS VASVARWLEQ HPSVEIVSRD RGGLYAQAAR EGAPQARQVA DRFHLMQNLR VAIEEQMSLG GRATGRALLP DKWIGSAQID LLQDDPHVDA RHRRRGRHAH RESRQAVFDT VHALNEEGLS CSEIARRTGY GRRSIAKWLT FETPPDRQKA ALKPTSPLYF EAFLAVCWKD GNRCGRHLFY DIKQRGYTGS FSNLERLLAS WRRSERSVEG SASSAPIISH QRGRGVVPIR DPETGHVISP VVAAALCIKP RSMLTITQAR KVDALKQGAP EFALMRSLGM RFRGIFRSGD PGKLNSWIDD AVNSGLVAIE RFARVLHRDI GAVRNAVELP WSNGQAEGQI NRLKTI
|
| |