Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6340 |
Symbol | |
ID | 5320643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009622 |
Strand | + |
Start bp | 7027 |
End bp | 8619 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640777934 |
Product | transposase IS66 |
Protein accession | YP_001314866 |
Protein GI | 150378272 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.360304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGAA CCGGCGAACC GAGCGTTGCA GAGCTGATGG CGCAGTTGGC GGCCAATGCT GCCGAAATCG CCGCGCTGAA AGCCGAGAAG GAAACGCTCT CGCAGCGGGT CGTCAAGCTG GAGGAAGAGC TGGCGCTTGC ACGGCTGCAT CGCTTTGCGC CGCGCAGCGA AAAGCACGTT GATCGCATCT TCAACGAAGC CGAAGAGGCT GCGGACGAGG ATGGAGCGGA CAGCCACGAG AGCGAGGTCA TCGACATTCC GGACACAGGT CTGCCGCCTG TCGAAAGCAC GACGGGTCAG AAGCGCGGCC GCAGACCACT ACCGCAGAAC CTGCCGCGCG AGCGCGTCGA GTATGACCTT CCCGACGGTC AGAAGGCTTG TCCTTGCTGC CGTGGGCAGA TGCACCGCAT GGGCGAGGCC GTTACCGAGC AACTTCATAT CGAAGTGAAG GCGAAGGTCT TACAGAATGT GCGGTTCAAA TATGCATGCC GCCATTGCGA TCGCACCGGG ATCAACACGC CTGTCGTGAC CGCGCCGATG CCGGTGCAGC CCTTGCCGGG CAGCATCGCC ACGGCCTCGA CACTGGCCTT CGCACTCGTT CACAAGTACG TCGACGGCAC ACCGCTCTAC CGTCTGGCGC AGACCTTCGA ACGCGCCGGT GTTCCTGTCA GCCGTGGTGC TCTCGGTCAC TGGGTGATCG GTTCGAGCGA GAAGCATCTG CATCGCATCT ATGACGCGCT GAAACTGCGG CTCAGGTCGC AATCCCTCAT CCATGGCGAC GAGACGACCG TTCAGGTCCT GAAGGAAAAG GGCAAAGAGG CCACCAGCAC ATCGTATATG TGGGCCTATC GGAGCGGCGA CGACAGTGAC GAGCCGATCG TGCTTCTCGA TTATCAGCCG GGCCGCGGCC AGATTCACCC CCAGACCTTC CTCGGCGATT ATCGCGGCAT ATTGATGAGC GATGGCTACA CCGCCTGGCG CACGCTGGAT GGCGCGATCC ATATCGGATG CATGGCCCAT TCGAGGCGAC GCTTCGTCGA TGCCCTCAAA GCCAGAAAGA AAGGCGGCGG TCCGCCCGAG CAGGCGCTCC GGTTCTTCGA ACAACTCTAT CGGGTTGAAA GGCAAGCACG AGACAAGAAG CCTGATGCCG GCGAAACGAA GGCCGACAGC GTTCGCCGAT TCCGACAGCA ACACAGCATA CCCGTCCTGA CCGCTCTCAA GGTATGGCTC GACGAAATCG CGCCGAAGGT TATGCCGGAC ACCAAGCTGG GCGATGCTGT CTCCTACACC CTGAACCAGT GGGATTATCT GACACGCTAC AAAGACGATG GCAGGATGCC GATCGACAAC AACATCTTGG AGCGCGACAT CAGAGTTTTT GCGACCGGCA GAAAGAGTTG GCTGTTCAGC GATAGCACCG ACGGAGCCAA GGCCAGCGCC GTCATCTACA GCCTCATGCT GACCTGCCGT GCCTGCGGCG TCGAGCCGTT GACGTGGTTG CGCCACGTCC TTACCGAATT GCCTCAACGC GCGGGCGATG CTGATATCGA CGACCTGTTG CCCTTCGACC TCACACAGAC TGCCACCGCC TGA
|
Protein sequence | MNRTGEPSVA ELMAQLAANA AEIAALKAEK ETLSQRVVKL EEELALARLH RFAPRSEKHV DRIFNEAEEA ADEDGADSHE SEVIDIPDTG LPPVESTTGQ KRGRRPLPQN LPRERVEYDL PDGQKACPCC RGQMHRMGEA VTEQLHIEVK AKVLQNVRFK YACRHCDRTG INTPVVTAPM PVQPLPGSIA TASTLAFALV HKYVDGTPLY RLAQTFERAG VPVSRGALGH WVIGSSEKHL HRIYDALKLR LRSQSLIHGD ETTVQVLKEK GKEATSTSYM WAYRSGDDSD EPIVLLDYQP GRGQIHPQTF LGDYRGILMS DGYTAWRTLD GAIHIGCMAH SRRRFVDALK ARKKGGGPPE QALRFFEQLY RVERQARDKK PDAGETKADS VRRFRQQHSI PVLTALKVWL DEIAPKVMPD TKLGDAVSYT LNQWDYLTRY KDDGRMPIDN NILERDIRVF ATGRKSWLFS DSTDGAKASA VIYSLMLTCR ACGVEPLTWL RHVLTELPQR AGDADIDDLL PFDLTQTATA
|
| |