Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0131 |
Symbol | |
ID | 5320960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 145040 |
End bp | 146011 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640789064 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001325826 |
Protein GI | 150395359 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.482202 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACATC CTGATGTGAT CGGATGCGAT ATTGCAAAAG CGCATCTAGA TTTTTTCGAC AGCGGCCTTG AGCGCCATTT CCGTATCGAC AACACTCCGG CCGCAATTTC CGCGTGGCTC GACGGCCTTG ATGGCAGAGG CGTTCATATC GTCTTTGAGG CGACCGGGCG TTACGATCGG CAGTTGCGCA TAGCCCTGGA GACCCGGGAG TTGCCCTATT CCCGCGTCAA TCCTGCCCGC GCCCGCGACT TTGCCAAGGC GATCGGCCTT CTTGCCAAGA CGGATGCGAT CGATGCACGT CTGCTTGCCC GGATGGGTCA AAGCCTGCCA CTCTCAACTC AGGCGCCTGA CGATCCCGCC CGCCACGTGC TCGCCCGCCT TCACACGCGG CGTGACCAGC TCGTGGCCAT GCGCCAGCAA GAGCGGACAC GCCTTCATGA GACCGAGGGG ATCGAGCGTG ACAGTGCTGA AAGCCATATG GCTTGGCTCG ACGCGGAGGT TGCGCGCATC GAAATGGCAT GCCGTGATGT TCTGAAGGCC GAGAAGACCT TGCAAGAACA AGAGGCAAGG CTGCGTTCCA TTCCCGGCAT CGGCCCCGTG GCCGCATTGA CCCTGATCGC GCATATGCCA GAACTCGGCA ATCGTTCTGC CAAGGCGATT GCAGCCCTTG CCGGTCTTGC GCCCTTCAAT GTCGACAGCG GCACGTCACG GGGAAAGCGG CATATACGCG GCGGTCGCAA GCGGATACGT GACGCGCTCT ACATGGCGGC GCTCACAGCC AGCCGTATGC CCCGTGCTTT TAAGTCCCAT GCTGACCAAA ATGAAGGAGG CAGGCAAGCC CTTCAAGGTC GTCATCATTG CGCTTGCCCG CAAATTGCTC GCCATCGCAA ACGCCATCAT CAGGGACAAA ACAACCTTCC GACGAACCAC CTGACAAACA CAGTTGCCAG TCAGTCCAAG TCCCAGCGTT GA
|
Protein sequence | MIHPDVIGCD IAKAHLDFFD SGLERHFRID NTPAAISAWL DGLDGRGVHI VFEATGRYDR QLRIALETRE LPYSRVNPAR ARDFAKAIGL LAKTDAIDAR LLARMGQSLP LSTQAPDDPA RHVLARLHTR RDQLVAMRQQ ERTRLHETEG IERDSAESHM AWLDAEVARI EMACRDVLKA EKTLQEQEAR LRSIPGIGPV AALTLIAHMP ELGNRSAKAI AALAGLAPFN VDSGTSRGKR HIRGGRKRIR DALYMAALTA SRMPRAFKSH ADQNEGGRQA LQGRHHCACP QIARHRKRHH QGQNNLPTNH LTNTVASQSK SQR
|
| |