Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2213 |
Symbol | |
ID | 5323073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2291664 |
End bp | 2293502 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640791150 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001327880 |
Protein GI | 150397413 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.226859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.3766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGTGC CGCATTTTCA TCTGCAGACA TTCGGGGAAT TGCGGCTGGT CGATTCCGGC GGCGTGGTCG TGGCGTGCCC GGAGCGGGGC CTGCTTATTC TGGCCTACCT CAGCATATCC GGGCAGAGAA CGGTCTCCCG CGAAAAGCTC TCGATGCTGA TCTGGCCCGA GCGGGAGAAG AGCGTCGCTT TCAAGAACCT GCGCTCTACC CTGTGGCGCA TGCGCCACCT TGCCGCGAAC TCGGGGCCGC TCACGTCGGC GACGGAAATA GATGTGACGC TGGGGGAGAT TGCCTGCGAC GCCGCCGGTT TCGAGGCGCT CGGGAGCTCG GAGGAAGCGT TTCGCGCCGG GCTCTCGCTC TGGCGCGATA CGTTCCTAGC AGGCGCCGAT GTGCCTCCCG GGCGTTATGC GGACTGGGTT GGAGAGCAGC GGAGCGTGTT TACGCAAAAG CTGCGCGCGG CACTTTTAGC CGGCGGAGAG CGATTTGCCG ACAGGTCGAT GCTGCGGACG GCGTGCATGC ATCTCCTCGA ACTCGATCCG GCGGATACTG CCGTGCGCGC AATCCTGGAG CGGTTGACCG GCGTGGGTCT CGTCATCGGG AAAGGCTCGC CGGACGGGGC CTGGCTGCGG CAGGGGGCCT TTGCAGGGGA GTCGCCCCGG GCGCCGGCAA GGGATGCGAT AACCCCGCCG GTTGCGCCCG CACCGCGCGT TGCGCTGCTT CCGCCCGCGG CTTTCGGCAG TGATCCGATG CTTCACGCGG TCAGCATGGC CGTGGTCGAA GACATCACGA TCGGATTATG CGCGCTGCGC TCGATGTCGG TTGTTGCCCC TTACACTGCC GAGCGTATTC GGGATGCGGC GGACAAGGCT GCCTTCCTGG AAAAACATGC CGTCACTTAT GCGCTCGACT GTCGCATGTC GGACCAGGGG CTGTTTACTC AACTCATCTT CCTGCCGTCC GACGCGATCG TCTGGGCGGA GCGCTTTTCG ATCTCTCCGG TCGGGCTCCT GCAACAGCGT CAGGAAATTG CCTTTCACGT GGCCAGCGCG GTAGCGGAAC AGGTGGAGAC CGGGCGTATC GGGCACCTCG ATTACGTGAC CAATCCGGAC GCCTATTACG CCTATCTCGC CGGCCTGCGA AACCTTTCGA ACGTCGGCCT TCCCGAAATC CGCAGGGCAC GCCGCGATTT CAAGACGGCG CTCAGGCACA AGCCGGATTT TGCCCCGGCG CTAAGCGGCA TCGCACGCAC CTATGCGATC GAGTGGGTCC TGACGGCTCG GGGCGACCAG GAGCTTTTGA CGGTAGCCGA GCGTCACGCC AAGGAAGCGA TCGAGAGCGA TGAAGAGCTT CCCGGAGCAC ATCGCGAATT CGGCGTCGTC AGACTTTATC AGGGCGACCT CGATGGAAGT CTTGCGGCGC TGGACCGCGC CGAAAATCTG AGCCCGCATT ATGCCGACGT GCTGTACAGC CATGCGGACA CGCTCGTCCA TGCCTCCCGC CCGCGTGAGG CTCTCGACAA GCTCGGCAAG GCGCTTTCGC TTAATCCCCT GGCGCCGGAC ATGTATTTCT GGAGCGCTGC CGGAGCCAGC TACTTTCTCG AAGAGTATGA GGATGCGATC GGCTACGTTC AGAAAATGAA GGACAAGTCG CCGGGCGACC GGCTGCTTGC AGCAAGCTGG GCGATGCTCG GCGATCAGAA GAAAGCGCGG TCATACAAGG TGAAGGCCCT GAGGGCCAAT CCGACATTCG ATGTCGACAA GTGGCTCGCC GTCGTTCCGA TGAAGGAACA ATGGCAGAAG GACCTCTATC GCGAGGGGCT GAAAAGAGCG GGGTTTTGA
|
Protein sequence | MAVPHFHLQT FGELRLVDSG GVVVACPERG LLILAYLSIS GQRTVSREKL SMLIWPEREK SVAFKNLRST LWRMRHLAAN SGPLTSATEI DVTLGEIACD AAGFEALGSS EEAFRAGLSL WRDTFLAGAD VPPGRYADWV GEQRSVFTQK LRAALLAGGE RFADRSMLRT ACMHLLELDP ADTAVRAILE RLTGVGLVIG KGSPDGAWLR QGAFAGESPR APARDAITPP VAPAPRVALL PPAAFGSDPM LHAVSMAVVE DITIGLCALR SMSVVAPYTA ERIRDAADKA AFLEKHAVTY ALDCRMSDQG LFTQLIFLPS DAIVWAERFS ISPVGLLQQR QEIAFHVASA VAEQVETGRI GHLDYVTNPD AYYAYLAGLR NLSNVGLPEI RRARRDFKTA LRHKPDFAPA LSGIARTYAI EWVLTARGDQ ELLTVAERHA KEAIESDEEL PGAHREFGVV RLYQGDLDGS LAALDRAENL SPHYADVLYS HADTLVHASR PREALDKLGK ALSLNPLAPD MYFWSAAGAS YFLEEYEDAI GYVQKMKDKS PGDRLLAASW AMLGDQKKAR SYKVKALRAN PTFDVDKWLA VVPMKEQWQK DLYREGLKRA GF
|
| |