Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1794 |
Symbol | |
ID | 5322652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1876418 |
End bp | 1877434 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640790732 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001327464 |
Protein GI | 150396997 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.112719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00342809 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAGGTT TTATGCACAA ACCGCTGACG AAGAAGCGTT CGCTCGTCTT CTTTCTGGTG CCGCATTTTT CGATGCTACC CTTCTCGGCC GCGATCGAAA CACTGCGCAT TGCCAATCGC ATGCTCGGTT ACGAGGCCTA TACCTGGCGC CTCGCTTCGA CCGACGGCCA GAAGGTCTAT TCGTCGGCGG GCATTGCGCT CGAAGTCAAC ACCTCGCTTG CCGACGAGCG CAAATATCTC GGTGGAGAGA ACCGGCCCTC GATGGTTCTG GTCTGTTCGG GCGTCTATGT CGAAGACTTC CAGAACAAGT CGGTAAATGC TTGGCTGCGC GAGGCCTACA ACCGCGGCAT TGCGGTCGGC AGTCTCTGTA CCGGCGCCCA CGTGCTCGCT TCGGCGGGCC TGCTCACGGG CAAGCGCTGC GCGATCCATT GGGAAAACCT ACCAGGCTTC TCCGAGAGCT TCCCTCAGGC CGACGTATAT GCGGACCTCT ATGAGATCGA CGGCAACATC TATACCTGCG CCGGCGGCAC CGCTTCGCTC GACATGATGC TGAACCTGAT CGATCAGGAT TTCGGCGAGA ACCTCGTCAA CCGCGTCTGC GAGCAGGCGC TGACCGACCG CGTGCGCGGG CCGCACGACC GGCAGCGGCT GCCACTGAGA GCACGCCTTG GCGTTCAGAA CTCCAAGGTC CTTTCGATCA TCGAACTGAT GGAAAGCAAC CTTTCCGAAC CGCTGTCCCT CCTTGAGATA GCGGAGAGCG CCGACCTGTC CCGCCGTCAG ATCGAGCGGC TCTTCCGCCA GGAAATGGGA CGGTCGCCAG CCCGCTACTA TCTCGAGATT CGCCTCGATC GCGCCCGGCA TCTGCTTATC CAGTCTTCCA TGCCGGTCGT GGAAGTCGCT GTGGCCTGCG GTTTCGTATC CGCTTCACAT TTTTCGAAGT GTTATCGGGA GCTCTACAAC CGCTCGCCGC AACAGGAACG CGCCGAACGC AAGCTGACGT TACAGATGGC GCGGTAG
|
Protein sequence | MGGFMHKPLT KKRSLVFFLV PHFSMLPFSA AIETLRIANR MLGYEAYTWR LASTDGQKVY SSAGIALEVN TSLADERKYL GGENRPSMVL VCSGVYVEDF QNKSVNAWLR EAYNRGIAVG SLCTGAHVLA SAGLLTGKRC AIHWENLPGF SESFPQADVY ADLYEIDGNI YTCAGGTASL DMMLNLIDQD FGENLVNRVC EQALTDRVRG PHDRQRLPLR ARLGVQNSKV LSIIELMESN LSEPLSLLEI AESADLSRRQ IERLFRQEMG RSPARYYLEI RLDRARHLLI QSSMPVVEVA VACGFVSASH FSKCYRELYN RSPQQERAER KLTLQMAR
|
| |