Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4083 |
Symbol | |
ID | 5317902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 544377 |
End bp | 545402 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775890 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001312823 |
Protein GI | 150376227 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0474473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGG CAGCGCTGCG CTCGAAAGCT GAAATCGGTC TCATGCTTTA TCCGGGCTGC CAGATGGCGA TGGTTCACGG CATGACGGAC CTCATCAACA TAGCCCGTCA ATTCTCCGCC GAGCGGGGCG GGGCGGTGGC ACGGATCAGT CACTGGCGGC TTCGGGACGA CGGCTGTCTT GCGCGAAGTT TCGACACCCA TCCCGAACTC GGATCACGGG AGATGCCGGA GATACTGCTT GTCCCCGGCA GGTTGACCGG GCCGATGGAA GCGGAGGAGG CCGCGCCCTA TGCTCGCTGG CTGCTCGACG GGCATGCCCA AGGCGCGACC CTCGCATCGA CTTGCGGCGG GACCTTCGTG CTTGCGGCGA CGGGCCTGCT CAAGGGCCGG CCAGCGACGA CGCACTGGCT CTTCGCCAAT GCTTTCCGTG ACCGGTTTCC GGATGTCCGG CTCGACACGG ACAAGATCGT GATCGAGGAC GGCGACATCG TAACTGCCGG CGGCCTCATG GCATGGACAG ACCTCGGCAT GCGCCTCGTG GATCGTCTGT TCGGTCCGAC GGTAACGGTC GAGACCGGGC GTTTCCTTCT TATCGACCCC GCCGGACGCG AGCAGCGGCA CTATTCCAGC TTTTCGCCGC GCCTCAATCA CGGCGACGAA GTGATCCTGA AGGTGCAGCA TTGGCTGCAA GCGCGTGAGG CTCGCGCCGT CAGCGTTGCG CAGATGGCAA AAGTGGTGGC GATGGAGGAG CGTACTTTCC TGCGCCGTTT CAAGGCGGCG ACCGGAATGA AGCCGATCGA ATATGCCCAG CATCTGCGCG TCGGTAAGGC GCGGGAACTC CTGGAGTTCA CCAGACGTTC GGTTGAACAG GTCGCCTGGG CAGTCGGCTA CGAGGACGCG GCCGCCTTTC GCAAGCTGTT CCATAGGATC GTCGGGCTGT CGCGAGGTGA TTACCGGCAT CGATTCGCGG TAGCGGGCGA GGCCTTTTCG CGCCCCGTCA CCGGCTTGGA ACAAATTGAC GTTTGA
|
Protein sequence | MPKAALRSKA EIGLMLYPGC QMAMVHGMTD LINIARQFSA ERGGAVARIS HWRLRDDGCL ARSFDTHPEL GSREMPEILL VPGRLTGPME AEEAAPYARW LLDGHAQGAT LASTCGGTFV LAATGLLKGR PATTHWLFAN AFRDRFPDVR LDTDKIVIED GDIVTAGGLM AWTDLGMRLV DRLFGPTVTV ETGRFLLIDP AGREQRHYSS FSPRLNHGDE VILKVQHWLQ AREARAVSVA QMAKVVAMEE RTFLRRFKAA TGMKPIEYAQ HLRVGKAREL LEFTRRSVEQ VAWAVGYEDA AAFRKLFHRI VGLSRGDYRH RFAVAGEAFS RPVTGLEQID V
|
| |