Gene Information |
Locus tag | Smed_3976 |
Symbol | |
ID | 5318109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 427192 |
End bp | 428202 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640775785 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001312718 |
Protein GI | 150376122 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
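The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention, but the listed values are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(427192, 428202)
print(length_bp)                           # 1011
print(expected_protein_length(length_bp))  # 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields. The strand field does not affect this check, because the length is the same on either strand.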
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGTGG GCTCCTGCTC ATTGTATCCA GTCACAGATA GCTGCGAATC CGCAGCAAGA TCAGGCCGCG TAGCGCGCCA AACCGGAAAT CGGAGACCCA CACCGATGGC CCTTCATCAG GAATCGAATG CCGTGGATTC ACAAGGCCCG ATCATCAGAT CGATCACTTC CGGCGATATT TCGGTGACCA GAGTCTGTTC CGACCGGTAC GATCTGAAAG CCGCACCGGC GACACCCTGT GCCGAAGCAT TCAGTGTGAT CGTTCAACTG CGTGACTTTG AAGCCCATCG ACTGTGGAAG GCTGGCAAGC TCGTTTACGA AGGCGGGCAC TCCAGGGCAT CTCTGGCGAT CACCGATCTG CGCGATCGCT GGCAGTGTCA TCATCTTTCG CCATTCGACA ATATTCGCTT TCAAATTCCA TTTGCCCGAA TGAGCGCCTT TGCGAAGCAG GCCGGGCGGA ACGAATATCT GGGGCTTGCC TCTGTGCAGG GGCGCATCGA TCCCGTGATG TACGGACTTG CCCAGGCGCT GTTGCCGTCG CTCGAAAGTC CCGAAACTGC CAGTTCCCTT TTTCTGGAGC AAATCAACCT TGCCGTGCTG GCGCATCTGA GCCAGACATA TGGCGGGCTG CACTTTCCGA TCGGCAAAAA GGGGACCCTT GCGCCCTGGC AGGAAAGACT TGCGACGGAA TTTCTCGCGA ACCACTTCAA TAAACCGTTT TCGCTTGGAG AGTTGGCACG CCTTTGCGAG CTGTCGCGCA GCTATTTCAA TAAGGCTTTC AAGGAGAGTT TCGGGCGCAC GCCATCCAGG TGGCTGAGCG AATATCGGAC CGGGCGCGTG AAGGAGCTTT TGCTTCAGGA TGTGCCGATC GCGGAAGCCG CAATCGCCTG CGGTTTCGCG GATCAGAGCC ATCTGACGCG GGTCTTCACT GGTCTGACGG GCGAGACGCC TGCGCGCTAC AGACGGAAGA ACCGGTGCGC GCGGCCAGCA ATGGAGCAGC TCTCCGGCTG A
|
Protein sequence | MVVGSCSLYP VTDSCESAAR SGRVARQTGN RRPTPMALHQ ESNAVDSQGP IIRSITSGDI SVTRVCSDRY DLKAAPATPC AEAFSVIVQL RDFEAHRLWK AGKLVYEGGH SRASLAITDL RDRWQCHHLS PFDNIRFQIP FARMSAFAKQ AGRNEYLGLA SVQGRIDPVM YGLAQALLPS LESPETASSL FLEQINLAVL AHLSQTYGGL HFPIGKKGTL APWQERLATE FLANHFNKPF SLGELARLCE LSRSYFNKAF KESFGRTPSR WLSEYRTGRV KELLLQDVPI AEAAIACGFA DQSHLTRVFT GLTGETPARY RRKNRCARPA MEQLSG
|
| |
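The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch, using only the Python standard library, of how such a field could be cleaned, its GC content computed, and its codons translated. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA). The codon-to-amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(field: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence field above.
fragment = clean_sequence("ATGGTAGTGG GCTCCTGCTC ATTGTATCCA")
print(translate(fragment))  # MVVGSCSLYP
```

The translated fragment matches the first block of the protein sequence field. Running the same functions on the full gene sequence field would allow the 58% GC content and the 336-residue protein to be checked against the record.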