Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1360 |
Symbol | dhaR |
ID | 6271835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1238520 |
End bp | 1240439 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725471 |
Product | DNA-binding transcriptional regulator DhaR |
Protein accession | YP_001879981 |
Protein GI | 187732464 |
COG category | [K] Transcription [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3284] Transcriptional activator of acetoin/glycerol metabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGCG CTTTTAACAA CGATGGTCGG GGCATATCTC CCTTAATTGC AACCTCCTGG GAGCGATGCA ATAAGCTGAT GAAACGGGAG ACATGGAGCG TACCACATCA GGCCCAGGGC GTGACATTTG CTTCTATTTA TCGGCGTAAG AAAGCGATGC TGACGCTCGG GCAGGCTGCG CTGGAAGATG CCTGGGAATA TATGGCACCG CGAGAGTGTG CGCTGTTTAT CCTCGATGAA ACCGCCTGCA TTCTCAGCCG TAATGGCGAT CCGCAAACCT TGCAGCAGCT AAGTGCACTT GGATTCCATG ACGGCACGTA TTGCGCCGAG GGAATTATTG GTACTTGTGC GCTATCGTTA GCGGCTATCT CTGGTCAGGC CGTGAAAACG ATGGCCGATC AACATTTCAA ACAGGCACTC TGGAACTGGG CCTTTTGTGC AACGCTGTTG TTTGACAGCA AGGGCCGATT GACGGGAACA ATAGCGCTGG CGTGTCCGGT TGAGCAAACT ACCGCAGCTG ATTTGCCGTT GACGTTGGCA ATCGCCCGCG AGGTCGGAAA TTTACTGCTG ACGGACAGTT TGCTCACTGA AACTAACCGT CATTTAAATC AACTTAATGC CCTGTTAGAA AGTATGGATG ATGGCGTGAT TAGCTGGGAC GAGCAGGGTA ATTTGCAATT TATCAATGCC CAGGCGGCGC GGGTCTTGCG CCTTGACGCG ACGGCAAGTC AGGGACGGGC AATCACTGAA CTCTTAACGT TACCCGCCGT ATTGCAACAA GCAATAAAAC AGGCACATCC GCTCAAACAC GTAGAAGCAA CCTTTGAAAG TCAGCATCAG TTTATTGATG CGGTGATAAC CCTTAAACCG ATAATAGAAA CGCAGGGAAC CAGCTTTATT TTGTTGCTCC ATCCTGTGGA ACAGATGCGG CAGTTGATGA CCAGTCAATT AGGAAAAGTC AGCCATACCT TTGCTCATAT GCCACAGGAC GATCCGCAAA CCCGCCGCTT GATTCATTTT GGTCGCCAGG CGGCGCGCAG TAGCTTTCCT GTCCTGCTTT GTGGAGAAGA GGGCGTGGGC AAGGCACTGC TAAGTCAGGC AATTCATAAT GAAAGCGAGC GTGCTGCAGG TCCTTATATC GCCGTCAATT GTGAGTTATA TGGTGATGCG GCACTGGCGG AAGAATTTAT TGGTGGCGAT CGTACGGACA ATGAAAATGG CCGTCTGAGT CGGCTGGAAC TGGCGCACGG CGGCACGCTG TTTCTTGAAA AGATTGAATA TCTGGCGGTG GAGTTACAGT CTGCTTTGCT TCAGGTTATC AAGCAGGGTG TTATCACGCG ACTGGATGCG CGGCGTTTAA TACCAATTGA TGTCAAAGTG ATTGCAACAA CGACCGCGGA CCTCGCAATG CTGGTGGAAC AAAATCGTTT TAGTCGCCAG CTGTATTACG CGCTGCATGC ATTTGAAATT ACCATCCCGC CTCTGCGTAT GCGGCGTGGC AGCATTCCGG CGCTGGTGAA TAACAAATTA CGCAGTCTTG AAAAACGCTT CTCTACGCGG CTGAAAATTG ATGACGATGC CCTCGCTCGC CTGGTTTCTT GTGCATGGCC AGGCAACGAT TTTGAACTTT ACAGCGTCAT CGAGAATCTT GCTCTGAGTA GTGATAATGG GCGCATTCGC GTCAGTGATT TGCCGGAACA TCTGTTTACC GAGCAGGCAA CAGATGATGT CAGCGCCACC CGCCTTTCCA CCAGTCTGTC ATTTGCGGAA GTTGAAAAAG AGGCAATTAT TAACGCAGCC CAGGTCACAG GCGGTCGCAT TCAGGAAATG TCGGCTTTAC TTGGAATCGG CCGCACTACG CTGTGGCGGA AAATGAAGCA ACATGGCATT GATGCAGGGC AGTTTAAGCG CCGGGTATGA
|
Protein sequence | MSGAFNNDGR GISPLIATSW ERCNKLMKRE TWSVPHQAQG VTFASIYRRK KAMLTLGQAA LEDAWEYMAP RECALFILDE TACILSRNGD PQTLQQLSAL GFHDGTYCAE GIIGTCALSL AAISGQAVKT MADQHFKQAL WNWAFCATLL FDSKGRLTGT IALACPVEQT TAADLPLTLA IAREVGNLLL TDSLLTETNR HLNQLNALLE SMDDGVISWD EQGNLQFINA QAARVLRLDA TASQGRAITE LLTLPAVLQQ AIKQAHPLKH VEATFESQHQ FIDAVITLKP IIETQGTSFI LLLHPVEQMR QLMTSQLGKV SHTFAHMPQD DPQTRRLIHF GRQAARSSFP VLLCGEEGVG KALLSQAIHN ESERAAGPYI AVNCELYGDA ALAEEFIGGD RTDNENGRLS RLELAHGGTL FLEKIEYLAV ELQSALLQVI KQGVITRLDA RRLIPIDVKV IATTTADLAM LVEQNRFSRQ LYYALHAFEI TIPPLRMRRG SIPALVNNKL RSLEKRFSTR LKIDDDALAR LVSCAWPGND FELYSVIENL ALSSDNGRIR VSDLPEHLFT EQATDDVSAT RLSTSLSFAE VEKEAIINAA QVTGGRIQEM SALLGIGRTT LWRKMKQHGI DAGQFKRRV
|
| |