Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4643 |
Symbol | |
ID | 6484311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4532298 |
End bp | 4533188 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642739865 |
Product | DNA-binding transcriptional regulator MelR |
Protein accession | YP_002043547 |
Protein GI | 194444626 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCAACG GCGATGAAAA GCAAACCCGT AGCCCGCTGT CGCTCTATTC CGAATATCAA CGACTGGACG TTGAACTGAG GCCGCCGCAC AGAATGGCCA GCAGTCACTG GCATGGGCAA GTAGAAGTGA ATGTTCCGTT TGACGGCGAT GTGGAGTATT TAATCAACAA TGAAGTCGTG CAGATAAAGC AGGGGCATAT CACCCTGTTT TGGGCCTGTA CGCCACACCA GCTTACCCGC CCCGGCAACT GCCGCCAGAT GGCGATTTTC AGTTTGCCGA TGCACCTGTT TCTCTCCTGG CCGCTGGATC GCGATCTTAT CAACCACGTC ACGCACGGGA TGGTGGTTAA ATCACTGGCG ACCCAGCAGC TTAGCACCTT TGAAGTGTTG CGCTGGCAGC AGGAAACAAG CAGCCCGAAT GAGCAAATTC GCCAGTTGGC GATCGATGAA ATCGGCCTGA TGCTTAAACG CTTTAGCCTT TCCGGCTGGC AGCCCATTTT GCTCAATAAA ACATCACGCA CCCACAAGAA TAGCGTCTCA CGCCATGCGC AGTTTTACGT AAGCCAGATG CTGGGATTTA TTGCCGATAA CTACGATCAG GCGCTCACCA TTAACGACGT CGCGGAGCAT GTCAAACTCA ATGCTAATTA CGCGATGGGA ATATTCCAGC GGGTTATGCA ATTGACGATG AAGCAGTACA TCACGGCGAT GCGCATCAAT CACGTACGTG CCTTATTGAG CGATACTGAC AAAACGATCC TCGATGTCGC TCTGACTGCC GGGTTCCGAT CCAGCAGCCG TTTTTACAGT ACTTTTAGCA AATTTGTCGG TATGTCGCCG CAACAATACC GCAAGCTAAG CCAGCAACGA CGCCAGACGA TGCCCGGCTA A
|
Protein sequence | MCNGDEKQTR SPLSLYSEYQ RLDVELRPPH RMASSHWHGQ VEVNVPFDGD VEYLINNEVV QIKQGHITLF WACTPHQLTR PGNCRQMAIF SLPMHLFLSW PLDRDLINHV THGMVVKSLA TQQLSTFEVL RWQQETSSPN EQIRQLAIDE IGLMLKRFSL SGWQPILLNK TSRTHKNSVS RHAQFYVSQM LGFIADNYDQ ALTINDVAEH VKLNANYAMG IFQRVMQLTM KQYITAMRIN HVRALLSDTD KTILDVALTA GFRSSSRFYS TFSKFVGMSP QQYRKLSQQR RQTMPG
|
| |