Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3431 |
Symbol | |
ID | 6486425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3328646 |
End bp | 3329512 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642738720 |
Product | transcription activator, effector binding |
Protein accession | YP_002042440 |
Protein GI | 194446792 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins [COG3449] DNA gyrase inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGACC TGATCAGCGC GGCTTATTCC GAACGTCTGC GGCGCGTTTG CGACCACATT GAGCGTCATC TGGATGAGCC TCTGTCGATT GAGGCGTTAA GCCGGATGGC GCACAGCTCG CCGTTTCATT TTCACCGGCA GTTTACCACC TGGAGCGGGC TTCCGCTCTA TCGCTATATC CAGTGGTTGC GTTTGCGCCG CGCGTCATGG CGGCTGGCGT TTAACCCGCA GGATAAAGTT ATCGATATCG CACTGGATGC CGGGTTTCAG AATCCGGAGT CGTTCACACG TGCTTTCAAA ACCGCGTTTG GTCAAAGCCC GCGCCGGTTT CGGCAATCGC CGGACTGGCT GGCCTGGCAC CAGCGCGTTC CTAAACTTGC GTTACAGGAG CAGCATGTCA TGGACGTAAA AATCGTTGAA TTTCCACCTA CCCGCGTCGC GATGCTGACG CATCTGGGGC ATCCGGATAA GGTCAACGCC AGCGCGGCGA AATTTATTGC ATGGCGGCGC GAAACGGGGC AGTCGCCGAT TGCCAGCAGT CAAACTTTTG GTATTGCCTG GCACGACCCG CAGACTACGC CGCCGGATCA ATTCCGCTTT GATATCTGCG GCAGCGTCCA CCAGCCGATA GCCGAAAATG ACGTTGGCGT GGTGAATAGC GAGATTCCTG GCGGGCGCTG CGCGGTGGTA CGCCATCAGG GATCGCTCGA CAGCCTCCCG GAGAGCGTGT GGTATTTATT CCGCGAATGG CTACCGGCCA GCGGCGAGAC GCTGCGCGAT TTTCCGGTCT TCTTCCAGTA CCTGAATTTC GTCAATGAGG TGGCGGAACA TGAGCTGCTG ACGGATATCT ATCTGCCGCT TCGTTAA
|
Protein sequence | MNDLISAAYS ERLRRVCDHI ERHLDEPLSI EALSRMAHSS PFHFHRQFTT WSGLPLYRYI QWLRLRRASW RLAFNPQDKV IDIALDAGFQ NPESFTRAFK TAFGQSPRRF RQSPDWLAWH QRVPKLALQE QHVMDVKIVE FPPTRVAMLT HLGHPDKVNA SAAKFIAWRR ETGQSPIASS QTFGIAWHDP QTTPPDQFRF DICGSVHQPI AENDVGVVNS EIPGGRCAVV RHQGSLDSLP ESVWYLFREW LPASGETLRD FPVFFQYLNF VNEVAEHELL TDIYLPLR
|
| |