Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4052 |
Symbol | |
ID | 6484957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3941909 |
End bp | 3944707 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739310 |
Product | sigma-54 dependent transcription regulator |
Protein accession | YP_002043019 |
Protein GI | 194443050 |
COG category | [K] Transcription |
COG ID | [COG3933] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.0716366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGTA TTGAGATCGT ACTGGGAGAG CTGGAACGGC TGACGCGCGG GCTATGCCTT GCCGATTTGG CGCAGGAGAC GGCGTTTACG GCGGAGGCGA TAGGTTTCAA TCTTGGGCTG GCGCGTAACT CCGTCAGCAA AGATCTCAAT CAGTTATGGA ATGACGGCCT GGCAATCAAA AGCCGTGGCC GCCCGGTCTA TTTTCTGCAT CGCCAGGCGC TGGAAACGTT GCTGGGACGA CAGCTGGAAG AGTCTGAACG CGAAGTGCGG TCGGTGGCGG ATGTGCTGCC GCATGAAGAG CATTACGCGC CCGACGATCC GTTTACCAGC CTGATTGGTT ACGATCGCAG CCTGCGCGAT GCGGTAGAAA AAGGCCGTGC GGCGGTGCTC TATCCGCACG GTTTACACGT TCTGCTTACC GGGCCGTCCG GCGTCGGTAA AACCTTTTTT GCGGAACTGA TGCACCGTTT CGCCTGTGAA CAGGCGAGCG GCGCTATCCC GCCGCTGGTC TACTTTAATT GCGCGGAATA CGCCCATAAC CCGGAACTGC TCTCCTCGCA TCTGTTTGGT CATCGGCAGG GGGCGTTTAC CGGCGCGAAT GAACATAAAA CAGGCCTCGT GGAGCAGGCG GACGGTGGTT ATCTGCTGTT GGACGAGGTG CACCGTCTGT CTTATGAAGG GCAGGAAAAG CTGTTCTCTA TTCTGGATAA AGGCGAGTAC CGTCCGCTTG GCGTGAGCAG CCAGCCGCGA TCAATTTCGG TACGCCTGAT TTGCGCCACT ACCGAGCCGG TCGGGTCGGC GCTGTTACGT ACTTTCCAGC GGCGTATTCA GGTATGCATT GATTTGCCGG GCATTCGCCA GCGCTCCGTT GAAGAACAGA TCGAACTGAT CGTGGGGTTC TTACAGCGGG AAAGCCGCAA AATAGAACGT ACGGTCAGCA TTGATAAACC GTTGTTGCTC TGGTTGCTGA ATAAACCCCT TGAAGGCAAT ATCGGTCAGC TAAAAAGCGA TATTCAGTTT CTCTGCGCTC AGGCGTGGGC ATCGGGAATG ACCGAGCATA ACGACACGCT ACAGCTGGAT AAGCGGCTGG CGGAGATGTC GATTAACCCG ACGCCGGAAC AGCGTCTATT GGTGGATACT CTGTTTGAGG GTAAAGCGCG GCTAAACATA GACGCGCGCA CGCTGCCCGC ATTGAAGACG TCGCTGGCGA TCGGGGCGGA AATTGAAGAG AGCGACCTCT TTTACAGCTT CCTGACGCGC GAGTATGTTA ATTTGCGCAA CAGTAACGTC CCCCCGGCGG AGACGCTGGC GATCCTGAAA AATAAACTCA GCTCGATTTT TGAATACGGT CTCTACAGCC GCGACAGCGT GGCGCATCCG CCGCGCTATG GCGACCAGAT TGAAGAGCGC GTGACGCTGC TGATAGGCTG CGTAGAGCAG GTGTTGGGTT TTTCGCTGCC GGAAAATCTG GTTAACCCAC TGCGTAAACA CTTCCTGGCG CTGATAGGCT ACGTGCAGCG CGGCCTGATC CCGCAGCTTT ACTCTTCCAG TTTGATCCTG GATCGCTGCA AAGACGAATA TGACAACGCC ACGCTGCTGT GTCGGAAAAT CAACGAACTG CTGCATATTC AGTGTCCGGC GACGGAAGTG GTCTGGCTAT GCCTGTTCCT GAAAGAGTGC CGCCATTATC GTCAGCGCAT CGATGCCAGC CCCGACTGCG GGGTCATTTT GATCGCCCAC GGCGCGACCA CCGCCACCAG TCAGGCGCAA TATGTCAACC GCGTGCTGGA GCGCGAGTTG TTCAGCGCCA TCGACATGCC GTTTGAACAG TCGGTGCATG ACACGCTGGA AACGCTGACC CAAATGATTC AAACCCGCCA ATATCGGCGG CTTATCCTGT TGGTGGACAT CGGTTCATTG ATCCATTTCG GCAGCACCAT CAGTAAATTA TTCCAGATAG ATGTTTTGCT CATGCCGAAC ATCACGCTGA CCAGTCTACT GGAAGTCGGG CTGGATTTAA GCTATGAAAC CAGCGACTTA CCACAGTTGA CGGCGCTCCT GCAGAGTAAA AATATCCCCT GCCAGCTTTG TACGCCGCAG CAGGAGAACG GCGGCAAAGT GCTGGTCATC TCCTGTATTA CCGGCATGGG AACGGCGGAA AAAATCAAAA AGGTGCTGGA GGAGAGCTTT GGCGAACTGA TGTCGCAGGA CACCAGGATG GTGATCCTTG ATTATAACGA GGTACGTAGT CTGGAGCGCG TTCAGCAGGC ATTGAATGCC AGCGAGCGGC TGGCGGGGAT TGTCGGCACT TTCCAGCCGG GGCTGCCGGA TATTCCGTTT ATTTCGCTGG AAGAGCTTTT CTCCGAACAA GGGCCGGAAC TGGTGTTGAG CCTGTTAACG CCCGATCTGT CCAACGCTGA ACGCCGTCTG GAGATGGAGC GCAGCGCCAT GCGCTTTATC AGCGCGCTAA CGATGGAGAG TATCATCAAC CATATTTCCG TGCTTAACCC GCAGCGTATT CTGAAAGAGA TGGAGGGCGT TTTTAACCAT CTGACGTCTT CGCTTTCCCT GAAACCAAGC CGCCAGGTGA CACTGCGCTT CCTGATCCAC TGCTGCTGTA TGGTAGAGCG TATTGTGATT AACCGAAAAC CGTTACAGAT GGCGTTGGAA AGTCAGCCGA ATCTGGACGC GCGCGCGTTT AGTGTCATCA AATCCGCTTT TCTGCCGATC GAAGACGCCT ACGCCATCCG TTTATCGGAT GCGGAATATT TTTATATCTA CGAACTGCTC TATAGTTAA
|
Protein sequence | MRRIEIVLGE LERLTRGLCL ADLAQETAFT AEAIGFNLGL ARNSVSKDLN QLWNDGLAIK SRGRPVYFLH RQALETLLGR QLEESEREVR SVADVLPHEE HYAPDDPFTS LIGYDRSLRD AVEKGRAAVL YPHGLHVLLT GPSGVGKTFF AELMHRFACE QASGAIPPLV YFNCAEYAHN PELLSSHLFG HRQGAFTGAN EHKTGLVEQA DGGYLLLDEV HRLSYEGQEK LFSILDKGEY RPLGVSSQPR SISVRLICAT TEPVGSALLR TFQRRIQVCI DLPGIRQRSV EEQIELIVGF LQRESRKIER TVSIDKPLLL WLLNKPLEGN IGQLKSDIQF LCAQAWASGM TEHNDTLQLD KRLAEMSINP TPEQRLLVDT LFEGKARLNI DARTLPALKT SLAIGAEIEE SDLFYSFLTR EYVNLRNSNV PPAETLAILK NKLSSIFEYG LYSRDSVAHP PRYGDQIEER VTLLIGCVEQ VLGFSLPENL VNPLRKHFLA LIGYVQRGLI PQLYSSSLIL DRCKDEYDNA TLLCRKINEL LHIQCPATEV VWLCLFLKEC RHYRQRIDAS PDCGVILIAH GATTATSQAQ YVNRVLEREL FSAIDMPFEQ SVHDTLETLT QMIQTRQYRR LILLVDIGSL IHFGSTISKL FQIDVLLMPN ITLTSLLEVG LDLSYETSDL PQLTALLQSK NIPCQLCTPQ QENGGKVLVI SCITGMGTAE KIKKVLEESF GELMSQDTRM VILDYNEVRS LERVQQALNA SERLAGIVGT FQPGLPDIPF ISLEELFSEQ GPELVLSLLT PDLSNAERRL EMERSAMRFI SALTMESIIN HISVLNPQRI LKEMEGVFNH LTSSLSLKPS RQVTLRFLIH CCCMVERIVI NRKPLQMALE SQPNLDARAF SVIKSAFLPI EDAYAIRLSD AEYFYIYELL YS
|
| |