Gene Information |
Locus tag | SeAg_B3998 |
Symbol | |
ID | 6792459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 3897061 |
End bp | 3899859 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642778115 |
Product | sigma-54 dependent transcription regulator |
Protein accession | YP_002148709 |
Protein GI | 197249675 |
COG category | [K] Transcription |
COG ID | [COG3933] Transcriptional antiterminator |
TIGRFAM ID | |
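
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the protein length excludes the stop codon. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no amino acid
    return gene_len, protein_len


# Values taken from the "Start bp" and "End bp" fields of this record.
gene_len, protein_len = cds_lengths(3897061, 3899859)
print(gene_len, protein_len)  # 2799 932
```

Under these assumptions the result matches the listed gene length (2799 bp) and protein length (932 aa).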
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGTA TTGAGATCGT ACTGGGAGAG CTGGAACGGC TGACGCGCGG GCTATGTCTT GCCGATTTGG CGCAGGAGAC GGCGTTTACG GCGGAGGCGA TAGGTTTCAA TCTTGGGCTG GCGCGTAACT CCGTCAGCAA AGATCTCAAT CAGTTATGGA ATGACGGCCT GGCAATCAAA AGCCGTGGCC GCCCGGTCTA TTTTCTGCAT CGCCAGGCGC TGGAAATGTT GCTGGGACGA CAGCTGGAAG AGTCTGAACG CGAGGTGCGG TCGGTGGCGG ATGTGCTGCC GCATGAAGAG CATTATGCGC CCGACGATCC GTTTACCAGC CTGATTGGTT ACGATCGCAG CCTGCGCGAT GCGGTAGAAA AAGGCCGTGC GGCGGTGCTC TATCCGCACG GTTTACACGT TCTGCTTACC GGGCCGTCCG GCGTCGGTAA AACCTTTTTT GCGGAACTGA TGCACCGTTT CGCCTGTGAA CAGGCGAGCG GCGCTATCCC GCCGCTGGTC TACTTTAACT GCGCGGAATA CGCCCATAAC CCGGAACTGC TCTCCTCGCA TCTGTTTGGT CATCGGCAGG GGGCGTTTAC CGGCGCGAAT GAACATAAAA CAGGCCTCGT GGAGCAGGCG GACGGTGGTT ATCTGCTGCT GGACGAGGTG CACCGTCTGT CTTATGAAGG GCAGGAAAAG CTGTTCTCTA TTCTGGATAA AGGCGAGTAC CGTCCGCTTG GCGTGAGCAG CCAGCCGCGA TCAATTTCGG TACGCCTGAT TTGCGCCACT ACCGAGCCGG TCGGGTCGGC GCTGTTACGT ACTTTCCAGC GGCGTATTCA GGTATGCATT GATTTGCCGG GCATTCGCCA GCGCTCCGTT GAAGAACAGA TCGAACTGAT CGTGGGGTTC TTACAGCGGG AAAGCCGCAA AATAGAACGT ACGGTCAGCA TTGATAAACC GTTGTTGCTC TGGTTGCTGA ATAAACCCCT GGAAGGCAAT ATCGGTCAGC TAAAAAGCGA TATTCAGTTT CTTTGCGCTC AGGCGTGGGC ATCGGGAATG ACCGAGCATA ACGACACGCT ACAGCTGGAT AAGCGGCTGG CGGAGATGTC GGTTAACCCG ACGCCGGAAC AGCGTCTATT GGTGGATACT CTGTTTGACG GTAAAGCGCG GCTAAATATA GACGCGCGCA CGCTGCCCGC ATTGAAGACG TCGCTGGCGA CCGGGGCGGA AATTGAAGAG AGCGACCTCT TTTACAGCTT CCTGACGCGC GAGTATGTTA ATTTGCGCAA CAGTAACGTC CCCCCGGCGG AGACGCTGGC GATCCTGAAA AATAAACTCA GCTCGATTTT TGAATACGGT CTCTACAGCC GCGACAGCGT GGCGCATCCG CCGCGCTATG GCGACCAGAT TGAAGAGCGC GTGACGCTGC TGATTGGCTG CGTAGAGCAG GTGTTGGGAT TTTCGCTGCC GGAAAATCTG GTTAACCCAC TGCGTAAACA CTTCCTGGCG CTGATAGGCT ACGTGCAGCG CGGCCTGATC CCGCAGCTTT ACTCTTCCAG TTTGATCCTG GATCGCTGCA AAGACGAATA TGACAACGCC ACGCTGCTGT GTCGGAAAAT CAACGAACTG CTGCATATTC AGTGTCCGGC GACGGAAGTG GTCTGGCTAT GCCTGTTCCT GAAAGAGTGC CGCCATTATC GTCAGCGTAT CGATGCCAGC CCCGACTGCG GGGTCATTTT GATCGCCCAC GGCGCGACCA CCGCCACCAG CCAGGCGCAA TATGTCAACC GCGTGCTGGA GCGCGAGCTG 
TTCAGCGCCA TCGACATGCC GTTTGAACAG TCGGTGCATG ACACGCTGGA AACGCTGACC CAAATGATTC AAACCCGCCA ATATCGGCGG CTTATCCTGT TGGTGGACAT CGGTTCATTG ATCCATTTCG GCAGTACGAT CAGTAAGTTA TTCCAGATAG ATGTTTTGCT CATGCCGAAC ATCACGCTGA CCAGTCTGCT GGAAGTCGGG CTGGATTTAA GCTATGAAAC CAGCGACTTA CCACAGCTGA CGGCGCTCCT GCAGAGTAAA AATATCCCCT GCCAGCTTTG TACGCCGCAG CAGGAGAACG GCGGCAAAGT GCTGGTCATC TCCTGTATTA CCGGCATGGG AACGGCGGAA AAAATCAAAA AGGTGCTGGA GGAGAGCTTT GGCGAACTGA TGTCGCAGGA CACCAGGATG GTGATCCTTG ATTATAACGA GGTACGTAGT CTGGAGCGCG TTCAGCAGGC ATTGAATGCC AGCGAGCGGC TGGCGGGGAT TGTCGGCACT TTCCAGCCGG GGCTGCCGGA TATTCCGTTT ATTTCGCTGG AAGAGCTTTT CTCCGAACAA GGGCCTGAAC TGGTGTTGAG CCTGTTAACG CCCGATCTGT CCAACGCTGA ACGCCGTCTG GAGATGGAGC GCAGCGCCAT GCGCTTTATC AGCGCGCTAA CGATGGAGAG CATCATCAAC CATATTTCCG TGCTTAACCC GCAGCGTATT CTGAAAGAGA TGGAGGGCGT TTTTAACCAT CTGACGTCTT CGCTTTCCCT GAAACCAAGC CGCCAGGTGA CACTGCGCTT CCTGATCCAC TGCTGCTGTA TGGTAGAGCG TATTGTGATT AACCGTAAAC CGTTACAGAT GGCGCTGGAA AGTCAGCCGA ATCTGGACGC GCGCGCTTTT AGTGTCATCA AATCCGCTTT TCTGCCGATC GAAGACGCCT ACGCCATCCG TTTATCGGAT GCGGAATATT TTTATATCTA CGAACTGCTC TATAGCTAA
Protein sequence | MRRIEIVLGE LERLTRGLCL ADLAQETAFT AEAIGFNLGL ARNSVSKDLN QLWNDGLAIK SRGRPVYFLH RQALEMLLGR QLEESEREVR SVADVLPHEE HYAPDDPFTS LIGYDRSLRD AVEKGRAAVL YPHGLHVLLT GPSGVGKTFF AELMHRFACE QASGAIPPLV YFNCAEYAHN PELLSSHLFG HRQGAFTGAN EHKTGLVEQA DGGYLLLDEV HRLSYEGQEK LFSILDKGEY RPLGVSSQPR SISVRLICAT TEPVGSALLR TFQRRIQVCI DLPGIRQRSV EEQIELIVGF LQRESRKIER TVSIDKPLLL WLLNKPLEGN IGQLKSDIQF LCAQAWASGM TEHNDTLQLD KRLAEMSVNP TPEQRLLVDT LFDGKARLNI DARTLPALKT SLATGAEIEE SDLFYSFLTR EYVNLRNSNV PPAETLAILK NKLSSIFEYG LYSRDSVAHP PRYGDQIEER VTLLIGCVEQ VLGFSLPENL VNPLRKHFLA LIGYVQRGLI PQLYSSSLIL DRCKDEYDNA TLLCRKINEL LHIQCPATEV VWLCLFLKEC RHYRQRIDAS PDCGVILIAH GATTATSQAQ YVNRVLEREL FSAIDMPFEQ SVHDTLETLT QMIQTRQYRR LILLVDIGSL IHFGSTISKL FQIDVLLMPN ITLTSLLEVG LDLSYETSDL PQLTALLQSK NIPCQLCTPQ QENGGKVLVI SCITGMGTAE KIKKVLEESF GELMSQDTRM VILDYNEVRS LERVQQALNA SERLAGIVGT FQPGLPDIPF ISLEELFSEQ GPELVLSLLT PDLSNAERRL EMERSAMRFI SALTMESIIN HISVLNPQRI LKEMEGVFNH LTSSLSLKPS RQVTLRFLIH CCCMVERIVI NRKPLQMALE SQPNLDARAF SVIKSAFLPI EDAYAIRLSD AEYFYIYELL YS
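
The gene sequence above begins with ATG and ends with TAA, so it appears to be listed in coding orientation even though the gene lies on the minus strand. As a rough illustration of how the protein sequence relates to it, the sketch below strips the 10-character block spacing and translates codon by codon. The helper names `clean` and `translate` are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in its permitted start codons, and this sketch does not handle alternative starts.

```python
from itertools import product

# Standard codon assignments in TCAG order (table 11 uses the same ones).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final nine bases of the gene sequence in this record.
print(translate("ATGAGACGTA TTGAGATCGT ACTGGGAGAG"))  # MRRIEIVLGE
print(translate("TATAGCTAA"))  # YS
```

The two fragments reproduce the first block (MRRIEIVLGE) and the last two residues (YS) of the listed protein sequence.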