Gene SeSA_A3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3980 
Symbol 
ID6518666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3853036 
End bp3855834 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content55% 
IMG OID642748952 
Productsigma-54 dependent transcription regulator 
Protein accessionYP_002116714 
Protein GI194737698 
COG category[K] Transcription 
COG ID[COG3933] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.436261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGTA TTGAGATCGT ACTGGGAGAG CTGGAACGGC TGACGCGCGG GCTATGTCTT 
GCCGATTTGG CGCAGGAGAC GGCGTTTACG GCGGAGGCGA TAGGTTTCAA TCTTGGGCTG
GCGCGTAACT CCGTCAGCAA AGATCTCAAT CAGTTATGGA ATGACGGCCT GGCAATCAAA
AGCCGTGGCC GCCCGGTCTA TTTTCTGCAT CGCCAGGCGC TGGAAACGTT GCTGGGACGA
CAGCTGGAAG AGTCTGAACG CGAGGTGCGA ACGGTGGCGG ATGTGCTACC GCATGAAGAG
CATTACGCGC CCGACGATCC GTTTACCAGC CTGATTGGTT ACGATCGTAG CCTGCGTGAT
GCGGTAGAAA AAGGCCGTGC GGCGGTACTC TATCCGCACG GTTTACACGT TCTGCTTACC
GGGCCGTCCG GCGTCGGTAA AACCTTTTTT GCGGAACTGA TGCACCGTTT CGCCTGTGAA
CAGGCGAGCG GCGTTATCCC GCCGCTGGTC TACTTTAACT GCGCGGAATA CGCCCATAAC
CCGGAACTGC TCTCCTCTCA TCTGTTTGGT CATCGGCAGG GGGCATTTAC CGGCGCGAAT
GAACATAAAA CAGGCCTCGT GGAGCAGGCG GACGGTGGTT ATCTGCTGCT GGACGAGGTG
CACCGTCTGT CTTATGAAGG GCAGGAAAAG CTGTTCTCTA TTCTGGATAA AGGCGAGTAC
CGTCCGCTCG GCGTGAGCAG CCAGCCGCGA TCAATTTCGG TACGCCTGAT TTGCGCCACT
ACCGAGCCGG TCGGGTCGGC GCTGTTACGT ACTTTCCAGC GGCGTATTCA GGTATGCATT
GATTTGCCGG GCATTCGCCA GCGCTCCGTT GAAGAACAGA TCGAACTGAT CGTGGGGTTC
TTACAGCGGG AAAGCCGCAA AATAGAACGC ACGGTCAGCA TTGATAAACC GTTGTTGCTC
TGGTTGCTGA ATAAACCCCT GGAAGGCAAT ATCGGTCAGC TAAAAAGCGA TATTCAGTTT
CTTTGCGCTC AGGCGTGGGC ATCGGGAATG ACCGAGCATA ACGACACGCT ACAGCTGGAT
AAACGGCTGG CGGAGATGTC GGTTAACCCG ACGCCGGAAC AGCGTCTATT GGTGGATACT
CTGTTTGACG GTAAAGCGCG GCTAAATATA GACGCGCGCA CGCTGCCCGC ATTGAAGACG
TCGCTGGCGA CCGGGGCGGA AATTGAAGAG AGCGACCTCT TTTACAGCTT CCTGACGCGC
GAATATGTTA ATTTGCGCAA CAGTAACGTC CCCCCGGCGG AGACGCTGGC GATCCTGAAA
AATAAACTCA GCTCGATTTT TGAATACGGT CTCTACAGCC GCGACAGCGT GGCGCATCCG
CCGCGCTATG GCGACCAGAT TGAAGAGCGC GTGACGCTGC TGATTGGCTG CGTAGAGCAG
GTGTTGGGCT TTTCGCTGCC GGAAAATCTG GTTAACCCAC TGCGTAAACA CTTCCTGGCG
CTGATAGGCT ACGTGCAGCG TGGCCTGATC CCGCAGCTTT ACTCTTCCAG TTTGATCCTG
GATCGCTGCA AAGACGAATA TGACAACGCC GCGCTGCTGT GTCGGAAAAT CAACGAACTG
CTGCATATTC AGTGTCCGGC GACGGAAGTG GTCTGGCTAT GCCTGTTCCT GAAAGAGTGC
CGCCATTATC GTCAGCGCAT CGATGCCAGC CCCGACTGCG GGGTCATTTT GATCGCTCAC
GGCGCGACCA CCGCCACCAG TCAGGCGCAA TATGTCAACC GCGTGCTGGA GCGCGAGCTG
TTCAGCGCCA TCGACATGCC GTTTGAACAG TCGGTGCATG ACACGCTGGA AACGCTGACC
CAAATGATTC AAACCCGCCA ATATCGGCGG CTTATCCTGT TGGTGGACAT CGGTTCATTG
ATCCATTTCG GCAGTACGAT CAGTAAGTTA TTCCAGATAG ATGTTTTGCT CATGCCGAAC
ATCACGCTGA CCAGTCTGCT GGAAGTCGGG CTGGATTTAA GCTATGAAAC CAGCGATTTA
CTACAGCTGA CGGCGCTCCT GCAGAGTAAA AATATCCCCT GCCAGCTTTG TACGCCGCAG
CAGGAGAACG GCGGCAAAGT GCTGGTCATC TCCTGTATTA CCGGCATGGG AACGGCGGAA
AAAATCAAAA AGGTGCTGGA GGAGAGCTTT GGCGAACTGA TGTCGCAGGA CACCAGGATG
GTGATCCTTG ATTATAACGA GGTACGTAGT CTGGAGCGCG TTCAGCAGGC ATTGAATGCC
AGCGAGCGGC TGGCGGGGAT TGTCGGCACT TTCCAGCCGG GGCTGCCGGA TATTCCGTTT
ATTTCGCTGG AAGAGCTTTT CTCCGAACAA GGGCCGGAAC TGGTGTTGAG CCTGTTAACG
CCCGATCTGT CCAACGCTGA ACGCCGTCTG GAGATGGAGC GCAGCGCGAT GCGCTTTATC
AGCGCGCTAA CGATGGAGAG CATCATCAAC CATATTTCCG TGCTTAACCC GCAGCGTATT
CTGAAAGAGA TGGAGGGCGT TTTTAACCAT CTGACGTCTT CGCTTTCCCT GAAACCAAGC
CGCCAGGTGA CACTGCGCTT CCTGATCCAC TGCTGCTGTA TGGTAGAGCG TATTGTGATT
AACCGAAAAC CGTTACAGAT GGCGCTGGAA AGTCAGCCGA ATCTGGACGC GCGCGCGTTT
AGTGTCATCA AATCCGCTTT TCTGCCGATC GAAGACGCCT ACGCCATCCG TTTATCGGAT
GCGGAATATT TTTATATCTA CGAACTGCTC TATAGCTAA
 
Protein sequence
MRRIEIVLGE LERLTRGLCL ADLAQETAFT AEAIGFNLGL ARNSVSKDLN QLWNDGLAIK 
SRGRPVYFLH RQALETLLGR QLEESEREVR TVADVLPHEE HYAPDDPFTS LIGYDRSLRD
AVEKGRAAVL YPHGLHVLLT GPSGVGKTFF AELMHRFACE QASGVIPPLV YFNCAEYAHN
PELLSSHLFG HRQGAFTGAN EHKTGLVEQA DGGYLLLDEV HRLSYEGQEK LFSILDKGEY
RPLGVSSQPR SISVRLICAT TEPVGSALLR TFQRRIQVCI DLPGIRQRSV EEQIELIVGF
LQRESRKIER TVSIDKPLLL WLLNKPLEGN IGQLKSDIQF LCAQAWASGM TEHNDTLQLD
KRLAEMSVNP TPEQRLLVDT LFDGKARLNI DARTLPALKT SLATGAEIEE SDLFYSFLTR
EYVNLRNSNV PPAETLAILK NKLSSIFEYG LYSRDSVAHP PRYGDQIEER VTLLIGCVEQ
VLGFSLPENL VNPLRKHFLA LIGYVQRGLI PQLYSSSLIL DRCKDEYDNA ALLCRKINEL
LHIQCPATEV VWLCLFLKEC RHYRQRIDAS PDCGVILIAH GATTATSQAQ YVNRVLEREL
FSAIDMPFEQ SVHDTLETLT QMIQTRQYRR LILLVDIGSL IHFGSTISKL FQIDVLLMPN
ITLTSLLEVG LDLSYETSDL LQLTALLQSK NIPCQLCTPQ QENGGKVLVI SCITGMGTAE
KIKKVLEESF GELMSQDTRM VILDYNEVRS LERVQQALNA SERLAGIVGT FQPGLPDIPF
ISLEELFSEQ GPELVLSLLT PDLSNAERRL EMERSAMRFI SALTMESIIN HISVLNPQRI
LKEMEGVFNH LTSSLSLKPS RQVTLRFLIH CCCMVERIVI NRKPLQMALE SQPNLDARAF
SVIKSAFLPI EDAYAIRLSD AEYFYIYELL YS