Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3278 |
Symbol | |
ID | 5589483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3294068 |
End bp | 3296866 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926915 |
Product | sigma-54 dependent transcription regulator |
Protein accession | YP_001464287 |
Protein GI | 157158736 |
COG category | [K] Transcription |
COG ID | [COG3933] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGGA TTGATGTTAT TCACAAGGAG CTGGAGCGGC TGACGTATGG GCTGGAACTG TCTGATTTAG CGAAGGAGAA AGCATTTACT GCTGAGGCGA TTGGTTTTAA CCTTGGGCTG GCACGCAACT CGGTTAGTAA AGATCTTAAT CAATTATGGA ATGAAGGCCT GGTGGTAAAA AGTCAGGGAC GTCCTGTTTT TTTTCTACAT CGTCACGCAC TAGAATTGCT CATTAATCGA AAACTGGATG ATTGCGAATG CATAGTTCAT TCAGTGGCTA GTTTATTGCC AAAGAAAGAA AAATATACAG ATGATGATCC GTTTTCCGGA CTTATCGGAT ACGACCGTAG TCTCCGTGAT GCGGTAGAAA AAGGCCGCGC AGCGGTGCTT TATCCCCACG GACTACATGT GCTGTTAACC GGTGCATCTG GTGTTGGCAA AACATTCTTT GCTGAGTTAA TGCATTGCTT CGCTTGCAAG CACACCATAG GTTCGCCACC TCCGTTGGTT TACTTTAACT GTGCCGAATA TGCTCATAAT CCGGAGTTGC TTTCCTCACA TCTGTTTGGT CACCATAAGG GGGCATTTAC CGGGGCAAGT GAGAATAAAA CGGGATTGGT GGAGCAAGCT GACGGAGGTT ATTTACTTCT GGATGAAGTG CATCGCCTCC CGTACGAAGG ACAAGAGAAG CTCTTTTCTA TTCTCGATAA AGGTGAATAC CGCCCACTTG GTTCAAGTGG GCCAACATAT TCAATTTCGG TGCGCCTTAT TTGCGCCACT ACAGAGTCGG TCAACTCGGC ACTATTGCGT ACCTTCCAGC GGCGTATCCA GGTATGTATT GACTTACCGG GTATTCGCCA GCGTTCGGTC GAGGAGCAGC TCGAATTGAT TGTAAGCTTC TTTCAGCGTG AAAGTCGTAA AATTGAACGT ACCATAAGCC TTGATAAAAC GCTCTTGCAT TGGTTGTTAA GTAAACCCCT GGAAGGCAAT ATCGGCCAGC TTAAAAGTGA TATTCAGTTC TTGTGCGCGC AGGCGTGGGC GTCGGGAATG ACCGAGCGTA ATGATACGCT AGAGTTAGAT AAACGACTGG CGGAGATGTC GTTTAATACC ACGCCAGAGC AGCGTCTGCT GGTCGATGCG TTGTTTGGCG GTAAAGATCG TCTGAACGTG GATGCGCGTA CGTTGCCCGC GTTAAAAAAT TCGTTGGCGA CCAGCGCGGA AATAGAAGAG AGCGACCTGT TTTACAGTTT CCTGACTCGT GAATATGTCA ATCTGCGCAA CAGTAACGTG CCGCCCGCAG AAACGCTGGC TATCCTGAAA AACAAACTCA GTTCGATTTT TGAATATGGT CTTTATAGCC GCCATAGTGC GGCGCATCCA CCGCGCTATG GCGATCAAAT TGAGGAGCGT GTGACGCTGC TGATCGGTTA CGTGGAGCAG GTATTAGGAT TTACATTGCC AGAAAACCTG GCGAACCCGT TACGTAAACA CTTTCTTGCG CTGATTGGCT ATGTGCGGCG AGGGCTTATA TCTCAACTCT ATTCATCCAG CCTGATTCTG GATCGCTGTA AAGATGAGTA TGACAATGCC ACGCTTCTGT GCCGCAAAAT CAATGAGTTG TTGCATATTC AGTGCCCGGC GACGGAGGTG GTCTGGCTGT GCCTGTTCCT GAAAGAGTGT CGCCACTACC GCCAGCGCAT CGACACCAGC CCGGATTGCG GCGTGATTCT AATCGCTCAC GGTGCGACCA CCGCAACGAG CCAGGCGCAA TATGTGAACC GGGTGCTGGA GCGCGATCTG TTCTACGCAA TTGATATGCC ATTTGAGCAG TCGGTGCATG ACACCCTGGA AACCCTGACC CGGATGATTC AGGACCGGTG CTGGCAGCGG CTGATACTGA TGGTGGATAT CGGATCGCTG GTCCATTTTG GCAGCACCAT CAGCAAGCTG TTCCAGATTG ACGTTCTGTT GTTGCCAAAT ATCACTCTGA CCAGCCTGCT GGAGGTTGGG CTGGATTTAA GCTACGAAAC TGGCGATCTA TCGCAATTGG CTGTGCTTAT GCAGAGTAAA AATATCCCTT GTCGGCTCTG CACGCCGCAG CAAGAGAGCG GTGGCAAAGT GCTGGTTATC TCATGTATAA CCGGAATGGG GACGGCGGAA AAGATCAAAA AGGTGCTGGA GGAGAGCTTT GGCGAGCTGA TGTCGCAGGA CACGCGTATG GTGATTCTCG ACTATAACGA GGTGCGCAGC CTGGAGCGTA TTCAGCAGGC GCTTAATGCC AGTGAACGGC TGGCAGGGAT CGTCGGCACC TTCCAGCCGG GACTGCCGGA TATACCGTTT ATCTCGCTGG AGGAACTGTT TTCCGAGCAA GGGCCGGAGC TGGTGCTCAG CCTGCTGACG CCGGATCTCT CAAGCAGCGA GCGCCGACTG GAGATGGAGC GCAGCGCCAT GCGCTTTATC AGCGCGCTGA CCATGGAGAG CATCATCAAC CATATTTCGG TGCTCAACCC GCAGCGAATT CTGAAAGAGA TCGAGGATGT TCTGAACTAC TTAACCAACA CGCTCTCCCT GAAACCGAGC CGCCAGGTGA CACTACGTTT TCTGATCCAC TGCTGCTGCA TGGTCGAGCG TATCGTGATT AACCGCAAAC CGTTACAGAT GGCGCTAGAA AACCGGCTTG ATCTGGACGC TCGTGCCTTT AGCGTCATCA AATCCTCCTT TTTGCCGATT GAAGAGGCTT ACGCTATCCG TTTATCGGAC GCAGAATATT TTTATATCTA CGAACTGCTC TACAGCTGA
|
Protein sequence | MRRIDVIHKE LERLTYGLEL SDLAKEKAFT AEAIGFNLGL ARNSVSKDLN QLWNEGLVVK SQGRPVFFLH RHALELLINR KLDDCECIVH SVASLLPKKE KYTDDDPFSG LIGYDRSLRD AVEKGRAAVL YPHGLHVLLT GASGVGKTFF AELMHCFACK HTIGSPPPLV YFNCAEYAHN PELLSSHLFG HHKGAFTGAS ENKTGLVEQA DGGYLLLDEV HRLPYEGQEK LFSILDKGEY RPLGSSGPTY SISVRLICAT TESVNSALLR TFQRRIQVCI DLPGIRQRSV EEQLELIVSF FQRESRKIER TISLDKTLLH WLLSKPLEGN IGQLKSDIQF LCAQAWASGM TERNDTLELD KRLAEMSFNT TPEQRLLVDA LFGGKDRLNV DARTLPALKN SLATSAEIEE SDLFYSFLTR EYVNLRNSNV PPAETLAILK NKLSSIFEYG LYSRHSAAHP PRYGDQIEER VTLLIGYVEQ VLGFTLPENL ANPLRKHFLA LIGYVRRGLI SQLYSSSLIL DRCKDEYDNA TLLCRKINEL LHIQCPATEV VWLCLFLKEC RHYRQRIDTS PDCGVILIAH GATTATSQAQ YVNRVLERDL FYAIDMPFEQ SVHDTLETLT RMIQDRCWQR LILMVDIGSL VHFGSTISKL FQIDVLLLPN ITLTSLLEVG LDLSYETGDL SQLAVLMQSK NIPCRLCTPQ QESGGKVLVI SCITGMGTAE KIKKVLEESF GELMSQDTRM VILDYNEVRS LERIQQALNA SERLAGIVGT FQPGLPDIPF ISLEELFSEQ GPELVLSLLT PDLSSSERRL EMERSAMRFI SALTMESIIN HISVLNPQRI LKEIEDVLNY LTNTLSLKPS RQVTLRFLIH CCCMVERIVI NRKPLQMALE NRLDLDARAF SVIKSSFLPI EEAYAIRLSD AEYFYIYELL YS
|
| |