Gene Information |
Locus tag | ECH74115_5348 |
Symbol | |
ID | 6970481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4989856 |
End bp | 4991604 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643389004 |
Product | putative frv operon regulatory protein |
Protein accession | YP_002273413 |
Protein GI | 209397591 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
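The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. Neither convention is stated in the record, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4989856
END_BP = 4991604
GENE_LENGTH_BP = 1749
PROTEIN_LENGTH_AA = 582


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(feature_length(START_BP, END_BP))         # 1749
print(expected_protein_length(GENE_LENGTH_BP))  # 582
```

Under these assumptions, 4991604 - 4989856 + 1 gives 1749 bp, and 1749 / 3 - 1 gives 582 aa, matching the Gene Length and Protein Length fields.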
Plasmid Coverage Information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT CTCAACTTTA CCCTTAACGG CAAAGCTCGT ATCTCTGCCA GTGGCAGTGC GGGCTATCAG TTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT CGGCTGCTGG CGCTGTTATT ACTGAATACC TTTACTCCCC GTGCGCAACT CGCCTCGGCG CTTAATTTGC CAGAAACTTG GGTCGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA CGCACTTGTT GCCTGGCCAG TCGTCCCGGT TTGGGCCATT TCATTGATGA AACAGAAGAG AAACGCGTTA TCTTGCTGGC GAACTTGCTA CGCAAAGATC CGTTTTTAAT ACCTTTGGCT GGCGTAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG CCGCTCATGC AGGGGGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT AACGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG CATCAGCAAG CGCAGGTAAT TTCTGCCGAT CATGTGCTGG GGTTGCTGCA AAGGGTGCCA GGTCTCGCGT CATTGAATAT TATTGATACG CAGCTGGTTG AGAACATTAC CGGTCATTTA TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCATC GCCAAAGCAG CATGAATAAC CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG CTGGAACGAC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT GCCACTATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCGT ATTGTGTCAG CATCCGGCAG AAGTTATTGC CGGGCTAACA GGATATGAGG CATTTATGGA GTTACTTCAC AAGGGGTGA
|
Protein sequence | MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR ISASGSAGYQ LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GVTRDNLQHL STACDNQHRW PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH NGLFLGDNAV RTLTGLIEKQ HQQAQVISAD HVLGLLQRVP GLASLNIIDT QLVENITGHL LRCLAAPVWI AEHRQSSMNN LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG
|
| |
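The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. Although Strand is "-", the gene sequence as printed begins with ATG and ends with TGA, so this sketch assumes it is already given in coding orientation and applies no reverse complement. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so one lookup table is used here. The helper names `clean`, `translate` and `gc_fraction` are illustrative, not part of any database tool.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino-acid string shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups and the final group of the gene sequence above.
head = "ATGCTAAACG AACGCCAGTT AAAGATTGTC"
tail = "AAGGGGTGA"

print(translate(head))  # MLNERQLKIV
print(translate(tail))  # KG
```

The first ten codons give MLNERQLKIV, which matches the start of the protein sequence, and the last three codons give KG followed by a stop, which matches its end (assuming the listed sequence has the reported 1749 bp, so that the final group is in frame). Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field.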