Gene Information |
Locus tag | EcHS_A4125 |
Symbol | |
ID | 5593412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4116294 |
End bp | 4118048 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923228 |
Product | putative frv operon regulatory protein |
Protein accession | YP_001460687 |
Protein GI | 157163369 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
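
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 4116294
end_bp = 4118048
gene_length_bp = 1755
protein_length_aa = 584


def span_length(start, end):
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 4118048 - 4116294 + 1 = 1755 bp, and 1755 / 3 - 1 = 584 aa.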
| |
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAA AGCGGCATGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG CATCAGCAAG CGCAGATAAT TTCTGCCGAT AATGTGCAGC GGTTGCTGCA AAGGGTGCCG GGCATCGCGT CATTGAATAT TATTGATACG CGGCTGGTTG AGAACATTAC CGGGCATTTA TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAAAGCAG CATGAATAAC CTGAAAGCAG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGCTCG GTTTGTATTT TGCCTGTGCG CTGGAGCGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAA ATTGTCGGGT GATTATTGCC CGTAGCTTAA GCGAGCTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC GATGAAGCAC AACGCATCAT CGCCCGCGAA GGCGAAGGTG AAGGTGAAAA CCTGATTGTT AATCGCCTCG CCATCCCACA TTGCTGGAGC GAACAGGAGC GACGTTTTCG TGGATTTTTT ATTACCCTCG CCCAACCAGT TGAGGTGAAT AACGAAGTCA TTAACCATGT CTTGATCGCC TGCGCCGCCG CCGATGCGCG TCATGAGCTA AAAATATTTA GCTATCTGGC AAGCATATTG TGTCAGCATC CGGCGGAGAT TATTGCCGGG TTAACAGGAT ATGAGGCATT TATGGAGTTA CTTCACAAGG GGTGA
|
Protein sequence | MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEQ SGMFLGDNAV RTLTGLIEKQ HQQAQIISAD NVQRLLQRVP GIASLNIIDT RLVENITGHL LRCLAAPVWI AEHRQSSMNN LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLLGLYFACA LERHQNERQP IILLSDQNAI ATINQLAIER DVLNCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA DEAQRIIARE GEGEGENLIV NRLAIPHCWS EQERRFRGFF ITLAQPVEVN NEVINHVLIA CAAADARHEL KIFSYLASIL CQHPAEIIAG LTGYEAFMEL LHKG
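
The gene and protein sequences above can be compared programmatically. The sketch below is illustrative only: it builds the codon table by hand rather than relying on a library, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is applied even though the gene lies on the minus strand.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
fragment = "ATGCTAAACG AACGCCAGTT AAAGATTGTC"
print(translate(fragment))    # MLNERQLKIV
print(gc_fraction(fragment))  # 0.4
```

The translated fragment matches the first block of the protein sequence. To check the full record, paste the complete gene sequence into `translate` and compare the result with the protein sequence after passing it through `clean`; `gc_fraction` on the full sequence can be compared with the GC content field in the same way.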