Gene Information |
Locus tag | EcSMS35_4287 |
Symbol | |
ID | 6147311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4389292 |
End bp | 4391040 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619108 |
Product | putative frv operon regulatory protein |
Protein accession | YP_001746232 |
Protein GI | 170683057 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
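The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends with one stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4389292
end_bp = 4391040
reported_gene_length = 1749    # bp
reported_protein_length = 582  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1749 0 582
```

Under these assumptions, the computed values agree with the reported Gene Length (1749 bp) and Protein Length (582 aa).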
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.899072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC ACGCACGCCT GGCGAGCTGG CGCAACAGAC TGGCGTTTCC GGCAGGACCA TCCTGCGTGA TATTGACTAT CTCAACTTTA CCCTTAACGG CAAAGCCCGC ATTTCCGCCA GTGGCAACGC GGGCTATCAG CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT CGGCTGCTGG CGCTGTTATT ACTGAATACC TTCACTCCCC GTGCGCAACT CGCCTCGGCG CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGCCTGCCCC GCCTGAAACA GCGCTATGAA CGCGGTTTTT GTCTCGCTAG TCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG AAACGCATTA TTTTGCTGGC GAATCTTTTA CGCAAAGATC CGTTTTTAAT TCCGCTGTCG GGCGTAACAC GAGACAATTT TCAACAATTA ACCACCGCCT GCGAAAAGCA ACGTCGCTGG CCGCTGATGC AAGGCGACTA TCTCTCAAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAG AGCGGCCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CAGGTTTGAT AGAGAAACAG CATCAGCAAG CGCAGGTAAT TTCTGCCGAT CATGTGCTGG GGTTGCTGCA AAGGGTGCCA GGCATCGCGT CATTGAATAT TATTGATACG CAGTTGGTTG AGAATATTAC CGGGCATTTA TTACGTTGCC TTGCCGCACC GGTGTGGATT GCCGAGCACC GCCAAAGCAG CATGAATAAC CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG CTGGAACGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC TGGCAACATA TTACCCGGCA AATTTGTGCT CAATTAGTCG CACAACACCA TATTACCGCC GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC CTCGCCCAAC CGGTTGAGGT GAATAACGAA GTAGTTAACC ATGTACTGAT CGCCTGTGCC GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCGT ATTGTGCCAG CATCCGGCAG AGGTTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC AAGGGGTGA
|
Protein sequence | MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR ISASGNAGYQ LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE RGFCLASRPG LGHFIDETEE KRIILLANLL RKDPFLIPLS GVTRDNFQQL TTACEKQRRW PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEQ SGLFLGDNAV RTLTGLIEKQ HQQAQVISAD HVLGLLQRVP GIASLNIIDT QLVENITGHL LRCLAAPVWI AEHRQSSMNN LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VVNHVLIACA AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG
|
| |
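Both sequences are displayed in space-separated blocks of ten characters. The displayed gene sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene is annotated on the minus strand. The sketch below therefore does not reverse-complement it. It is a minimal Python illustration, not the database's own method: it strips the block spacing, computes a GC fraction, and translates codon by codon. It assumes that translation table 11 uses the same amino acid assignments as the standard code for internal codons (table 11 differs in which start codons it permits). The helper names `strip_blocks`, `gc_fraction`, and `translate` are hypothetical. To keep the example short, only the first three blocks of the gene sequence are used. Paste the full strings from the record to check the whole entry.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Assumption: table 11 shares the standard amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and the first block of the
# protein sequence, copied from the record.
gene_prefix = strip_blocks("ATGCTAAACG AACGCCAGTT AAAGATTGTC")
protein_prefix = strip_blocks("MLNERQLKIV")

print(translate(gene_prefix))                    # MLNERQLKIV
print(translate(gene_prefix) == protein_prefix)  # True
print(gc_fraction(gene_prefix))                  # GC fraction of the prefix only
```

The GC fraction of a 30-base prefix is not expected to match the 50% reported for the whole gene. Run `gc_fraction` on the complete stripped sequence to compare against that field.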