Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4120 |
Symbol | |
ID | 6066004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4545445 |
End bp | 4547193 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641603542 |
Product | putative frv operon regulatory protein |
Protein accession | YP_001727045 |
Protein GI | 170022091 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT AGCGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CGGGTTTGAT AGAGAAACAG CATCAGCAAG CGCAGGTAAT TTCAGCCGAT AATGTGCAGG GGTTGCTGCA AAGGGTGCCG GGCATCGCGT CATTGAATAT TATTGATGCG CAGCTGGTTG AGAATATTAC CGGGCATTTA TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAGAGCAG CATGAATAAC CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT GAACAGCTCG ATATTCCTCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG CTGGAACGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT GCCACCATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAC ATTGTCGGGT GATTATTGCC CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC AACAGCCATT ATTTACTGGA TGACGCGGTC AATAATTACA TCACCGTAAA AAATATCATT ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAGCAA CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTGG CACAACACCA TATTACCGCC GATGAAGCGC AACGGATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC GCCGCCGATG CGCGTCACGA GTTGAAAATA TTCAGCTATC TGGCAAGCGT ATTGTGTCAG CATCCGGCAG AGGTTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC AAGGGGTGA
|
Protein sequence | MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH SGLFLGDNAV RTLTGLIEKQ HQQAQVISAD NVQGLLQRVP GIASLNIIDA QLVENITGHL LRCLAAPVWI AEHRQSSMNN LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI ATINQLAIER DVLHCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA AADARHELKI FSYLASVLCQ HPAEVIAGLT GYEAFMELLH KG
|
| |