Gene Information |
Locus tag | EcE24377A_4903 |
Symbol | |
ID | 5586601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4887680 |
End bp | 4890571 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640928504 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001465831 |
Protein GI | 157158933 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
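The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4887680
END_BP = 4890571
GENE_LENGTH_BP = 2892
PROTEIN_LENGTH_AA = 963


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4887680 to 4890571 gives 2892 bp, and 2892 / 3 = 964 codons, one of which is the stop codon, leaving 963 amino acids.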
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.891445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGTA CAACACCTTA TCATGCGAAA TATTTTGCGC ATGAACTTTC CATTCTCCAT TCAAACAATG GGGTGGACCG TCTGTCTCAA TCGCTGTTTG ATGCAAGCGT AGATCTAAAT CCTCATCAAA TTGAGGCGGC TCTTTTTGCT ATTGAAAATC CGCTAAGCAA AGGTGTTGTG CTTGCCGATG AAGTTGGTTT AGGTAAAACC ATTGAAGCGG GGTTAGTGCT TTGCCAGTTG TGGGCTGAAA GAAAACGTAA ACTGTTGATC ATTTGTCCTG CTTCACTGCG TCGCCAATGG GCGAGCGAGT TACAGGAAAA GTTCAATCTA CCTTCTCAGG TGTTGGACGC GAAAACGTAT AGCCAGCTTC AAAAAGAAGG GATCCATAAT CCCTTAAATA ATAAATCGAT CACTATCATG TCATACCACT ATGCTGCTCG GTTAGAAGAA AAATTGGTGG CTGAAATGTG GGATCTGGTT GTTATCGACG AAGCCCATAA ACTGCGTAAT GCTCATCGAG AAAGTAACAA AATGGGGCAA GCGTTAAAGC GTGCGCTTGA TGGGCGGAAA AAATTACTAC TGACCGCAAC GCCTTTACAG AATTCGTTAA TGGAACTGTA TGGCATGTCG ACGCTTATAG ATGAACATAC CTTCGGCGAA GTTAAAGCGT TCAGAAAGCA ATATATGCAG GCAGATAGCG ATATTGCGGA GTTGAAGGGG CGATTGTCGC GTTTTATCAA ACGAACTTTA CGCAAGAATG TGCTGGAGTA CATCAAGTAT ACAGAACGTA AAGCGATCAC TATTCCTTTC TATCCGAGCG AACAGGAGCA GGATCTTTAT GAACGCGTAC AAAAATTGCT TGAACGTGAG GATAGCTATG CCTTACCAAA ACGACACCGG CATCTGACGG GGCTTATTCT CAGAAAGCTA CTTTCCTCAA GTACGAAAGC TGTATTGAAT AACTTGCAGA TCCTTAAGAG TCGTTTAGAA AGGTTAAAGC TGGAAGGTAT CGTTGAAGAT GATATGAACA TCATCCAACA GATCATAATG GATGACGATC TTGAAGATGA CATCGTGGAA GATGCCGAAT CAGTGGCTTT AGATACCGAG TGCAAAGTTG TCGATTCAGA CGCGCTTCAG GCAGAAATCA ATGAGCTGGA ATCACTCATT GTCAAAGCAG AACAGATTGG CACTGATACC AAATCAAAAG AGTTGCTTTC TGGCCTGGAG CAAGGTTTTG CACAGCTTGC CGAAATGGGG GCTGCAAAAA AAGTCATCAT CTTCACGGAG TCAATGCGAA CTCAACAGTA TCTTGCACAT TTTCTGGAAA ACAATGGCTA CCAGGGGAAA GTAGTAACCT TTAGTGGCAC AAACAATACA CCGCAGGCAA ACAAGATCTA CCAGCAATGG CGCGAGGAAT ATCAGGGATC TTCTCGTATT ACCGGGTCGG CGCAAATTGA TAAACGCTCG GCTTTGATCG ACCACTTCAA AGATCATGCT GAAATCATGA TAGCTACAGA AGCTGCAGCT GAAGGGGTGA ACCTGCAATT TTGTTCGCTA CTGATCAATT ACGATTTACC CTGGAACCCA CAACGCATTG AACAGCGTAT TGGACGTTGT CACCGATACG GGCAAAAATT TGATGTGGTT GTTATTAACT TCCTCAATCA GCGAAACCAG GCCGATCAAC GCGTTCTTGA ACTCTTAACT GAAAAGTTCA GCCTGTTTGA TGGCGTGTTT GGCGCATCGG ATAGTGTGCT TGGCAGCATA GAAAATGGTG TGGATTTTGA AAAAAGGATC 
CAGTCAATTT ATGAGTCATG CCGTACCCCC GAGGAGATAG AGCTGGCTTT TGAAAGGTTG CAGAAAGAGC TGGAAGATGA AATCAATCTC AAGATGCAGA AGACGCAAGA ACAGTTGCTT GAGCATTTTG ATGAAGATAT TCACGATATC CTTAAAATAA GGCTTGATGA AGCTCGGGAG CGTCTGGATA AAGTCGGTCG TTGGTTTTGG GCCGTGACCT GTCAGCAGCT CAAACAATAT GCTGATTTTG ATCATGAGCA TCATCGTTTC ACTTTACGTG AATCAGTAAA TGAACTGCCT GTCGGGCAGT ACCAGCTGTT GCGCAAAGAT GGTAGGCAAA ATAGTGCGTT ACAAATGCAG GATCACCACA GTTTTGCATA TCGGATCAAT CATCCGCTGG GTGAGTACGT CATTGAGGGG GCTAAGGTCC AGGAGACGAA AACAGCTGAA GTCATCTTCG ATTACAGTAA CCACACAACG AAAGTCTCGG TGGCTGAGCA GCTGAAAGGG CAATCAGGGT GGTTAACACT AAATTTGCTT ACTGTTGATG CGTTTAACAG GCAGCAATTT TTAGTCTTTA CCGCAAAAAC TGACAACGGA CGTGTGGTAG ATGGTGAAGC TTGTAAACGT CTGTTCAACT TGTCTGCAAT AACCAACGAA AAGCTGGATA ATTCGGCAAG TTCAGATGCA TTACCTGATG ACTTGATCTT ACTCAAGAAC AGGCAAATTG ACGCCAGACT TGCGGAAGTT CTGGAGCAGA ACAATGTCCT TTTTGAAGCT GAAAGAGATA AGCTGGAAAA ATGGGCTGAA GATATGATCT TTGCGGCGGA AGAAGCATTA CGCGATACCA AAATGCAAAT CAAGTCGTTA AAGCGAGAAG CCAGACTGGC GCAATCGATG GAAGAACAAA AGCAAAACCA GGAAAAGCTG AAGCTACTGG AACGGCAGCA GAAACGCCAA CGTATGGAAA TTTTTGACAT TGAAGATGAG ATCGCTGACA AGCGGGATGA ACTCATCAGT GCCCTTGAAG AGCGGATGAA GCAAAAAACG GATATCACCG AGCTTTTTTC CATCCGCTGG AAAGTTATTT AA
|
Protein sequence | MIGTTPYHAK YFAHELSILH SNNGVDRLSQ SLFDASVDLN PHQIEAALFA IENPLSKGVV LADEVGLGKT IEAGLVLCQL WAERKRKLLI ICPASLRRQW ASELQEKFNL PSQVLDAKTY SQLQKEGIHN PLNNKSITIM SYHYAARLEE KLVAEMWDLV VIDEAHKLRN AHRESNKMGQ ALKRALDGRK KLLLTATPLQ NSLMELYGMS TLIDEHTFGE VKAFRKQYMQ ADSDIAELKG RLSRFIKRTL RKNVLEYIKY TERKAITIPF YPSEQEQDLY ERVQKLLERE DSYALPKRHR HLTGLILRKL LSSSTKAVLN NLQILKSRLE RLKLEGIVED DMNIIQQIIM DDDLEDDIVE DAESVALDTE CKVVDSDALQ AEINELESLI VKAEQIGTDT KSKELLSGLE QGFAQLAEMG AAKKVIIFTE SMRTQQYLAH FLENNGYQGK VVTFSGTNNT PQANKIYQQW REEYQGSSRI TGSAQIDKRS ALIDHFKDHA EIMIATEAAA EGVNLQFCSL LINYDLPWNP QRIEQRIGRC HRYGQKFDVV VINFLNQRNQ ADQRVLELLT EKFSLFDGVF GASDSVLGSI ENGVDFEKRI QSIYESCRTP EEIELAFERL QKELEDEINL KMQKTQEQLL EHFDEDIHDI LKIRLDEARE RLDKVGRWFW AVTCQQLKQY ADFDHEHHRF TLRESVNELP VGQYQLLRKD GRQNSALQMQ DHHSFAYRIN HPLGEYVIEG AKVQETKTAE VIFDYSNHTT KVSVAEQLKG QSGWLTLNLL TVDAFNRQQF LVFTAKTDNG RVVDGEACKR LFNLSAITNE KLDNSASSDA LPDDLILLKN RQIDARLAEV LEQNNVLFEA ERDKLEKWAE DMIFAAEEAL RDTKMQIKSL KREARLAQSM EEQKQNQEKL KLLERQQKRQ RMEIFDIEDE IADKRDELIS ALEERMKQKT DITELFSIRW KVI
|
| |
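The sequences above are displayed in space-separated blocks of 10, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates a fragment. The function names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace ignored)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence above.
    print(translate("ATGATTGGTA CAACACCTTA TCATGCGAAA"))  # MIGTTPYHAK
    print(translate("AAAGTTATTT AA"))                     # KVI
```

The two fragments translate to the first ten and last three residues of the protein sequence shown in the record. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.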