Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0289 |
Symbol | |
ID | 5589443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 311933 |
End bp | 315067 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640924014 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001461443 |
Protein GI | 157157518 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAGCG AAGACGATTT AGAACAGCAA AGCCTGCAAT GGTTTGCTGA ACTGGGCTGG GAAGTATTGC ACGGGCCGGA TATTGCGCCA GATGGCAACA ATCCGCTGCG TGCGTCGTTC CACGATGTGT TTTTGCGCCC GGTGCTACTG GAGCAGTTGC AAAAGATTAA CCCACATCTC CCGGTTGCCG TGCTGGATGA GGTGATACTG CGTATCGCCC ATGCGCAAAG CCCGGATCTG GTCGTCAGTA ACAAAGCCTT CCATCACCTG CTGCTCGACG GCGTGCCGGT GCAGTACAAG CAGGATGACA AAGTTATCCA CGACAAAGCG CTACTGATGG ATTTTAACCA CCCAAATAAT AACCACTTTA CGGTGGTGAA TCAGGTGGCC ATCCAGGGAA CGAAGCAGGT ACGTCGCCCG GACGTTATCT GCTATATCAA CGGATTGCCA GTAGTAGTGA TTGAGCTGAA AAGCCCGATT GATGCTAATG CCGATATCTG GGCGGCATTT AATCAGTTGC AGACCTATAA AAACGAACTC AGCGATCTGT TCATCTGCAA CGAAGCGCTG GTGGTCAGCG ATGGCTACAA CGCCCGTATT GGCTCTCTCA CCGCGAACGA AGAGCGCTTC TTGCCGTGGA AAACGCTGTC TAATGAAGAC GACAAACCGC TGTTTGAATG GCAGCTTGAA ACGGTAGTAA AAGGGTTCTT TAACCGCGAA CTGCTGCTCG ACTACATTCG TTACTTCATC CTGTTTGAAA GCGACGGCAA ACGACTGATT AAAAAGATTG CCGCTTACCA CCAATTCCAC GCAGTACGTG AAGCGGTGAC GGCGACGATT GTGGCCTCTA CCGGTAAACA CTTGCCGCTG CGCAGCAACA TCACGCCAGG CAGTAAAAAA GCCGGTGTGG TGTGGCATAC GCAGGGTTCC GGTAAGAGTA TCTCGATGTG TTGTTACGCC GGAAAACTGC TACAACAAGC GGAAATGAAC AACCCGACCA TCGTGGTGGT GACCGACCGT AACGATCTCG ACGGCCAACT GTATGCCACC TTCTGCCAGG CACAGGATTT GCTTAAGCAG GAACCGTTAC AGGCAAACGA TCGCGACCAG CTCCGCGAGA TGCTCAATGT CCGTGAATCA GGCGGGATTA TTTTTACCAC CGTACAAAAA TTTGCCCCGC TTGATGGCGA ACAGACTCAC CCGGCGCTAA ACCTGCGCAG CAATATCGTC GTCATTTCCG ATGAAGCGCA CCGCAGCCAG TATGGTCTTA GCGCCACGCT GAACCGGGAG ACTGGCGCTT ATAAATACGG TTACGCCAAA CATATGCGCG ATGCGTTACC CAATGCGTCG TTTATGGGCT TTACCGGAAC ACCGGTTTCT TCCGAAGATA AAGACACCCG CGCGGTGTTT GGTGATTACG TCTCTATCTA CGATATACAG GATGCGGTGG AAGATGGCGC AACCGTGCCT ATCTACTATG AATCGCGCCT GGCAAAACTC GACCTCAACC ACGAAGAGCT GGAAACGCTG TCTAATCAGG TGGATGAGCT GGTCGAAGAT GAAGAGACCG ATCAGAAAGA GAAAACCAAA AGTGACTGGA GCCGTCTGGA AAAACTGGTT GGTTCTGAAC CTCGTATCAA TGAGGTGGCC GCCGATCTGG TTCAGCATTT TGAGGCACGT AACGCCACAA TGAATGGCAA AGCAATGATT GTTGCCATGA GCCGTGAGAT CTGCGTGAAG CTGTATGATG CGCTGGTGGC TCTACGCCCG GAATGGCACA GTGATGACGT CGAGAAAGGT GAAATCAAAA TCATCATGAC CGGCTCAGCC TCCGATAATA AATTCCTTCA GCCGCACATC TACAATAAGC AAACCAAAAA ACGCCTTGAA GCGCGCTTTA AAGATCTCAA CGACCCGTTG AAACTGGTGA TTGTGCGCGA TATGTGGCTT ACCGGGTTTG ATGCGCCATG TTGTCATACC ATGTATATCG ACAAACCGAT GCGCGGGCAT AATCTGATGC AGGCCATTGC GCGCGTCAAC CGCGTCTTCA AAGATAAACC GGGCGGTTTA GTGGTGGATT ACATCGGTAT CGCCAATGAA CTGAAACAGG CGCTGAAAAC TTATACCGAT TCAAAAGGTA AAGGACAGAC GACAGTCGAT GCTCATGAAG CGTTCTCCAT CCTGCTTGAA AAGCTGGATG TGATTCATGG AATGTTTGCC AAAACACCAA CCGCCGCCGG GTTTGATTAC ACCGGTTTTG CAGAGGTGCC CCAACGGTTT TTACTCAAAG CCGCAGATTA TGTGCTGGGC CTTGATGACG GTAAGAAGCG CTTTTTCGAT GTCGTGCTGG CGATGAACAA AGCCTGGTCG CTGTGCAGTA CGTTAGATGA AGCTAAACCC TTGCAAAAAG AGATCGCGTT TTTGTCGGCG GTGAAAGTGG CGATTATCAA GCTGACGACA ACCGACAAAA AATTCAGTCA GTCAGAGAAA AATACGCTAC TCGGTAAAAT CCTCGATAAC GCCATTATTG CGACGGGCGT GGATGATGTG TTTGCGCTGG CGGGTCTGGA TAAGCCGAAT ATTGGATTGT TGTCAGACGA GTTTCTGGAA GAAGTGCGCG AATTGCCGCA GCGTAATCTG GCAGTCGAGT TGCTGGAGAA ACTGCTGAAC GACGGCATTC ATGCCCGCAC CAAAAACAAC GTGGTGCAGG AGAAGAAATA CTCAGATCGC CTGAAAGCCG TGCTGCTCAA ATACAATAAC CGCGCCATTG AAACTGCGCA GGTGATTGAA GAACTGATCC AGATGGCAAA AGAGTTTCAG GAAGCGATGG CGCGTGATGA AGCGCTGGGG CTAAACCCGG ACGAAATCGC GTTCTACGAT GCGCTGGCAG AAAACGAAAG TGCGGTACGG GAGCTGGGTG ATGACGTCCT TAAGAAACTG GCTATCGAAG TCACGTTAAA ATTGCGCCAG TCCACAACCG TAGACTGGCA GGTGCGAGAA AGCGTGCGTG CGCGGTTACG TATTCTGGTG CGTCAGACGC TGCGTAAGTA CAAGTATCCG CCAGATAAAA CACCTTATGC AGTTGAACTG ATACTGAAGC AGGCTGAAGT GGTGTCGAAC AGCTGGACGG TATAG
|
Protein sequence | MLSEDDLEQQ SLQWFAELGW EVLHGPDIAP DGNNPLRASF HDVFLRPVLL EQLQKINPHL PVAVLDEVIL RIAHAQSPDL VVSNKAFHHL LLDGVPVQYK QDDKVIHDKA LLMDFNHPNN NHFTVVNQVA IQGTKQVRRP DVICYINGLP VVVIELKSPI DANADIWAAF NQLQTYKNEL SDLFICNEAL VVSDGYNARI GSLTANEERF LPWKTLSNED DKPLFEWQLE TVVKGFFNRE LLLDYIRYFI LFESDGKRLI KKIAAYHQFH AVREAVTATI VASTGKHLPL RSNITPGSKK AGVVWHTQGS GKSISMCCYA GKLLQQAEMN NPTIVVVTDR NDLDGQLYAT FCQAQDLLKQ EPLQANDRDQ LREMLNVRES GGIIFTTVQK FAPLDGEQTH PALNLRSNIV VISDEAHRSQ YGLSATLNRE TGAYKYGYAK HMRDALPNAS FMGFTGTPVS SEDKDTRAVF GDYVSIYDIQ DAVEDGATVP IYYESRLAKL DLNHEELETL SNQVDELVED EETDQKEKTK SDWSRLEKLV GSEPRINEVA ADLVQHFEAR NATMNGKAMI VAMSREICVK LYDALVALRP EWHSDDVEKG EIKIIMTGSA SDNKFLQPHI YNKQTKKRLE ARFKDLNDPL KLVIVRDMWL TGFDAPCCHT MYIDKPMRGH NLMQAIARVN RVFKDKPGGL VVDYIGIANE LKQALKTYTD SKGKGQTTVD AHEAFSILLE KLDVIHGMFA KTPTAAGFDY TGFAEVPQRF LLKAADYVLG LDDGKKRFFD VVLAMNKAWS LCSTLDEAKP LQKEIAFLSA VKVAIIKLTT TDKKFSQSEK NTLLGKILDN AIIATGVDDV FALAGLDKPN IGLLSDEFLE EVRELPQRNL AVELLEKLLN DGIHARTKNN VVQEKKYSDR LKAVLLKYNN RAIETAQVIE ELIQMAKEFQ EAMARDEALG LNPDEIAFYD ALAENESAVR ELGDDVLKKL AIEVTLKLRQ STTVDWQVRE SVRARLRILV RQTLRKYKYP PDKTPYAVEL ILKQAEVVSN SWTV
|
| |