Gene EcE24377A_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0289 
Symbol 
ID5589443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp311933 
End bp315067 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content51% 
IMG OID640924014 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001461443 
Protein GI157157518 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGCG AAGACGATTT AGAACAGCAA AGCCTGCAAT GGTTTGCTGA ACTGGGCTGG 
GAAGTATTGC ACGGGCCGGA TATTGCGCCA GATGGCAACA ATCCGCTGCG TGCGTCGTTC
CACGATGTGT TTTTGCGCCC GGTGCTACTG GAGCAGTTGC AAAAGATTAA CCCACATCTC
CCGGTTGCCG TGCTGGATGA GGTGATACTG CGTATCGCCC ATGCGCAAAG CCCGGATCTG
GTCGTCAGTA ACAAAGCCTT CCATCACCTG CTGCTCGACG GCGTGCCGGT GCAGTACAAG
CAGGATGACA AAGTTATCCA CGACAAAGCG CTACTGATGG ATTTTAACCA CCCAAATAAT
AACCACTTTA CGGTGGTGAA TCAGGTGGCC ATCCAGGGAA CGAAGCAGGT ACGTCGCCCG
GACGTTATCT GCTATATCAA CGGATTGCCA GTAGTAGTGA TTGAGCTGAA AAGCCCGATT
GATGCTAATG CCGATATCTG GGCGGCATTT AATCAGTTGC AGACCTATAA AAACGAACTC
AGCGATCTGT TCATCTGCAA CGAAGCGCTG GTGGTCAGCG ATGGCTACAA CGCCCGTATT
GGCTCTCTCA CCGCGAACGA AGAGCGCTTC TTGCCGTGGA AAACGCTGTC TAATGAAGAC
GACAAACCGC TGTTTGAATG GCAGCTTGAA ACGGTAGTAA AAGGGTTCTT TAACCGCGAA
CTGCTGCTCG ACTACATTCG TTACTTCATC CTGTTTGAAA GCGACGGCAA ACGACTGATT
AAAAAGATTG CCGCTTACCA CCAATTCCAC GCAGTACGTG AAGCGGTGAC GGCGACGATT
GTGGCCTCTA CCGGTAAACA CTTGCCGCTG CGCAGCAACA TCACGCCAGG CAGTAAAAAA
GCCGGTGTGG TGTGGCATAC GCAGGGTTCC GGTAAGAGTA TCTCGATGTG TTGTTACGCC
GGAAAACTGC TACAACAAGC GGAAATGAAC AACCCGACCA TCGTGGTGGT GACCGACCGT
AACGATCTCG ACGGCCAACT GTATGCCACC TTCTGCCAGG CACAGGATTT GCTTAAGCAG
GAACCGTTAC AGGCAAACGA TCGCGACCAG CTCCGCGAGA TGCTCAATGT CCGTGAATCA
GGCGGGATTA TTTTTACCAC CGTACAAAAA TTTGCCCCGC TTGATGGCGA ACAGACTCAC
CCGGCGCTAA ACCTGCGCAG CAATATCGTC GTCATTTCCG ATGAAGCGCA CCGCAGCCAG
TATGGTCTTA GCGCCACGCT GAACCGGGAG ACTGGCGCTT ATAAATACGG TTACGCCAAA
CATATGCGCG ATGCGTTACC CAATGCGTCG TTTATGGGCT TTACCGGAAC ACCGGTTTCT
TCCGAAGATA AAGACACCCG CGCGGTGTTT GGTGATTACG TCTCTATCTA CGATATACAG
GATGCGGTGG AAGATGGCGC AACCGTGCCT ATCTACTATG AATCGCGCCT GGCAAAACTC
GACCTCAACC ACGAAGAGCT GGAAACGCTG TCTAATCAGG TGGATGAGCT GGTCGAAGAT
GAAGAGACCG ATCAGAAAGA GAAAACCAAA AGTGACTGGA GCCGTCTGGA AAAACTGGTT
GGTTCTGAAC CTCGTATCAA TGAGGTGGCC GCCGATCTGG TTCAGCATTT TGAGGCACGT
AACGCCACAA TGAATGGCAA AGCAATGATT GTTGCCATGA GCCGTGAGAT CTGCGTGAAG
CTGTATGATG CGCTGGTGGC TCTACGCCCG GAATGGCACA GTGATGACGT CGAGAAAGGT
GAAATCAAAA TCATCATGAC CGGCTCAGCC TCCGATAATA AATTCCTTCA GCCGCACATC
TACAATAAGC AAACCAAAAA ACGCCTTGAA GCGCGCTTTA AAGATCTCAA CGACCCGTTG
AAACTGGTGA TTGTGCGCGA TATGTGGCTT ACCGGGTTTG ATGCGCCATG TTGTCATACC
ATGTATATCG ACAAACCGAT GCGCGGGCAT AATCTGATGC AGGCCATTGC GCGCGTCAAC
CGCGTCTTCA AAGATAAACC GGGCGGTTTA GTGGTGGATT ACATCGGTAT CGCCAATGAA
CTGAAACAGG CGCTGAAAAC TTATACCGAT TCAAAAGGTA AAGGACAGAC GACAGTCGAT
GCTCATGAAG CGTTCTCCAT CCTGCTTGAA AAGCTGGATG TGATTCATGG AATGTTTGCC
AAAACACCAA CCGCCGCCGG GTTTGATTAC ACCGGTTTTG CAGAGGTGCC CCAACGGTTT
TTACTCAAAG CCGCAGATTA TGTGCTGGGC CTTGATGACG GTAAGAAGCG CTTTTTCGAT
GTCGTGCTGG CGATGAACAA AGCCTGGTCG CTGTGCAGTA CGTTAGATGA AGCTAAACCC
TTGCAAAAAG AGATCGCGTT TTTGTCGGCG GTGAAAGTGG CGATTATCAA GCTGACGACA
ACCGACAAAA AATTCAGTCA GTCAGAGAAA AATACGCTAC TCGGTAAAAT CCTCGATAAC
GCCATTATTG CGACGGGCGT GGATGATGTG TTTGCGCTGG CGGGTCTGGA TAAGCCGAAT
ATTGGATTGT TGTCAGACGA GTTTCTGGAA GAAGTGCGCG AATTGCCGCA GCGTAATCTG
GCAGTCGAGT TGCTGGAGAA ACTGCTGAAC GACGGCATTC ATGCCCGCAC CAAAAACAAC
GTGGTGCAGG AGAAGAAATA CTCAGATCGC CTGAAAGCCG TGCTGCTCAA ATACAATAAC
CGCGCCATTG AAACTGCGCA GGTGATTGAA GAACTGATCC AGATGGCAAA AGAGTTTCAG
GAAGCGATGG CGCGTGATGA AGCGCTGGGG CTAAACCCGG ACGAAATCGC GTTCTACGAT
GCGCTGGCAG AAAACGAAAG TGCGGTACGG GAGCTGGGTG ATGACGTCCT TAAGAAACTG
GCTATCGAAG TCACGTTAAA ATTGCGCCAG TCCACAACCG TAGACTGGCA GGTGCGAGAA
AGCGTGCGTG CGCGGTTACG TATTCTGGTG CGTCAGACGC TGCGTAAGTA CAAGTATCCG
CCAGATAAAA CACCTTATGC AGTTGAACTG ATACTGAAGC AGGCTGAAGT GGTGTCGAAC
AGCTGGACGG TATAG
 
Protein sequence
MLSEDDLEQQ SLQWFAELGW EVLHGPDIAP DGNNPLRASF HDVFLRPVLL EQLQKINPHL 
PVAVLDEVIL RIAHAQSPDL VVSNKAFHHL LLDGVPVQYK QDDKVIHDKA LLMDFNHPNN
NHFTVVNQVA IQGTKQVRRP DVICYINGLP VVVIELKSPI DANADIWAAF NQLQTYKNEL
SDLFICNEAL VVSDGYNARI GSLTANEERF LPWKTLSNED DKPLFEWQLE TVVKGFFNRE
LLLDYIRYFI LFESDGKRLI KKIAAYHQFH AVREAVTATI VASTGKHLPL RSNITPGSKK
AGVVWHTQGS GKSISMCCYA GKLLQQAEMN NPTIVVVTDR NDLDGQLYAT FCQAQDLLKQ
EPLQANDRDQ LREMLNVRES GGIIFTTVQK FAPLDGEQTH PALNLRSNIV VISDEAHRSQ
YGLSATLNRE TGAYKYGYAK HMRDALPNAS FMGFTGTPVS SEDKDTRAVF GDYVSIYDIQ
DAVEDGATVP IYYESRLAKL DLNHEELETL SNQVDELVED EETDQKEKTK SDWSRLEKLV
GSEPRINEVA ADLVQHFEAR NATMNGKAMI VAMSREICVK LYDALVALRP EWHSDDVEKG
EIKIIMTGSA SDNKFLQPHI YNKQTKKRLE ARFKDLNDPL KLVIVRDMWL TGFDAPCCHT
MYIDKPMRGH NLMQAIARVN RVFKDKPGGL VVDYIGIANE LKQALKTYTD SKGKGQTTVD
AHEAFSILLE KLDVIHGMFA KTPTAAGFDY TGFAEVPQRF LLKAADYVLG LDDGKKRFFD
VVLAMNKAWS LCSTLDEAKP LQKEIAFLSA VKVAIIKLTT TDKKFSQSEK NTLLGKILDN
AIIATGVDDV FALAGLDKPN IGLLSDEFLE EVRELPQRNL AVELLEKLLN DGIHARTKNN
VVQEKKYSDR LKAVLLKYNN RAIETAQVIE ELIQMAKEFQ EAMARDEALG LNPDEIAFYD
ALAENESAVR ELGDDVLKKL AIEVTLKLRQ STTVDWQVRE SVRARLRILV RQTLRKYKYP
PDKTPYAVEL ILKQAEVVSN SWTV