Gene ECD_04216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04216 
SymbolhsdR 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4490365 
End bp4493877 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content54% 
IMG OID 
Productendonuclease R 
Protein accessionACT45997 
Protein GI253980327 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATA AATCCAATTT TGAATTCCTG AAGGGCGTCA ACGACTTTAC CTATGCCATC 
GCCTGTGCGG CGGAAAATAA CTACCCGGAT GATCCCAACA CGACGCTGAT TAAAATGCGT
ATGTTTGGCG AAGCCACAGC GAAACATCTT GGTCTGTTAC TCAACATCCC CCCTTGTGAG
AATCAACACG ATCTCCTGCG CGAACTCGGC AAAATCGCCT TCGTTGATGA CAACATTCTC
TCTGTATTCC ACAAATTACG CCGCATTGGT AACCAGGCGG TGCACGAATA TCATAACGAT
CTCGACGATG CCCAGATGTG CCTGCGACTC GGGTTTCGCC TGGCTGTCTG GTACTACCGT
CTGGTCACTA AAGATTATGA CTTCCCGGTG CCGGTGTTTG TGTTGCCGGA ACGTGGTGAA
AACCTCTATC ACCAGGAAGT GCTGACGCTA AAACAACAGC TTGAACAGCA GGTGCGAGAA
AAAGCGCAGA GTCAGGCAGA AGTCGAAGCA CAACAGCAGA AGCTGGTTGC CCTGAACGGC
TATATCGCCA TTCTGGAAGG CAAGCAGCAG GAAACCGAAG CGCAAACCCA GGCTCGCCTT
GCTGCACTGG AAGCACAGCT CGCCGAGAAG AACGCGGAAC TGGCCAAGCA GACCGAACAG
GAACGTAAGG CTTACCACAA AGAAATTACC GATCAGGCCA TCAAGCGTAC ACTCAATCTT
AGCGAAGAAG AGAGCCGCTT CCTGATTGAC GCGCAACTGC GTAAAGCAGG CTGGCAGGCC
GATAGCAAAA CCCTGCGCTT CTCCAAAGGC GCGCGCCCGG AACCCGGCGT CAATAAAGCC
ATTGCCGAAT GGCCGACCGG GAAAGATGAA ACGGGTAATC AGGGCTTTGC GGATTATGTG
CTGTTTGTCG GTCTCAAACC CATCGCGGTG GTAGAAGCGA AACGTAAAAA TATCGACGTT
CCCGGCAAGC TCAATGAGTC GTATCGCTAC AGTAAATGTT TCGATAATGG CTTCCTGCGG
GAAACCTTGC TTGAGCACTA TTCACCAGAT GAAGTGCATG AAGCGGTGCC AGAGTATGAA
ACCAGCTGGC AGGACACCAG CGGCCAACAA CGGTTTAAAA TCCCCTTCTG CTACTCAACC
AACGGGCGCG AATACCGCGC AGCGATGAAG ATTAAAAGCG GCATCTGGTA TCGCGACGTG
CGTGATACCC GCAATATGTC GAAAGCCTTA CCCGAGTGGC ACCGCCCGGA AGAGCTGCTG
GAAATGCTCG GCAGCGAACC GCAAAAACAG AATCAGTGGT TTGCCGATAA CCCTGGCATG
AGTGAGCTGG GCCTGCGTTA TTATCAGGAA GATGCCGTCC GCGCGGTTGA AAAGGCAATC
GTCAAGGGGC AACAAGAGAT CCTGCTGGCG ATGGCAACCG GTACCGGTAA AACCCGTACG
GCAATTGCCA TGATGTTCCG CCTGATCCAG TCCCAGCGTT TTAAACGCAT TCTCTTCCTT
GTCGACCGCC GTTCTCTTGG CGAACAGGCG CTGGGCGCGT TTGAAGATAC GCGTATTAAC
GGCGACACCT TTAACAGCAT TTTCGACATT AAAGGGCTGA CGGATAAATT CCCTGAAGAC
AGCACCAAAA TTCACGTTGC CACCGTACAG TCGCTGGTGA AACGCACCCT GCAATCGGAT
GAACCGATGC CGGTAGCCCG TTACGACTGT ATCGTCGTCG ATGAAGCGCA TCGCGGCTAC
ATTCTCGATA AAGAGCAGAC CGAAGGCGAA CTACAGTTCC GCAGTCAGCT GGATTACGTC
TCTGCCTACC GTCGTATTCT CGATCACTTC GATGCGGTAA AAATCGCTCT CACCGCCACC
CCGGCTCTGC ATACGGTGCA GATTTTCGGC GAGCCGGTTT ACCGTTACAC CTACCGTACC
GCGGTTATCG ACGGTTTTCT GATCGACCAG GATCCGCCGA TTCAGATCAC CACCCGCAAC
GCCCAGGAAG GGGTTTATCT CTCCAAAGGC GAGCAGGTTG AGCGCATCAG CCCGCAGGGG
GAAGTGATCA ACGACACCCT GGAAGACGAT CAGGATTTTG AGGTCGCCGA CTTTAACCGT
GGCCTGGTGA TCCCGGCGTT TAACCGCGCC GTCTGTAACG AACTCACCAA CTACCTTGAC
CCGACCGGGT CACAAAAAAC GCTGGTCTTC TGCGTCACCA ATGCCCATGC CGATATGGTG
GTGGAAGAGC TGCGTACCGC GTTCAAGAAA AAGTATCCAC AACTGGAGCA CGACGCGATC
ATCAAGATCA CCGGTGATGC CGATAAAGAC GCGCGAAAAG TGCAGACCAT GATCACCCGC
TTCAATAAAG AGCGGCTGCC CAATATCGTG GTCACCGTCG ACCTGCTGAC GACCGGCGTC
GATATTCCGT CGATCTGTAA TATCGTGTTC CTGCGCAAGG TACGCAGCCG TATTCTGTAC
GAGCAGATGA AAGGGCGCGC CACGCGCTTA TGCCCGGAGG TGAATAAAAC CAGCTTTAAG
ATTTTCGACT GTGTCGATAT CTACAGCACG CTGGAGAGCG TCGACACCAT GCGTCCGGTG
GTGGTGCGTC CGAAGGTGGA ACTGCAAACG CTGGTCAACG AAATTACCGA TTCAGAAACC
TATAAAATCA CCGAAGCGGA TGGCCGCAGT TTTGCCGAGC ACAGCCATGA ACAACTGGTG
GCGAAGCTCC AGCGCATCAT CGGCCTGGCC ACGTTTAACC GTGACCGCAG CGAAACGATA
GACAAACAGG TGCGTCGTCT GGATGAGCTA TGCCAGGACG CGGCGGGGGT TAACTTTAAC
GGCTTCGCCT CGCGCCTGCG GGAAAAAGGG CCGCACTGGA GCGCCGAAGT CTTTAACAAA
CTACCTGGCT TTATCGCCCG TCTGGAAAAG CTGAAAACCG ACATCAACAA CCTGAATGAT
GCGCCGATCT TCCTCGATAT CGACGATGAA GTGGTGAGCG TAAAATCGCT GTACGGTGAT
TACGACACGC CGCAGGATTT CCTCGAAGCC TTTGACTCGC TGGTGCAACG TTCCCCGAAT
GCACAGCCGG CATTGCAGGC GGTGATTAAT CGCCCGCGCG ATCTCACCCG TAAAGGGCTG
GTCGAGCTAC AGGAGTGGTT TGACCGCCAG CACTTTGAGG AATCTTCCCT GCGCAAAGCA
TGGCAAGAGA CGCGCAATGA AGATATCGCC GCCCGGCTGA TTGGACATAT TCGCCGCGCT
GCGGTGGGCG ATGCGCTGAA ACCGTTTGAG GAACGTGTCG ATCACGCGCT GACGCGCATT
AAGGGCGAAA ATGACTGGAG CAGCGAGCAA TTAAGCTGGC TCGATCGTTT AGCGCAGGCG
CTGAAAGAGA AAGTGGTGCT CGACGACGAT GTCTTCAAAA CCGGCAACTT CCACCGTCGC
GGCGGGAAGG CGATGCTGCA AAGAACCTTT GACGATAATC TCGACAACCT GCTCGATAAA
TTCAGCGATT ACATTTGGGA CGAGCTGGCC TGA
 
Protein sequence
MMNKSNFEFL KGVNDFTYAI ACAAENNYPD DPNTTLIKMR MFGEATAKHL GLLLNIPPCE 
NQHDLLRELG KIAFVDDNIL SVFHKLRRIG NQAVHEYHND LDDAQMCLRL GFRLAVWYYR
LVTKDYDFPV PVFVLPERGE NLYHQEVLTL KQQLEQQVRE KAQSQAEVEA QQQKLVALNG
YIAILEGKQQ ETEAQTQARL AALEAQLAEK NAELAKQTEQ ERKAYHKEIT DQAIKRTLNL
SEEESRFLID AQLRKAGWQA DSKTLRFSKG ARPEPGVNKA IAEWPTGKDE TGNQGFADYV
LFVGLKPIAV VEAKRKNIDV PGKLNESYRY SKCFDNGFLR ETLLEHYSPD EVHEAVPEYE
TSWQDTSGQQ RFKIPFCYST NGREYRAAMK IKSGIWYRDV RDTRNMSKAL PEWHRPEELL
EMLGSEPQKQ NQWFADNPGM SELGLRYYQE DAVRAVEKAI VKGQQEILLA MATGTGKTRT
AIAMMFRLIQ SQRFKRILFL VDRRSLGEQA LGAFEDTRIN GDTFNSIFDI KGLTDKFPED
STKIHVATVQ SLVKRTLQSD EPMPVARYDC IVVDEAHRGY ILDKEQTEGE LQFRSQLDYV
SAYRRILDHF DAVKIALTAT PALHTVQIFG EPVYRYTYRT AVIDGFLIDQ DPPIQITTRN
AQEGVYLSKG EQVERISPQG EVINDTLEDD QDFEVADFNR GLVIPAFNRA VCNELTNYLD
PTGSQKTLVF CVTNAHADMV VEELRTAFKK KYPQLEHDAI IKITGDADKD ARKVQTMITR
FNKERLPNIV VTVDLLTTGV DIPSICNIVF LRKVRSRILY EQMKGRATRL CPEVNKTSFK
IFDCVDIYST LESVDTMRPV VVRPKVELQT LVNEITDSET YKITEADGRS FAEHSHEQLV
AKLQRIIGLA TFNRDRSETI DKQVRRLDEL CQDAAGVNFN GFASRLREKG PHWSAEVFNK
LPGFIARLEK LKTDINNLND APIFLDIDDE VVSVKSLYGD YDTPQDFLEA FDSLVQRSPN
AQPALQAVIN RPRDLTRKGL VELQEWFDRQ HFEESSLRKA WQETRNEDIA ARLIGHIRRA
AVGDALKPFE ERVDHALTRI KGENDWSSEQ LSWLDRLAQA LKEKVVLDDD VFKTGNFHRR
GGKAMLQRTF DDNLDNLLDK FSDYIWDELA