Gene EcDH1_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3211 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3454003 
End bp3455205 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content53% 
IMG OID 
Productnuclease SbcCD, D subunit 
Protein accessionACX40837 
Protein GI260450415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.13705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCC TTCACACCTC AGACTGGCAT CTCGGCCAGA ACTTCTACAG TAAAAGCCGC 
GAAGCTGAAC ATCAGGCTTT TCTTGACTGG CTGCTGGAGA CAGCACAAAC CCATCAGGTG
GATGCGATTA TTGTTGCCGG TGATGTTTTC GATACCGGCT CGCCGCCCAG TTACGCCCGC
ACGTTATACA ACCGTTTTGT TGTCAATTTA CAGCAAACTG GCTGTCATCT GGTGGTACTG
GCAGGAAACC ATGACTCGGT CGCCACGCTG AATGAATCGC GCGATATCAT GGCGTTCCTC
AATACTACCG TGGTCGCCAG CGCCGGACAT GCGCCGCAAA TCTTGCCTCG TCGCGACGGG
ACGCCAGGCG CAGTGCTGTG CCCCATTCCG TTTTTACGTC CGCGTGACAT TATTACCAGC
CAGGCGGGGC TTAACGGTAT TGAAAAACAG CAGCATTTAC TGGCAGCGAT TACCGATTAT
TACCAACAAC ACTATGCCGA TGCCTGCAAA CTGCGCGGCG ATCAGCCTCT GCCCATCATC
GCCACGGGAC ATTTAACGAC CGTGGGGGCC AGTAAAAGTG ACGCCGTGCG TGACATTTAT
ATTGGCACGC TGGACGCGTT TCCGGCACAA AACTTTCCAC CAGCCGACTA CATCGCGCTC
GGGCATATTC ACCGCGCACA GATTATTGGC GGCATGGAAC ATGTTCGCTA TTGCGGCTCC
CCCATTCCAC TGAGTTTTGA TGAATGCGGT AAGAGTAAAT ATGTCCATCT GGTGACATTT
TCAAACGGCA AATTAGAGAG CGTGGAAAAC CTGAACGTAC CGGTAACGCA ACCCATGGCA
GTGCTGAAAG GCGATCTGGC GTCGATTACC GCACAGCTGG AACAGTGGCG CGATGTATCG
CAGGAGCCAC CTGTCTGGCT GGATATCGAA ATCACTACTG ATGAGTATCT GCATGATATT
CAGCGCAAAA TCCAGGCATT AACCGAATCA TTGCCTGTCG AAGTATTGCT GGTACGTCGG
AGTCGTGAAC AGCGCGAGCG TGTGTTAGCC AGCCAACAGC GTGAAACCCT CAGCGAACTC
AGCGTCGAAG AGGTGTTCAA TCGCCGTCTG GCACTGGAAG AACTGGATGA ATCGCAGCAG
CAACGTCTGC AGCATCTTTT CACCACGACG TTGCATACCC TCGCCGGAGA ACACGAAGCA
TGA
 
Protein sequence
MRILHTSDWH LGQNFYSKSR EAEHQAFLDW LLETAQTHQV DAIIVAGDVF DTGSPPSYAR 
TLYNRFVVNL QQTGCHLVVL AGNHDSVATL NESRDIMAFL NTTVVASAGH APQILPRRDG
TPGAVLCPIP FLRPRDIITS QAGLNGIEKQ QHLLAAITDY YQQHYADACK LRGDQPLPII
ATGHLTTVGA SKSDAVRDIY IGTLDAFPAQ NFPPADYIAL GHIHRAQIIG GMEHVRYCGS
PIPLSFDECG KSKYVHLVTF SNGKLESVEN LNVPVTQPMA VLKGDLASIT AQLEQWRDVS
QEPPVWLDIE ITTDEYLHDI QRKIQALTES LPVEVLLVRR SREQRERVLA SQQRETLSEL
SVEEVFNRRL ALEELDESQQ QRLQHLFTTT LHTLAGEHEA