Gene EcDH1_2533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2533 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2705727 
End bp2709221 
Gene Length3495 bp 
Protein Length1164 aa 
Translation table11 
GC content55% 
IMG OID 
Producttranscription-repair coupling factor 
Protein accessionACX40169 
Protein GI260449747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00875995 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCAT ATGTTGAGGC ATATCCTAAC GAGAATCTGA CAACCGTTAT GCCTGAACAA 
TATCGTTATA CGCTGCCCGT CAAAGCGGGT GAGCAGCGTC TGCTGGGCGA GTTAACCGGC
GCAGCCTGTG CAACGCTGGT AGCGGAAATT GCCGAACGTC ACGCCGGTCC GGTGGTACTC
ATTGCACCAG ATATGCAAAA TGCTCTGCGT TTGCATGATG AAATCAGCCA GTTTACCGAT
CAAATGGTGA TGAATCTGGC GGACTGGGAA ACTCTTCCCT ACGACAGTTT TTCGCCTCAT
CAGGACATTA TCTCCTCGCG CCTTTCCACC CTTTACCAGC TACCGACGAT GCAGCGTGGC
GTACTGATTG TTCCGGTGAA TACGCTTATG CAGCGCGTTT GCCCACACAG TTTTCTCCAC
GGTCATGCGC TGGTGATGAA AAAAGGTCAG CGCCTGTCAC GAGATGCATT ACGAACCCAA
CTGGACAGCG CCGGTTATCG CCATGTTGAC CAGGTGATGG AGCACGGCGA ATACGCCACG
CGCGGCGCGT TGCTGGATCT CTTCCCGATG GGGAGTGAGC TGCCTTATCG TCTTGATTTC
TTTGATGATG AAATCGACAG CCTGCGGGTG TTTGACGTCG ACAGCCAGCG CACGCTGGAG
GAAGTAGAAG CGATCAATCT GCTGCCCGCG CACGAATTTC CGACCGATAA AGCGGCAATT
GAACTGTTCC GCAGCCAGTG GCGCGATACC TTCGAAGTGA AGCGCGATCC AGAACATATT
TACCAGCAAG TGAGTAAAGG CACATTACCT GCCGGGATCG AGTACTGGCA GCCATTGTTC
TTCAGCGAAC CACTGCCGCC GCTGTTCAGT TATTTCCCTG CCAATACCTT GCTCGTGAAT
ACTGGCGATC TGGAAACCAG TGCCGAACGT TTCCAGGCTG ACACGCTGGC GCGTTTTGAG
AATCGCGGCG TCGATCCGAT GCGCCCGCTG TTGCCACCAC AATCGCTCTG GCTGCGGGTG
GACGAGCTCT TCTCAGAGCT GAAAAACTGG CCCCGGGTGC AGCTAAAAAC TGAACATTTA
CCGACAAAAG CCGCGAATGC CAATTTAGGT TTCCAGAAAC TGCCAGACCT GGCCGTTCAG
GCACAACAAA AAGCGCCGCT GGATGCGCTG CGTAAGTTCC TCGAGACTTT CGACGGTCCG
GTGGTGTTCT CGGTAGAAAG TGAAGGTCGC CGTGAAGCGC TGGGTGAACT GCTCGCACGA
ATTAAAATTG CTCCGCAACG CATTATGCGT CTTGATGAAG CCAGCGACCG TGGGCGTTAT
CTGATGATTG GCGCTGCCGA ACATGGTTTT GTCGATACGG TGCGTAATCT GGCGCTGATT
TGCGAAAGCG ATCTGCTCGG TGAACGCGTT GCCCGTCGTC GTCAGGATTC TCGCCGCACC
ATCAACCCCG ATACACTGAT CCGTAACCTT GCGGAACTGC ATATTGGTCA GCCGGTGGTC
CATCTGGAGC ACGGCGTCGG TCGTTATGCC GGAATGACCA CGCTGGAAGC GGGTGGCATT
ACTGGCGAGT ATCTGATGCT CACCTATGCC AACGACGCCA AACTGTATGT TCCGGTGTCG
TCACTGCATC TGATTAGCCG TTACGCAGGT GGCGCGGAAG AAAACGCCCC GCTGCATAAA
CTTGGCGGCG ATGCGTGGTC ACGCGCGCGG CAGAAAGCGG CGGAAAAAGT GCGTGATGTG
GCGGCGGAAT TGCTGGATAT CTACGCGCAA CGCGCCGCCA AAGAGGGCTT CGCGTTTAAA
CACGATCGTG AGCAGTATCA GTTGTTCTGC GACAGCTTCC CGTTTGAAAC CACGCCGGAT
CAGGCGCAGG CCATTAATGC GGTACTTAGC GACATGTGTC AGCCGCTGGC AATGGATCGT
CTGGTGTGCG GCGATGTTGG CTTTGGTAAA ACAGAAGTGG CGATGCGCGC AGCTTTCCTG
GCAGTAGATA ACCACAAGCA GGTGGCGGTG CTGGTGCCTA CCACCCTTCT CGCGCAGCAG
CATTACGACA ACTTCCGCGA CCGTTTCGCC AACTGGCCGG TACGTATCGA AATGATCTCC
CGTTTCCGCA GCGCCAAAGA GCAGACGCAA ATCCTTGCGG AAGTGGCGGA AGGGAAAATC
GATATTCTGA TCGGTACGCA CAAACTGCTG CAAAGTGACG TCAAGTTTAA AGATTTAGGC
CTGCTGATTG TCGATGAAGA ACACCGCTTC GGGGTGCGTC ATAAAGAGCG CATTAAAGCG
ATGCGCGCGA ACGTGGATAT TCTGACGCTT ACTGCAACGC CGATCCCACG TACGCTGAAT
ATGGCAATGA GCGGAATGCG TGACCTGTCG ATTATCGCCA CGCCGCCCGC CCGTCGTCTG
GCAGTTAAAA CCTTTGTCCG TGAGTATGAC AGCATGGTGG TCCGGGAGGC GATCCTGCGT
GAAATTTTGC GCGGAGGACA GGTTTATTAT CTCTACAATG ATGTGGAAAA CATTCAGAAA
GCCGCCGAAC GGCTGGCAGA ACTGGTGCCA GAAGCGCGGA TCGCCATCGG TCACGGGCAG
ATGCGCGAGC GCGAACTGGA ACGGGTGATG AATGATTTCC ATCATCAACG TTTCAACGTG
CTGGTTTGTA CAACCATTAT CGAAACCGGG ATCGACATCC CGACAGCCAA CACTATTATC
ATTGAACGCG CGGATCACTT CGGTCTGGCG CAGCTGCACC AGTTACGCGG TCGCGTCGGA
CGTTCGCATC ATCAGGCATA TGCATGGTTG CTGACACCGC ATCCAAAAGC GATGACTACC
GATGCACAAA AACGTCTTGA AGCAATTGCC TCGCTGGAAG ATCTCGGGGC AGGTTTTGCG
CTGGCAACGC ACGATCTGGA GATTCGCGGC GCGGGTGAAC TGCTTGGCGA AGAACAAAGC
GGCTCAATGG AAACCATCGG TTTCTCGCTG TATATGGAGT TGCTGGAAAA CGCCGTCGAT
GCACTGAAAG CCGGACGCGA GCCGTCGCTG GAAGATCTCA CCAGCCAGCA AACAGAAGTC
GAGCTGCGGA TGCCGTCGCT ATTGCCAGAT GATTTCATCC CTGACGTGAA CACGCGTCTG
TCGTTCTACA AACGTATTGC CAGCGCCAAA ACGGAAAACG AACTGGAAGA GATCAAAGTC
GAGCTTATCG ATCGCTTCGG CCTGCTGCCA GATCCGGCGC GTACCCTGCT GGATATTGCC
CGTCTGCGCC AGCAAGCGCA GAAACTGGGG ATCAGGAAGC TGGAAGGTAA TGAGAAAGGC
GGGGTGATCG AATTTGCCGA GAAGAATCAC GTTAATCCGG CCTGGTTGAT TGGTTTGCTG
CAAAAACAGC CGCAGCATTA CCGCCTTGAT GGTCCGACGC GCCTGAAATT TATTCAGGAT
TTGAGTGAGC GGAAAACGCG TATCGAATGG GTACGCCAGT TTATGCGTGA ACTGGAAGAG
AACGCGATCG CTTAA
 
Protein sequence
MPPYVEAYPN ENLTTVMPEQ YRYTLPVKAG EQRLLGELTG AACATLVAEI AERHAGPVVL 
IAPDMQNALR LHDEISQFTD QMVMNLADWE TLPYDSFSPH QDIISSRLST LYQLPTMQRG
VLIVPVNTLM QRVCPHSFLH GHALVMKKGQ RLSRDALRTQ LDSAGYRHVD QVMEHGEYAT
RGALLDLFPM GSELPYRLDF FDDEIDSLRV FDVDSQRTLE EVEAINLLPA HEFPTDKAAI
ELFRSQWRDT FEVKRDPEHI YQQVSKGTLP AGIEYWQPLF FSEPLPPLFS YFPANTLLVN
TGDLETSAER FQADTLARFE NRGVDPMRPL LPPQSLWLRV DELFSELKNW PRVQLKTEHL
PTKAANANLG FQKLPDLAVQ AQQKAPLDAL RKFLETFDGP VVFSVESEGR REALGELLAR
IKIAPQRIMR LDEASDRGRY LMIGAAEHGF VDTVRNLALI CESDLLGERV ARRRQDSRRT
INPDTLIRNL AELHIGQPVV HLEHGVGRYA GMTTLEAGGI TGEYLMLTYA NDAKLYVPVS
SLHLISRYAG GAEENAPLHK LGGDAWSRAR QKAAEKVRDV AAELLDIYAQ RAAKEGFAFK
HDREQYQLFC DSFPFETTPD QAQAINAVLS DMCQPLAMDR LVCGDVGFGK TEVAMRAAFL
AVDNHKQVAV LVPTTLLAQQ HYDNFRDRFA NWPVRIEMIS RFRSAKEQTQ ILAEVAEGKI
DILIGTHKLL QSDVKFKDLG LLIVDEEHRF GVRHKERIKA MRANVDILTL TATPIPRTLN
MAMSGMRDLS IIATPPARRL AVKTFVREYD SMVVREAILR EILRGGQVYY LYNDVENIQK
AAERLAELVP EARIAIGHGQ MRERELERVM NDFHHQRFNV LVCTTIIETG IDIPTANTII
IERADHFGLA QLHQLRGRVG RSHHQAYAWL LTPHPKAMTT DAQKRLEAIA SLEDLGAGFA
LATHDLEIRG AGELLGEEQS GSMETIGFSL YMELLENAVD ALKAGREPSL EDLTSQQTEV
ELRMPSLLPD DFIPDVNTRL SFYKRIASAK TENELEEIKV ELIDRFGLLP DPARTLLDIA
RLRQQAQKLG IRKLEGNEKG GVIEFAEKNH VNPAWLIGLL QKQPQHYRLD GPTRLKFIQD
LSERKTRIEW VRQFMRELEE NAIA