Gene EcDH1_3419 details

Gene Information

Locus tag: EcDH1_3419
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3662770
End bp: 3666252
Gene Length: 3483 bp
Protein Length: 1160 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: DNA polymerase III, alpha subunit
Protein accession: ACX41035
Protein GI: 260450613
COG category:
COG ID:
TIGRFAM ID:
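
The coordinate and length fields above agree with each other if the coordinates are 1-based and inclusive, and if the gene length counts the terminal stop codon while the protein length does not. Both conventions are assumptions here, since the record does not state them. A minimal sketch of that arithmetic, with illustrative function names that are not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3662770, 3666252)
print(length_bp)                  # 3483
print(protein_length(length_bp))  # 1160
```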


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC 
CTGGCCAAAA CCGCACCGTT GGTAAAAAAG GCGGCGGCGT TGGGTATGCC AGCACTGGCG
ATCACCGATT TCACCAACCT TTGTGGTCTG GTGAAGTTCT ACGGAGCGGG ACATGGCGCA
GGGATTAAGC CTATCGTCGG GGCAGATTTT AACGTCCAGT GCGACCTGCT GGGTGATGAG
TTAACCCACC TGACGGTACT GGCGGCGAAC AATACCGGCT ATCAGAATCT GACGTTGCTG
ATCTCAAAAG CGTATCAGCG CGGGTACGGT GCCGCCGGGC CGATCATCGA TCGCGACTGG
CTTATCGAAT TAAACGAAGG GTTGATCCTT CTTTCCGGCG GACGCATGGG CGACGTCGGA
CGCAGTCTTT TGCGTGGTAA CAGCGCGCTG GTAGATGAGT GTGTCGCGTT TTATGAAGAA
CACTTCCCGG ATCGCTATTT TCTCGAGCTG ATCCGCACCG GCAGGCCGGA TGAAGAAAGC
TATCTGCACG CGGCGGTGGA ACTGGCGGAA GCGCGCGGTT TGCCCGTCGT GGCGACCAAC
GACGTGCGCT TTATCGACAG CAGCGACTTT GACGCACACG AAATCCGCGT CGCGATCCAC
GACGGCTTTA CCCTCGACGA TCCTAAACGC CCGCGTAACT ATTCGCCGCA GCAATATATG
CGTAGCGAAG AGGAGATGTG TGAGCTGTTT GCCGACATCC CCGAAGCCCT TGCCAACACC
GTTGAGATCG CCAAACGCTG TAACGTAACC GTGCGTCTTG GTGAATACTT CCTGCCGCAG
TTCCCGACCG GGGACATGAG CACCGAAGAT TATCTGGTCA AGCGTGCAAA AGAGGGCCTG
GAAGAGCGTC TGGCCTTTTT ATTCCCTGAT GAGGAAGAAC GTCTTAAGCG CCGCCCGGAA
TATGACGAAC GTCTGGAGAC TGAACTTCAG GTTATCAACC AGATGGGCTT CCCGGGCTAC
TTCCTCATCG TTATGGAATT TATCCAGTGG TCGAAAGATA ACGGCGTACC GGTAGGGCCA
GGCCGTGGCT CCGGTGCGGG TTCACTGGTG GCCTACGCGC TGAAAATCAC CGACCTCGAT
CCGCTGGAAT TTGACCTGCT GTTCGAACGT TTCCTTAACC CGGAACGTGT CTCCATGCCT
GACTTCGACG TTGACTTCTG TATGGAGAAA CGCGATCAGG TTATCGAGCA CGTAGCGGAC
ATGTACGGTC GTGATGCGGT ATCGCAGATC ATCACCTTCG GTACAATGGC GGCGAAAGCG
GTGATCCGCG ACGTAGGCCG CGTGCTGGGG CATCCGTACG GCTTTGTCGA TCGTATCTCG
AAACTGATCC CGCCCGATCC GGGGATGACG CTGGCGAAAG CGTTTGAAGC CGAGCCGCAG
CTGCCGGAAA TCTACGAAGC GGATGAAGAA GTTAAGGCGC TGATCGACAT GGCGCGCAAA
CTGGAAGGGG TCACCCGTAA CGCCGGTAAG CACGCCGGTG GGGTGGTTAT CGCGCCGACC
AAAATTACCG ATTTTGCGCC GCTTTACTGC GATGAAGAGG GCAAACATCC GGTCACCCAG
TTTGATAAAA GCGACGTTGA ATACGCCGGA CTGGTGAAGT TCGACTTCCT TGGTTTGCGT
ACGCTCACCA TCATCAACTG GGCGCTGGAG ATGATCAACA AGCGGCGGGC GAAGAATGGC
GAGCCGCCGC TGGATATCGC TGCGATCCCG CTGGATGATA AGAAAAGCTT CGACATGCTG
CAACGCTCGG AAACCACGGC GGTATTCCAG CTTGAATCGC GCGGCATGAA GGACCTGATC
AAGCGTCTAC AACCTGACTG CTTCGAAGAT ATGATCGCCC TAGTGGCACT GTTCCGCCCC
GGTCCGTTGC AATCAGGGAT GGTGGATAAC TTTATCGACC GTAAACATGG TCGTGAAGAG
ATCTCCTATC CGGACGTACA GTGGCAGCAT GAAAGCCTGA AACCGGTACT GGAGCCAACC
TACGGCATTA TCCTGTATCA GGAACAGGTC ATGCAGATTG CGCAGGTGCT TTCTGGTTAT
ACCCTCGGTG GCGCGGATAT GCTGCGTCGT GCGATGGGTA AGAAAAAGCC GGAAGAGATG
GCTAAGCAAC GTTCTGTATT TGCTGAAGGT GCAGAAAAGA ACGGAATCAA CGCTGAACTG
GCGATGAAAA TCTTCGACCT GGTGGAGAAA TTCGCTGGTT ACGGATTTAA CAAATCGCAC
TCTGCGGCCT ATGCTTTGGT GTCATATCAA ACGTTATGGC TGAAAGCGCA CTATCCTGCG
GAGTTTATGG CGGCGGTAAT GACCGCCGAT ATGGACAACA CCGAGAAGGT GGTGGGTCTG
GTGGATGAGT GCTGGCGGAT GGGGCTGAAA ATCCTGCCAC CAGATATAAA CTCCGGTCTT
TACCATTTCC ACGTCAACGA CGACGGCGAA ATCGTGTATG GTATTGGCGC GATCAAAGGG
GTCGGTGAAG GTCCGATTGA GGCCATCATC GAAGCCCGTA ATAAAGGCGG CTACTTCCGC
GAACTGTTTG ATCTCTGCGC CCGTACCGAC ACCAAAAAGT TGAACCGTCG CGTGCTGGAA
AAACTGATCA TGTCCGGGGC GTTTGACCGT CTTGGGCCAC ATCGCGCAGC GCTGATGAAC
TCGCTGGGCG ATGCGTTAAA AGCGGCAGAT CAACACGCGA AAGCGGAAGC TATCGGTCAG
GCCGATATGT TCGGCGTGCT GGCCGAAGAG CCGGAACAAA TTGAACAATC CTACGCCAGC
TGCCAACCGT GGCCGGAGCA GGTGGTATTA GATGGGGAAC GTGAAACGTT AGGCCTGTAC
CTGACCGGAC ACCCTATCAA CCAGTATTTA AAAGAGATTG AGCGTTATGT CGGAGGCGTA
AGGCTGAAAG ACATGCACCC GACAGAACGT GGTAAAGTCA TCACGGCTGC GGGGCTCGTT
GTTGCCGCGC GGGTTATGGT CACCAAGCGC GGCAATCGTA TCGGTATCTG CACGCTGGAT
GACCGTTCCG GGCGGCTGGA AGTGATGTTG TTTACTGACG CCCTGGATAA ATACCAGCAA
TTGCTGGAAA AAGACCGCAT ACTTATCGTC AGCGGACAGG TCAGCTTTGA TGACTTCAGC
GGTGGGCTTA AAATGACCGC TCGCGAAGTG ATGGATATTG ACGAAGCCCG GGAAAAATAT
GCTCGCGGGC TTGCTATCTC GCTGACGGAC AGGCAAATTG ATGACCAGCT TTTAAACCGA
CTCCGTCAGT CTCTGGAACC CCACCGCTCT GGGACAATTC CAGTACATCT CTACTATCAG
AGGGCGGATG CACGCGCGCG GTTGCGTTTT GGCGCGACGT GGCGTGTCTC TCCGAGCGAT
CGTTTATTAA ACGATCTCCG TGGCCTCATT GGTTCGGAGC AGGTGGAACT GGAGTTTGAC
TAA
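
The gene sequence is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage. It assumes the "GC content" field means the share of G and C bases in this gene sequence, which the record does not state, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied with its original spacing.
first_row = "ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full sequence, with all rows joined, gives a figure that can be compared against the rounded value in the record.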
 
Protein sequence
MSEPRFVHLR VHSDYSMIDG LAKTAPLVKK AAALGMPALA ITDFTNLCGL VKFYGAGHGA 
GIKPIVGADF NVQCDLLGDE LTHLTVLAAN NTGYQNLTLL ISKAYQRGYG AAGPIIDRDW
LIELNEGLIL LSGGRMGDVG RSLLRGNSAL VDECVAFYEE HFPDRYFLEL IRTGRPDEES
YLHAAVELAE ARGLPVVATN DVRFIDSSDF DAHEIRVAIH DGFTLDDPKR PRNYSPQQYM
RSEEEMCELF ADIPEALANT VEIAKRCNVT VRLGEYFLPQ FPTGDMSTED YLVKRAKEGL
EERLAFLFPD EEERLKRRPE YDERLETELQ VINQMGFPGY FLIVMEFIQW SKDNGVPVGP
GRGSGAGSLV AYALKITDLD PLEFDLLFER FLNPERVSMP DFDVDFCMEK RDQVIEHVAD
MYGRDAVSQI ITFGTMAAKA VIRDVGRVLG HPYGFVDRIS KLIPPDPGMT LAKAFEAEPQ
LPEIYEADEE VKALIDMARK LEGVTRNAGK HAGGVVIAPT KITDFAPLYC DEEGKHPVTQ
FDKSDVEYAG LVKFDFLGLR TLTIINWALE MINKRRAKNG EPPLDIAAIP LDDKKSFDML
QRSETTAVFQ LESRGMKDLI KRLQPDCFED MIALVALFRP GPLQSGMVDN FIDRKHGREE
ISYPDVQWQH ESLKPVLEPT YGIILYQEQV MQIAQVLSGY TLGGADMLRR AMGKKKPEEM
AKQRSVFAEG AEKNGINAEL AMKIFDLVEK FAGYGFNKSH SAAYALVSYQ TLWLKAHYPA
EFMAAVMTAD MDNTEKVVGL VDECWRMGLK ILPPDINSGL YHFHVNDDGE IVYGIGAIKG
VGEGPIEAII EARNKGGYFR ELFDLCARTD TKKLNRRVLE KLIMSGAFDR LGPHRAALMN
SLGDALKAAD QHAKAEAIGQ ADMFGVLAEE PEQIEQSYAS CQPWPEQVVL DGERETLGLY
LTGHPINQYL KEIERYVGGV RLKDMHPTER GKVITAAGLV VAARVMVTKR GNRIGICTLD
DRSGRLEVML FTDALDKYQQ LLEKDRILIV SGQVSFDDFS GGLKMTAREV MDIDEAREKY
ARGLAISLTD RQIDDQLLNR LRQSLEPHRS GTIPVHLYYQ RADARARLRF GATWRVSPSD
RLLNDLRGLI GSEQVELEFD
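
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. Translation table 11 (bacterial, archaeal and plant plastid) assigns codons to amino acids the same way as the standard code and differs only in which codons may act as starts. Because this gene begins with ATG, a plain standard-code lookup is enough for a sketch. The helper name `translate` and the stop-at-first-stop behavior are illustrative choices, not the database's own method.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGTCTGAAC CACGTTTCGT ACACCTGCGG"))  # MSEPRFVHLR
```

The output matches the first ten residues of the protein sequence. Applied to the full gene sequence, the same function stops at the terminal TAA.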