Gene EcDH1_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1159 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1248160 
End bp1249530 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionACX38833 
Protein GI260448411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.02008e-12 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTT TGGATCAGCG GCGAAATTTC TAATTTCACG
CAACCAGCTT CCGGTCACTG GTACTTTACA CTCAAAGACG ACACCGCCCA GGTACGCTGC
GCGATGTTCC GCAACAGCAA CCGCCGGGTG ACCTTCCGCC CACAGCATGG GCAACAAGTT
TTAGTTCGCG CCAATATTAC GCTCTACGAG CCGCGCGGCG ACTACCAGAT AATCGTTGAG
AGTATGCAGC CGGCCGGTGA AGGGCTGCTG CAACAGAAGT ACGAACAGCT CAAAGCGAAG
TTGCAGGCTG AAGGTTTGTT CGATCAGCAA TACAAAAAAC CACTTCCCTC CCCTGCGCAT
TGCGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCTTCTCT GCCGGTGATC ATCTACCCTG CCGCCGTTCA GGGCGATGAC
GCGCCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGCGCAATGA GTGCGACGTA
TTGATCGTCG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CGATTTTTAC CAGCCGCATT CCGGTTGTCA GCGCCGTCGG GCATGAGACG
GATGTGACCA TTGCCGATTT TGTTGCCGAT CTGCGTGCGC CAACGCCGTC TGCCGCCGCT
GAAGTAGTGA GCCGTAATCA GCAAGAGTTA CTGCGCCAGG TGCAATCGAC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGCACAC GTCGCTTTAC GCAAATTCAT
CACCGATTAC AGCAACAGCA TCCGCAGCTC CGGCTGGCAC GCCAGCAAAC CATGCTTGAG
CGCCTGCAAA AGCGAATGAG CTTTGCGCTG GAAAATCAAC TTAAGCGTAC CGGGCAACAG
CAGCAGCGGT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA CCCTGCGCGC ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCACCTCG AAGCCGTAAG CCCACTGTCA
ACGCTGGCGC GTGGATACAG CGTTACTACT GCTACTGACG GCAATGTACT GAAAAAAGTG
AAGCAAGTTA AAGCGGGTGA AATGCTAACC ACACGTCTGG AAGACGGCTG GATAGAAAGT
GAAGTAAAAA ACATCCAGCC AGTAAAAAAA TCGCGTAAAA AGGTGCATTA A
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDQQ YKKPLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPAAVQGDD
APGQIVRAIE LANQRNECDV LIVGRGGGSL EDLWSFNDER VARAIFTSRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSTRQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLQKRMSFAL ENQLKRTGQQ QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAETLRAQLS ATRERFGNAV THLEAVSPLS TLARGYSVTT ATDGNVLKKV
KQVKAGEMLT TRLEDGWIES EVKNIQPVKK SRKKVH