Gene EcDH1_3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3338 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3590183 
End bp3591325 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content62% 
IMG OID 
ProductIntegrase catalytic region 
Protein accessionACX40960 
Protein GI260450538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.858997 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTGGG ATGCGAGAGA TACCATGTCA TTACGTACTG AGTTTGTTTT GTTCGCCTCG 
CAGGACGGGG CGAACATCCG TTCCCTCTGC CGTCGCTTCG GCATTTCACC TGCCACCGGC
TACAAGTGGC TCCAGCGCTG GGCTCAGGAA GGTGCCGCCG GTCTTCAGGA CCGCCCGCGC
ATTCCGCACC ATTCCCCGAA CCGCTCATCT GACGACATCA CGGCCCTGCT GCGTATGGCC
CATGACCGTC ATGAACGCTG GGGAGCCCGC AAGATTAAGC GCTGGCTCGA GGACCAGGGG
CACACCATGC CCGCCTTCAG CACCGTCCAT AACCTGATGG CCCGCCATGG CCTGCTGCCG
GGCGCTTCAC CGGGCATTCC CGCCACGGGC CGGTTCGAAC ACGACGCGCC GAACCGCCTC
TGGCAGATGG ATTTTAAGGG CCACTTTCCT TTTGGCGGTG GACGCTGCCA TCCGCTCACC
CTGCTGGACG ACCACTCCCG TTTTTCCCTG TGCCTGGCGC ACTGTACCGA TGAACGGCGC
GAGACCGTGC AGCAGCAGCT GGTCAGCGTG TTTGAGCGTT ACGGCCTGCC GGACCGGATG
ACCATGGATA ACGGCTCACC GTGGGGCGAC ACCACCGGCA CCTGGACGGC GCTGGAGCTG
TGGCTGATGC GCCTGGGTAT TCGGGTGGGG CACTCCCGGC CTTATCATCC GCAGACGCAG
GGGAAGCTGG AGCGTTTTCA CCGCAGCCTG AAGGCGGAAG TGCTGCAGGG AAAATGGTTC
GCAGACAGCG GTGAACTGCA GCGCGCCTTC GACCACTGGC GGACGGTCTA TAACCTTGAA
CGCCCGCACG AGGCGCTGGA TATGGCGGTA CCGGGCTCGC GGTATCAGCC GTCAGCGCGG
CAGTACAGCG GCAACACAAC GCCCCCGGAA TACGATGAAG GGGTGATGGT CAGGAAAGTG
GATATCAGCG GAAAGCTGAG CGTGAAAGGG GTAAGTCTGA GCGCAGGCAA GGCGTTCAGG
GGAGAACGGG TCGGGCTGAA GGAGATGCAG GAAGACGGCA GCTACGAGGT GTGGTGGTAC
AGCACGAAAG TGGGGGTGAT CGACCTGAAG AAAAAGTCGA TCACCATGGG TAAAGGATGT
TAA
 
Protein sequence
MPWDARDTMS LRTEFVLFAS QDGANIRSLC RRFGISPATG YKWLQRWAQE GAAGLQDRPR 
IPHHSPNRSS DDITALLRMA HDRHERWGAR KIKRWLEDQG HTMPAFSTVH NLMARHGLLP
GASPGIPATG RFEHDAPNRL WQMDFKGHFP FGGGRCHPLT LLDDHSRFSL CLAHCTDERR
ETVQQQLVSV FERYGLPDRM TMDNGSPWGD TTGTWTALEL WLMRLGIRVG HSRPYHPQTQ
GKLERFHRSL KAEVLQGKWF ADSGELQRAF DHWRTVYNLE RPHEALDMAV PGSRYQPSAR
QYSGNTTPPE YDEGVMVRKV DISGKLSVKG VSLSAGKAFR GERVGLKEMQ EDGSYEVWWY
STKVGVIDLK KKSITMGKGC