Gene EcDH1_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1053 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1126362 
End bp1127603 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content48% 
IMG OID 
Productintegrase family protein 
Protein accessionACX38728 
Protein GI260448306 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAA AAACCAAGCC GTTAACTGAT ACGGAAATCA AAGCCGCCAA ACCTAAAGAT 
GCCGATTACC AGCTTTATGA CGGTGACGGG CTTACTCTGT TAATCAAGTC CAGTGGCAGT
AAGCTTTGGC AATTCCGTTA CTATCGGCCT TTGACCAAGC AGCGAACCAA ACAGAGCTTC
GGTGCCTATC CTGCCGTCTC GCTTTCTGAT GCACGTAAAC TCAGAGCCGA ATCTAAAGTT
TTATTGGCGA AAGACATTGA TCCTCAGGAA CATCAGAAAG AACAGGTGAG GAATTCTCAA
GAGGCCAAAA CCAATACCTT CTTGTTAGTT GCCGAGCGTT GGTGGAATGT GAAGAAAACC
AGCGTAACAG AGGACTATGC CGACGATATC TGGCGCTCGC TTGAGAGAGA TATTTTCCCG
GCAATCGGTG ATATCAGTAT CACTGAGATT AAGGCTCATA CTCTGGTTAA AGCAGTTCAG
CCGGTTCAGG CCAGAGGTGC ATTAGAGACT GTTCGCCGCC TTTGTCAGCG TATTAACGAA
GTCATGATTT ATGCGCAGAA CACAGGCCTG ATTGATGCTG TTCCTAGTGT AAATATCGGA
AAAGCTTTCG AGAAACCGCA AAAGAAAAAC ATGCCAAGCA TCCGGCCGGA TCAACTTCCG
CAGCTAATGC ACACCATGCG TACGGCAAGT ATCAGCATGT CCACAAGATG CCTGTTCATG
TGGCAACTTC TAACCATCAC CCGCCCTGCC GAAGCTGCTG AGGCTCGATG GGATGAGATC
GATTTCAATG CTAGCGAATG GAAAATTCCT GCAGCTCGAA TGAAGATGAA CCGGGACCAT
ACGGTTCCAC TATCTGATGG GGCTCTTGCT ATTCTGGAAA TGATGAAGCC TCTCAGTGGT
GGCCGAGAAT TTATCTTTCC TAGCCGTATC AAGCCCAACC AACCAATGAA TAGCCAAACA
GTGAATGCAG CACTCAAGCG TGCTGGCTTA GGAGGTGTAC TTGTTTCACA CGGCTTGCGT
TCTATCGCCA GTACGGCACT CAATGAGGAA GGATTTCCAC CTGATGTCAT TGAAGCAGCG
CTTGCTCATG TAGACAAAAA TGAGGTGCGT CGCGCTTATA ACCGCAGTGA TTATCTTGAG
CAACGTCGTC CGATGATGCA ATGGTGGGCT GATCTCGTAA AAGCAGCAGA TAGTGGTAGC
ATCGTTTTAA CTCATTTGAG CAAAATTCGT CTTGTCGGAT AA
 
Protein sequence
MARKTKPLTD TEIKAAKPKD ADYQLYDGDG LTLLIKSSGS KLWQFRYYRP LTKQRTKQSF 
GAYPAVSLSD ARKLRAESKV LLAKDIDPQE HQKEQVRNSQ EAKTNTFLLV AERWWNVKKT
SVTEDYADDI WRSLERDIFP AIGDISITEI KAHTLVKAVQ PVQARGALET VRRLCQRINE
VMIYAQNTGL IDAVPSVNIG KAFEKPQKKN MPSIRPDQLP QLMHTMRTAS ISMSTRCLFM
WQLLTITRPA EAAEARWDEI DFNASEWKIP AARMKMNRDH TVPLSDGALA ILEMMKPLSG
GREFIFPSRI KPNQPMNSQT VNAALKRAGL GGVLVSHGLR SIASTALNEE GFPPDVIEAA
LAHVDKNEVR RAYNRSDYLE QRRPMMQWWA DLVKAADSGS IVLTHLSKIR LVG