Gene EcDH1_3649 details

Gene Information

Locus tag: EcDH1_3649
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3931567
End bp: 3932961
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: restriction modification system DNA specificity domain protein
Protein accession: ACX41261
Protein GI: 260450839
COG category:
COG ID:
TIGRFAM ID:
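
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the gene length is taken to include the stop codon. The following is a minimal Python sketch of that arithmetic. The function names are illustrative and not part of the source database, and the coordinate convention is an assumption inferred from the numbers themselves.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3931567, 3932961)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

The subtraction of one codon in `protein_length_aa` only holds for a complete, unspliced CDS, which matches the "Is gene spliced: No" and "Is pseudo gene: No" fields of this record.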


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGTGCGG GGAAATTGCC GGAGGGGTGG GTTATCGCCC CAGTATCTAC GGTCACAACT 
CTAATCCGAG GAGTAACGTA TAAAAAAGAG CAGGCAATAA ATTATCTAAA AGATGATTAT
TTGCCTCTTA TCCGTGCGAA CAATATTCAG AATGGCAAGT TTGATACTAC GGACTTGGTT
TTTGTTCCTA AAAATCTTGT TAAAGAAAGT CAAAAAATAT CTCCTGAAGA TATTGTTATT
GCAATGTCAT CAGGGAGCAA ATCCGTAGTT GGTAAATCCG CACATCAGCA TCTACCATTT
GAATGTAGTT TCGGCGCATT TTGCGGTGTA TTACGTCCTG AAAAACTTAT ATTTTCTGGT
TTTATTGCTC ATTTCACAAA ATCTTCTCTT TATCGAAACA AAATTTCATC ACTTTCTGCT
GGTGCAAATA TTAATAATAT TAAGCCGGCA AGCTTTGATT TGATAAATAT ACCAATCCCA
CCACTTGCCG AACAAAAAAT CATCGCTGAA AAACTCGATA CGCTGCTGGC GCAGGTAGAC
AGCACCAAAG CACGTTTTGA GCAAATCCCA CAAATCCTGA AACGTTTTCG TCAAGCGGTA
TTGGGGGGCG CAGTTAATGG AAAATTGACA GAAAAATGGC GTAATTTTGA GCCGCAACAT
TCTGTATTTA AGAAGTTAAA TTTTGAATCT ATCTTAACTG AATTACGTAA TGGTCTTTCA
TCAAAGCCAA ATGAAAGTGG TGTTGGTCAT CCAATACTAC GCATTAGTTC TGTACGTGCT
GGCCATGTAG ATCAAAACGA TATTCGGTTT CTAGAATGTT CAGAAAGTGA ACTAAACCGC
CACAAATTAC AAGATGGAGA TCTTTTATTT ACTCGCTATA ACGGAAGTTT AGAATTTGTT
GGTGTTTGTG GGTTATTGAA AAAATTACAA CATCAAAATT TGCTATATCC TGATAAACTT
ATTCGAGCTC GATTAACCAA AGATGCTTTA CCAGAATATA TCGAAATATT TTTTTCATCC
CCCTCAGCAC GAAATGCAAT GATGAACTGC GTGAAAACAA CTTCTGGTCA AAAAGGTATT
TCAGGAAAAG ATATCAAATC CCAAGTTGTT TTATTACCTC CAGTAAAAGA ACAAGCCGAA
ATCGTTCGCC GCGTCGAGCA ACTCTTCGCC TACGCCGACA CCATAGAAAA ACAGGTCAAC
AACGCCTTAG CCCGCGTCAA CAACCTGACG CAATCCATCC TGGCAAAAGC GTTCCGTGGT
GAACTTACCG CCCAGTGGCG GGCCGAAAAC CCGGATTTGA TCAGCGGAGA AAACAGCGCC
GCCGCGTTGC TGGAAAAAAT CAAAGCTGAA CGCGCAGCTA GCGGGGGTAA AAAAGCCTCA
CGTAAAAAAT CCTGA
 
Protein sequence
MSAGKLPEGW VIAPVSTVTT LIRGVTYKKE QAINYLKDDY LPLIRANNIQ NGKFDTTDLV 
FVPKNLVKES QKISPEDIVI AMSSGSKSVV GKSAHQHLPF ECSFGAFCGV LRPEKLIFSG
FIAHFTKSSL YRNKISSLSA GANINNIKPA SFDLINIPIP PLAEQKIIAE KLDTLLAQVD
STKARFEQIP QILKRFRQAV LGGAVNGKLT EKWRNFEPQH SVFKKLNFES ILTELRNGLS
SKPNESGVGH PILRISSVRA GHVDQNDIRF LECSESELNR HKLQDGDLLF TRYNGSLEFV
GVCGLLKKLQ HQNLLYPDKL IRARLTKDAL PEYIEIFFSS PSARNAMMNC VKTTSGQKGI
SGKDIKSQVV LLPPVKEQAE IVRRVEQLFA YADTIEKQVN NALARVNNLT QSILAKAFRG
ELTAQWRAEN PDLISGENSA AALLEKIKAE RAASGGKKAS RKKS
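
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that in Python and to translate the gene sequence for comparison with the listed protein. It relies on the standard codon assignments, which translation table 11 is understood to share with the standard code (the tables differ in permitted start codons, not in amino acid assignments). The helper names `clean`, `translate` and `gc_fraction` are illustrative, not taken from the source.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing letters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGAGTGCGG GGAAATTGCC GGAGGGGTGG GTTATCGCCC CAGTATCTAC GGTCACAACT"
last_line = "CGTAAAAAAT CCTGA"
print(translate(first_line))  # MSAGKLPEGWVIAPVSTVTT
print(translate(last_line))   # RKKS
```

The two printed fragments correspond to the start and end of the protein sequence listed above. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein and the reported GC content.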