Gene EcDH1_3114 details


Gene Information

Locus tag: EcDH1_3114
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3348125
End bp: 3350539
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: ACX40740
Protein GI: 260450318
COG category:
COG ID:
TIGRFAM ID:
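
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3348125
end_bp = 3350539

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the listed Gene Length
print(protein_length, "aa")  # compare with the listed Protein Length
```

Under these assumptions the computed values agree with the listed Gene Length (2415 bp) and Protein Length (804 aa).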


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGCAC GTTGGTTCTG GCGCGAATGG CGTTCGCCGT CGCTATTAAT TGTCTGGCTG 
GCGCTAAGCC TGGCGGTGGC CTGCGTGCTG GCGCTGGGCA ATATCAGCGA TCGCATGGAG
AAGGGCTTAA GCCAGCAAAG CCGTGAGTTT ATGGCGGGCG ATCGGGCGTT GCGCAGTTCA
CGCGAAGTGC CGCAAGCGTG GCTGGAGGAA GCGCAAAAGC GCGGCCTGAA AGTCGGCAAG
CAGCTGACTT TCGCCACAAT GACCTTTGCA GGCGACACAC CGCAGCTGGC GAACGTCAAA
GCGGTGGATG ATATCTACCC GATGTATGGC GATCTGCAAA CTAATCCCCC TGGCCTGAAA
CCGCAGGCGG GCAGCGTATT GCTGGCCCCA CGCCTGATGG CACTGCTTAA CCTGAAAACG
GGCGACACCA TTGACGTGGG CGATGCCACC TTGCGGATTG CCGGAGAAGT GATTCAGGAA
CCGGATTCCG GTTTTAACCC CTTCCAGATG GCTCCGCGTC TGATGATGAA TCTGGCGGAT
GTCGATAAAA CCGGAGCCGT GCAGCCGGGG AGTCGGGTCA CCTGGCGTTA TAAATTCGGC
GGCAACGAGA ACCAGCTCGA CGGCTATGAG AAATGGTTGT TACCTCAGCT TAAACCCGAA
CAACGCTGGT ACGGTCTGGA ACAGGACGAA GGCGCGCTGG GGCGATCGAT GGAACGCTCG
CAACAGTTCC TGCTGCTTTC GGCGCTTCTG ACCTTGCTGC TGGCAGTGGC AGCGGTGGCG
GTAGCGATGA ATCATTACTG TCGCAGTCGC TACGATCTGG TGGCGATCCT CAAAACGCTG
GGGGCAGGGC GAGCGCAACT GCGTAAGCTA ATCGTCGGTC AGTGGTTGAT GGTGCTGACG
CTTTCAGCCG TTACCGGTGG GGCCATAGGC CTGTTGTTCG AAAACGTGTT GATGGTGCTG
CTCAAGCCCG TTCTGCCTGC TGCACTACCG CCAGCCAGCC TCTGGCCGTG GCTGTGGGCG
CTTGGCACCA TGACGGTCAT CTCGCTGCTG GTGGGGCTAC GACCATATCG CTTGTTGCTG
GCAACGCAGC CTTTACGCGT ATTACGTAAT GATGTGGTAG CGAACGTCTG GCCGCTGAAG
TTTTATCTGC CGATTGTCAG TGTGGTGGTT GTGCTGCTGC TCGCCGGATT AATGGGTGGC
AGCATGCTGC TTTGGGCGGT GCTGGCGGGC GCGGTAGTAC TGGCTTTGCT GTGCGGTGTG
CTGGGCTGGA TGCTGCTGAA TGTACTTCGC CGCATGACGC TGAAATCGCT GCCTCTGCGC
CTGGCGGTTA GCCGCCTGTT ACGTCAGCCG TGGTCAACGT TAAGTCAGCT TTCGGCATTT
TCGCTCTCCT TTATGCTGCT GGCACTGCTG CTGGTGTTGC GTGGCGATCT GCTCGACCGC
TGGCAGCAGC AGCTACCTCC AGAAAGCCCG AACTACTTTT TAATTAACAT CGCCACAGAA
CAGGTAGCAC CGCTAAAAGC GTTCCTCGCG GAACATCAGA TAGTCCCGGA ATCGTTTTAT
CCGGTGGTGC GGGCGCGGCT GACGGCGATT AACGATAAGC CGACAGAAGG TAATGAAGAT
GAGGCGCTTA ACCGCGAACT CAATCTTACC TGGCAAAATA CGCGGCCCGA TCATAATCCG
ATTGTCGCCG GTAACTGGCC GCCAAAAGCC GATGAAGTGT CGATGGAAGA GGGGCTGGCG
AAACGCTTAA ACGTTGCCCT CGGTGATACC GTGACTTTTA TGGGCGATAC CCAGGAGTTC
CGCGCTAAAG TGACCAGCCT GCGCAAAGTG GACTGGGAAA GTCTGCGGCC TAATTTCTAT
TTTATTTTCC CTGAAGGGGC ATTAGACGGG CAACCGCAGA GCTGGCTTAC CAGTTTCCGC
TGGGAGAATG GCAACGGCAT GTTGACACAA CTCAACCGCC AGTTCCCGAC CATTAGCCTG
TTAGATATTG GCGCGATTTT AAAACAGGTC GGTCAGGTGC TGGAGCAGGT AAGTCGGGCG
CTGGAAGTGA TGGTGGTACT GGTCACCGCC TGCGGTATGT TGCTGTTGCT GGCACAGGTG
CAGGTGGGAA TGCGTCAGCG TCATCAGGAG CTGGTGGTGT GGCGCACACT CGGTGCGGGG
AAAAAACTGC TGCGTACCAC GTTGTGGTGT GAGTTCGCCA TGCTTGGGTT TGTTTCCGGC
CTGGTGGCCG CAATTGGTGC GGAAACGGCG CTGGCAGTGT TGCAGGCGAA AGTGTTTGAT
TTTCCGTGGG AGCCAGACTG GCGATTGTGG ATTGTTCTGC CGTGCAGCGG AGCGCTGCTG
CTGTCGCTTT TCGGCGGCTG GCTGGGTGCG CGACTGGTTA AGGGTAAGGC GCTGTTCAGG
CAGTTTGCGG GGTGA
 
Protein sequence
MIARWFWREW RSPSLLIVWL ALSLAVACVL ALGNISDRME KGLSQQSREF MAGDRALRSS 
REVPQAWLEE AQKRGLKVGK QLTFATMTFA GDTPQLANVK AVDDIYPMYG DLQTNPPGLK
PQAGSVLLAP RLMALLNLKT GDTIDVGDAT LRIAGEVIQE PDSGFNPFQM APRLMMNLAD
VDKTGAVQPG SRVTWRYKFG GNENQLDGYE KWLLPQLKPE QRWYGLEQDE GALGRSMERS
QQFLLLSALL TLLLAVAAVA VAMNHYCRSR YDLVAILKTL GAGRAQLRKL IVGQWLMVLT
LSAVTGGAIG LLFENVLMVL LKPVLPAALP PASLWPWLWA LGTMTVISLL VGLRPYRLLL
ATQPLRVLRN DVVANVWPLK FYLPIVSVVV VLLLAGLMGG SMLLWAVLAG AVVLALLCGV
LGWMLLNVLR RMTLKSLPLR LAVSRLLRQP WSTLSQLSAF SLSFMLLALL LVLRGDLLDR
WQQQLPPESP NYFLINIATE QVAPLKAFLA EHQIVPESFY PVVRARLTAI NDKPTEGNED
EALNRELNLT WQNTRPDHNP IVAGNWPPKA DEVSMEEGLA KRLNVALGDT VTFMGDTQEF
RAKVTSLRKV DWESLRPNFY FIFPEGALDG QPQSWLTSFR WENGNGMLTQ LNRQFPTISL
LDIGAILKQV GQVLEQVSRA LEVMVVLVTA CGMLLLLAQV QVGMRQRHQE LVVWRTLGAG
KKLLRTTLWC EFAMLGFVSG LVAAIGAETA LAVLQAKVFD FPWEPDWRLW IVLPCSGALL
LSLFGGWLGA RLVKGKALFR QFAG
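
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the blocks might be joined, the GC fraction computed, and the DNA translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons (the tables differ in permitted start codons). Only the first and last rows of the gene sequence are used here, so the result is a spot check rather than a full translation.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(1 for base in dna if base in "GC") / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGATTGCAC GTTGGTTCTG GCGCGAATGG CGTTCGCCGT CGCTATTAAT TGTCTGGCTG"
last_row = "CAGTTTGCGG GGTGA"

n_terminus = translate(clean_sequence(first_row))
c_terminus = translate(clean_sequence(last_row))
print(n_terminus)
print(c_terminus)
```

The two translated fragments can be compared against the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the listed GC content.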