Gene EcDH1_2258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2258 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2424836 
End bp2426881 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content54% 
IMG OID 
Productphenylacetic acid degradation protein paaN 
Protein accessionACX39905 
Protein GI260449483 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGT TAGCCAGTTT CTTATCCGGT ACCTGGCAGT CTGGCCGGGG CCGTAGCCGT 
TTGATTCACC ACGCTATTAG CGGCGAGGCG TTATGGGAAG TGACCAGTGA AGGTCTTGAT
ATGGCGGCTG CCCGCCAGTT TGCCATTGAA AAAGGTGCCC CCGCCCTTCG CGCTATGACC
TTTATCGAAC GTGCGGCGAT GCTTAAAGCG GTCGCTAAAC ATCTGCTGAG TGAAAAAGAG
CGTTTCTATG CTCTTTCTGC GCAAACAGGC GCAACGCGGG CAGACAGTTG GGTTGATATT
GAAGGTGGCA TTGGGACGTT ATTTACTTAC GCCAGCCTCG GTAGCCGGGA GCTGCCTGAC
GATACGCTGT GGCCGGAAGA TGAATTGATC CCCTTATCGA AAGAAGGTGG ATTTGCCGCG
CGCCATTTAC TGACCTCAAA GTCAGGCGTG GCAGTGCATA TTAACGCCTT TAACTTCCCC
TGCTGGGGAA TGCTGGAAAA GCTGGCACCA ACGTGGCTGG GCGGAATGCC AGCCATCATC
AAACCAGCTA CCGCGACGGC CCAACTGACT CAGGCGATGG TGAAATCAAT TGTCGATAGT
GGTCTTGTTC CCGAAGGCGC AATTAGTCTG ATCTGCGGTA GTGCTGGCGA CTTGTTGGAT
CATCTGGACA GCCAGGATGT GGTGACTTTC ACGGGGTCAG CGGCGACCGG ACAGATGCTG
CGAGTTCAGC CAAATATCGT CGCCAAATCT ATCCCCTTCA CTATGGAAGC TGATTCCCTG
AACTGCTGCG TACTGGGCGA AGATGTCACC CCGGATCAAC CGGAGTTTGC GCTGTTTATT
CGTGAAGTTG TGCGTGAGAT GACCACAAAA GCCGGGCAAA AATGTACGGC AATCCGGCGG
ATTATTGTGC CGCAGGCATT GGTTAATGCT GTCAGTGATG CTCTGGTTGC GCGATTACAG
AAAGTCGTGG TCGGTGATCC TGCTCAGGAA GGCGTGAAAA TGGGCGCACT GGTAAATGCT
GAGCAGCGTG CCGATGTGCA GGAAAAAGTG AACATATTGC TGGCTGCAGG ATGCGAGATT
CGCCTCGGTG GTCAGGCGGA TTTATCTGCT GCGGGTGCCT TCTTCCCGCC AACCTTATTG
TACTGTCCGC AGCCGGATGA AACACCGGCG GTACATGCAA CAGAAGCCTT TGGCCCTGTC
GCAACGCTGA TGCCAGCACA AAACCAGCGA CATGCTCTGC AACTGGCTTG TGCAGGCGGC
GGTAGCCTTG CGGGAACGCT GGTGACGGCT GATCCGCAAA TTGCGCGTCA GTTTATTGCC
GACGCGGCAC GTACGCATGG GCGAATTCAG ATCCTCAATG AAGAGTCGGC AAAAGAATCC
ACCGGGCATG GCTCCCCACT GCCACAACTG GTACATGGTG GGCCTGGTCG CGCAGGAGGC
GGTGAAGAAT TAGGCGGTTT ACGAGCGGTG AAACATTACA TGCAGCGAAC CGCTGTTCAG
GGTAGTCCGA CGATGCTTGC CGCTATCAGT AAACAGTGGG TGCGCGGTGC GAAAGTCGAA
GAAGATCGTA TTCATCCGTT CCGCAAATAT TTTGAGGAGC TACAACCAGG CGACAGCCTG
TTGACTCCCC GCCGCACAAT GACAGAGGCC GATATTGTTA ACTTTGCTTG CCTCAGCGGC
GATCATTTCT ATGCACATAT GGATAAGATT GCTGCTGCCG AATCTATTTT CGGTGAGCGG
GTGGTGCATG GGTATTTTGT GCTTTCTGCG GCTGCGGGTC TGTTTGTCGA TGCCGGTGTC
GGTCCGGTCA TTGCTAACTA CGGGCTGGAA AGCTTGCGTT TTATCGAACC CGTAAAGCCA
GGCGATACCA TCCAGGTGCG TCTCACCTGT AAGCGCAAGA CGCTGAAAAA ACAGCGTAGC
GCAGAAGAAA AACCAACAGG TGTGGTGGAA TGGGCTGTAG AGGTATTCAA TCAGCATCAA
ACCCCGGTGG CGCTGTATTC AATTCTGACG CTGGTGGCCA GGCAGCACGG TGATTTTGTC
GATTAA
 
Protein sequence
MQQLASFLSG TWQSGRGRSR LIHHAISGEA LWEVTSEGLD MAAARQFAIE KGAPALRAMT 
FIERAAMLKA VAKHLLSEKE RFYALSAQTG ATRADSWVDI EGGIGTLFTY ASLGSRELPD
DTLWPEDELI PLSKEGGFAA RHLLTSKSGV AVHINAFNFP CWGMLEKLAP TWLGGMPAII
KPATATAQLT QAMVKSIVDS GLVPEGAISL ICGSAGDLLD HLDSQDVVTF TGSAATGQML
RVQPNIVAKS IPFTMEADSL NCCVLGEDVT PDQPEFALFI REVVREMTTK AGQKCTAIRR
IIVPQALVNA VSDALVARLQ KVVVGDPAQE GVKMGALVNA EQRADVQEKV NILLAAGCEI
RLGGQADLSA AGAFFPPTLL YCPQPDETPA VHATEAFGPV ATLMPAQNQR HALQLACAGG
GSLAGTLVTA DPQIARQFIA DAARTHGRIQ ILNEESAKES TGHGSPLPQL VHGGPGRAGG
GEELGGLRAV KHYMQRTAVQ GSPTMLAAIS KQWVRGAKVE EDRIHPFRKY FEELQPGDSL
LTPRRTMTEA DIVNFACLSG DHFYAHMDKI AAAESIFGER VVHGYFVLSA AAGLFVDAGV
GPVIANYGLE SLRFIEPVKP GDTIQVRLTC KRKTLKKQRS AEEKPTGVVE WAVEVFNQHQ
TPVALYSILT LVARQHGDFV D