Gene EcDH1_3740 details


Gene Information

Locus tag: EcDH1_3740
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4032879
End bp: 4034075
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: protein of unknown function DUF898 transmembrane
Protein accession: ACX41345
Protein GI: 260450923
COG category:
COG ID:
TIGRFAM ID:
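
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4032879
end_bp = 4034075

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1197 398
```

Both results match the Gene Length (1197 bp) and Protein Length (398 aa) fields.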


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAAG TTATTAATGA AATGGATGTT CCGTCCCATT CGTTTGTTTT TCATGGTACA 
GGTGAGAGAT ATTTTCTTAT TTGTGTGGTG AATGTGTTGT TAACGATTAT AACGCTAGGT
ATCTATTTAC CATGGGCATT AATGAAATGT AAGCGTTATC TTTATGCTAA TATGGAAGTT
AACGGACAAC GATTTTCTTA TGGAATTACC GGTGGGAATG TTTTTGTTAG TTGTCTTTTT
TTTGTTTTTT TCTATTTCGC AATCTTAATG ACAGTGTCAG CAGATATGCC GCTTGTTGGT
TGTGTTTTGA CTTTGTTACT GTTGGTTTTG CTTATATTTA TGGCAGCAAA AGGACTGCGT
CATCAGGCCT TAATGACCAG TCTCAACGGC GTAAGATTTA GTTTTAATTG CTCTATGAAA
GGGTTCTGGT GGGTGACCTT TTTCTTGCCG ATTTTAATGG CCATTGGGAT GGGGACTGTT
TTCTTTATCT CGACAAAGAT GCTACCTGCC AATAGTTCAA GTAGTGTTAT TATATCCATG
GTTCTGATGG CAATAGTTGG TATTGTTTCC ATTGGTATTT TTAATGGTAC TTTATATAGT
CTGGTAATGA GTTTTCTCTG GAGTAATACC AGTTTCGGTA TACATCGTTT CAAGGTGAAA
TTAGATACTA CGTATTGTAT AAAATATGCC ATTCTCGCAT TTTTAGCTTT ATTGCCTTTT
CTCGCTGTTG CTGGTTATAT TATCTTCGAT CAAATATTAA ATGCGTATGA TAGTTCTGTA
TATGCAAATG ATGACATTGA GAATTTACAG CAATTTATGG AAATGCAACG TAAAATGATA
ATCGCGCAGT TAATCTATTA TTTTGGGATT GCTGTTAGCA CAAGTTATTT AACGGTGTCT
TTGCGAAACC ATTTTATGAG CAACCTGTCA CTGAATGATG GGCGTATTCG TTTTCGCTTA
ACTTTAACGT ACCACGGTAT GCTTTATCGC ATGTGTGCGT TGGTGGTGAT ATCCGGGATT
ACGGGCGGTC TGGCTTATCC ACTGCTGAAA ATATGGATGA TTGACTGGCA GGCAAAAAAT
ACGTATTTGC TGGGCGATTT GGATGACCTT CCTTTAATCA ATAAAGAAGA ACAACCAGAT
AAAGGCTTCT TAGCCAGTAT TTCACGGGGA GTTATGCCTT CTTTACCATT TCTGTAA
 
Protein sequence
MAQVINEMDV PSHSFVFHGT GERYFLICVV NVLLTIITLG IYLPWALMKC KRYLYANMEV 
NGQRFSYGIT GGNVFVSCLF FVFFYFAILM TVSADMPLVG CVLTLLLLVL LIFMAAKGLR
HQALMTSLNG VRFSFNCSMK GFWWVTFFLP ILMAIGMGTV FFISTKMLPA NSSSSVIISM
VLMAIVGIVS IGIFNGTLYS LVMSFLWSNT SFGIHRFKVK LDTTYCIKYA ILAFLALLPF
LAVAGYIIFD QILNAYDSSV YANDDIENLQ QFMEMQRKMI IAQLIYYFGI AVSTSYLTVS
LRNHFMSNLS LNDGRIRFRL TLTYHGMLYR MCALVVISGI TGGLAYPLLK IWMIDWQAKN
TYLLGDLDDL PLINKEEQPD KGFLASISRG VMPSLPFL
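
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the grouped display could be flattened and a GC percentage computed; `ungroup` and `gc_percent` are hypothetical helper names, and the demo input is only the first line of the gene sequence, not the full gene.

```python
def ungroup(display: str) -> str:
    """Remove spaces and line breaks from a grouped sequence display."""
    return "".join(display.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGGCTCAAG TTATTAATGA AATGGATGTT CCGTCCCATT CGTTTGTTTT TCATGGTACA"
bases = ungroup(first_line)
print(len(bases), round(gc_percent(bases), 1))
```

Passing the full gene sequence through `ungroup` and then `gc_percent` would give a value comparable to the GC content field in the record.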