Gene EcDH1_1612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1612 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1759652 
End bp1760932 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content54% 
IMG OID 
Productpolysaccharide pyruvyl transferase 
Protein accessionACX39277 
Protein GI260448855 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00148194 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAC TTATTCTGGG CAACCACACT TGCGGCAATC GTGGCGACAG CGCCATCCTG 
CGCGGCTTAC TTGATGCCAT CAACATTCTC AATCCACACG CCGAAGTGGA TGTGATGAGC
CGCTATCCGG TCAGTTCTTC CTGGCTGCTC AACCGCCCGG TAATGGGCGA TCCGCTGTTC
CTGCAAATGA AACAACACAA CAGCGCGGCG GGCGTTGTCG GGCGCGTTAA AAAAGTCCTC
CGTCGCCGCT ACCAGCATCA GGTATTGCTC TCACGCGTCA CCGACACTGG CAAGCTGCGC
AATATCGCCA TCGCCCAGGG ATTCACCGAC TTCGTGCGCC TGCTGTCAGG TTACGACGCC
ATTATCCAGG TCGGCGGATC GTTTTTTGTC GATCTCTATG GCGTGCCGCA GTTTGAACAT
GCACTTTGCA CGTTTATGGC GAAAAAGCCG CTGTTTATGA TTGGTCACAG TGTTGGCCCG
TTCCAGGATG AGCAATTTAA CCAACTGGCG AACTACGTTT TTGGTCACTG CGACGCGCTG
ATCCTGCGCG AATCGGTCAG CTTTGATCTG ATGAAACGCA GCAATATCAC CACCGCAAAA
GTGGAACATG GCGTCGATAC CGCGTGGCTG GTCGATCACC ACACAGAAGA CTTCACCGCC
AGCTATGCCG TTCAACACTG GCTGGACGTT GCCGCACAAC AGAAAACGGT GGCCATTACC
CTGCGCGAAC TGGCACCGTT TGACAAACGT CTCGGCACCA CTCAACAAGC GTATGAAAAA
GCCTTTGCCG GGGTGGTCAA TCGCATTCTC GATGAAGGGT ATCAGGTGAT TGCGCTCTCC
ACCTGTACGG GCATTGACAG CTATAACAAA GACGACCGCA TGGTGGCGCT CAACCTGCGC
CAGCACATCA GCGATCCTGC CCGTTACCAC GTAGTGATGG ATGAACTCAA CGATCTGGAA
ATGGGCAAAA TTCTGGGGGC CTGTGAACTC ACCGTCGGTA CGCGCCTGCA CTCTGCCATT
ATCTCGATGA ATTTTGCCAC TCCGGCAATT GCCATCAACT ACGAACATAA ATCCGCCGGG
ATTATGCAGC AGCTGGGACT ACCGGAGATG GCAATTGATA TCCGTCATTT ATTAGACGGC
AGCCTGCAAG CGATGGTTGC GGATACCTTA GGCCAGCTTC CGGCGCTGAA TGCGCGACTT
AGTGAAGCCG TCAGTCGTGA GCGTCAGACA GGAATGCAGA TGGTGCAGTC TGTGCTTGAG
CGCATCGGGG AGGTGAAATG A
 
Protein sequence
MKLLILGNHT CGNRGDSAIL RGLLDAINIL NPHAEVDVMS RYPVSSSWLL NRPVMGDPLF 
LQMKQHNSAA GVVGRVKKVL RRRYQHQVLL SRVTDTGKLR NIAIAQGFTD FVRLLSGYDA
IIQVGGSFFV DLYGVPQFEH ALCTFMAKKP LFMIGHSVGP FQDEQFNQLA NYVFGHCDAL
ILRESVSFDL MKRSNITTAK VEHGVDTAWL VDHHTEDFTA SYAVQHWLDV AAQQKTVAIT
LRELAPFDKR LGTTQQAYEK AFAGVVNRIL DEGYQVIALS TCTGIDSYNK DDRMVALNLR
QHISDPARYH VVMDELNDLE MGKILGACEL TVGTRLHSAI ISMNFATPAI AINYEHKSAG
IMQQLGLPEM AIDIRHLLDG SLQAMVADTL GQLPALNARL SEAVSRERQT GMQMVQSVLE
RIGEVK