Gene EcDH1_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1607 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1752310 
End bp1753533 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content56% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionACX39272 
Protein GI260448850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0834278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC 
ACCGGCGAGA TGGTGGAATG GCTGGCGGCA CAAGGTCATG AGGTGCGGGT CATTACCGCA
CCGCCTTACT ACCCGCAATG GCAGGTGGGC GAGAACTATT CCGCCTGGCG CTACAAACGA
GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTATG TGCCAAAACA GCCGAGCACC
CTGAAACGCC TGTTGCATCT GGGCAGTTTT GCCGTCAGCA GTTTCTTTCC GCTGATGGCG
CAACGTCGCT GGAAGCCGGA TCGCATTATT GGCGTGGTGC CAACGCTGTT TTGCGCGCCG
GGAATGCGCC TGCTGGCGAA ACTCTCTGGT GCGCGTACCG TGCTGCATAT TCAGGATTAC
GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA
CAGCTGGCAA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT
TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCGGAAAA CGTCATCTTC
TTCCCCAACT GGTCGGAAAT TGCCCGTTTT CAGCATGTTG CAGATGCCGA TGTTGATGCC
CTTCGTAACC AGCTTGACCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT
GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCGTCTGCG CGATGAACCG
CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAG
CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA
CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA
TTGCCGTCGA AACTGACCAA TATTCTGGCA GTAGGCGGTA ACGCGGTGAT TACTGCTGAA
GCCTACACAG AACTGGGGCA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTTGAA
CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATCCGTCAGG CGCTCCTGCT GCCCAAACAC
AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTCGATA AAGAGAACGT GTTACGTCAA
TTTATAAATG ATATTCGGGG ATAA
 
Protein sequence
MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR 
EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCAP
GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI
SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADADVDA LRNQLDLPDN KKIILYSGNI
GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA
LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AYTELGQLCE TFPGIAVCVE
PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG