Gene EcDH1_0072 details


Gene Information

Locus tag: EcDH1_0072
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 73946
End bp: 75223
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: Three-deoxy-D-manno-octulosonic-acid transferase domain protein
Protein accession: ACX37770
Protein GI: 260447348
COG category:
COG ID:
TIGRFAM ID:
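
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(73946, 75223)
print(length)                     # 1278
print(protein_length_aa(length))  # 425
```

Both values agree with the Gene Length and Protein Length fields listed in the record.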


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG 
CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT
TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA
ACTCTGGCGG CAATCCCGTT GGTGCGCGCG CTGCGTCATC GTTATCCTGA TTTACCGATT
ACCGTAACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT
GTTCAGCACG TTTATCTGCC GTATGATCTG CCCGATGCAC TCAACCGTTT CCTGAATAAA
GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG
CTACATAAAC GTAAAATTCC GCTGGTGATC GCTAACGCGC GACTCTCTGC CCGCTCGGCC
GCAGGTTATG CCAAACTGGG TAAATTCGTC CGTCGCTTGC TGCGTCGTAT TACGCTGATT
GCTGCGCAAA ATGAAGAAGA TGGTGCACGT TTTGTGGCGC TGGGCGCAAA AAATAATCAG
GTGACCGTTA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAGTT GGCTGCTAAA
GCCGTGACGC TGCGCCGCCA GTGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT
CACGAAGGCG AAGAGAGTGT GGTGATCGCC GCACATCAGG CATTGTTACA GCAATTCCCG
AATTTATTGC TCATCCTGGT ACCCCGTCAT CCGGAACGCT TCCCGGATGC GATTAACCTT
GTCCGCCAGG CTGGACTAAG CTATATCACA CGCTCTTCAG GGGAAGTCCC CTCCACCAGC
ACGCAGGTTG TGGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTATGG CATTGCCGAT
CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCTGCC
GCACACGCTA TTCCGGTATT GATGGGGCCG CATACTTTTA ACTTTAAAGA CATTTGCGCG
CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG
GTTTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA
GTACTGTATC AAAACCAGGG CGCGCTACAG CGTCTGCTTC AACTGCTGGA ACCTTACCTG
CCACCGAAAA CGCATTGA
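
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined into one string before any analysis. A minimal sketch, assuming the whole block is held in a Python string: `normalize_sequence` and `gc_content_percent` are hypothetical helper names, and the GC value is computed as the share of G and C bases in the cleaned sequence.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = (
    "ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG"
)
cleaned = normalize_sequence(first_line)
print(len(cleaned))                 # 60
print(gc_content_percent(cleaned))
```

Running the same two functions over the full sequence block gives a figure to compare against the GC content field of the record.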
 
Protein sequence
MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE 
TLAAIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK
VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI
AAQNEEDGAR FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST
HEGEESVVIA AHQALLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS
TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA
RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL
PPKTH
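
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it in the set of permitted start codons. The sketch below builds that codon table from the compact NCBI-style amino acid string and translates up to the first stop codon. The `translate` helper is illustrative and is not the tool that produced this record.

```python
from itertools import product

# Codon order: first base varies slowest, third base fastest, bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate("ATGCTCGAAT TGCTTTACAC CGCCCTTCTC"))  # MLELLYTALL
print(translate("CCACCGAAAA CGCATTGA"))               # PPKTH
```

The two fragments translate to the first ten and last five residues of the protein sequence shown above.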