Gene EcDH1_3173 details


Gene Information

Locus tag: EcDH1_3173
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3414524
End bp: 3415822
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: trigger factor
Protein accession: ACX40799
Protein GI: 260450377
COG category:
COG ID:
TIGRFAM ID:
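
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record; both are assumptions that happen to agree with the listed values. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 3414524
END_BP = 3415822


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for a CDS of n_bp bases, assuming one trailing stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1299, matches "Gene Length"
    print(protein_length(n_bp))  # 432, matches "Protein Length"
```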


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000000698649
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGTTT CAGTTGAAAC CACTCAAGGC CTTGGCCGCC GTGTAACGAT TACTATCGCT 
GCTGACAGCA TCGAGACCGC TGTTAAAAGC GAGCTGGTCA ACGTTGCGAA AAAAGTACGT
ATTGACGGCT TCCGCAAAGG CAAAGTGCCA ATGAATATCG TTGCTCAGCG TTATGGCGCG
TCTGTACGCC AGGACGTTCT GGGTGACCTG ATGAGCCGTA ACTTCATTGA CGCCATCATT
AAAGAAAAAA TCAATCCGGC TGGCGCACCG ACTTATGTTC CGGGCGAATA CAAGCTGGGT
GAAGACTTCA CTTACTCTGT AGAGTTTGAA GTTTATCCGG AAGTTGAACT GCAGGGTCTG
GAAGCGATCG AAGTTGAAAA ACCGATCGTT GAAGTGACCG ACGCTGACGT TGACGGCATG
CTGGATACTC TGCGTAAACA GCAGGCGACC TGGAAAGAAA AAGACGGCGC TGTTGAAGCA
GAAGACCGCG TAACCATCGA CTTCACCGGT TCTGTAGACG GCGAAGAGTT CGAAGGCGGT
AAAGCGTCTG ATTTCGTACT GGCGATGGGC CAGGGTCGTA TGATCCCGGG CTTTGAAGAC
GGTATCAAAG GCCACAAAGC TGGCGAAGAG TTCACCATCG ACGTGACCTT CCCGGAAGAA
TACCACGCAG AAAACCTGAA AGGTAAAGCA GCGAAATTCG CTATCAACCT GAAGAAAGTT
GAAGAGCGTG AACTGCCGGA ACTGACTGCA GAATTCATCA AACGTTTCGG CGTTGAAGAT
GGTTCCGTAG AAGGTCTGCG CGCTGAAGTG CGTAAAAACA TGGAGCGCGA GCTGAAGAGC
GCCATCCGTA ACCGCGTTAA GTCTCAGGCG ATCGAAGGTC TGGTAAAAGC TAACGACATC
GACGTACCGG CTGCGCTGAT CGACAGCGAA ATCGACGTTC TGCGTCGCCA GGCTGCACAG
CGTTTCGGTG GCAACGAAAA ACAAGCTCTG GAACTGCCGC GCGAACTGTT CGAAGAACAG
GCTAAACGCC GCGTAGTTGT TGGCCTGCTG CTGGGCGAAG TTATCCGCAC CAACGAGCTG
AAAGCTGACG AAGAGCGCGT GAAAGGCCTG ATCGAAGAGA TGGCTTCTGC GTACGAAGAT
CCGAAAGAAG TTATCGAGTT CTACAGCAAA AACAAAGAAC TGATGGACAA CATGCGCAAT
GTTGCTCTGG AAGAACAGGC TGTTGAAGCT GTACTGGCGA AAGCGAAAGT GACTGAAAAA
GAAACCACTT TCAACGAGCT GATGAACCAG CAGGCGTAA
 
Protein sequence
MQVSVETTQG LGRRVTITIA ADSIETAVKS ELVNVAKKVR IDGFRKGKVP MNIVAQRYGA 
SVRQDVLGDL MSRNFIDAII KEKINPAGAP TYVPGEYKLG EDFTYSVEFE VYPEVELQGL
EAIEVEKPIV EVTDADVDGM LDTLRKQQAT WKEKDGAVEA EDRVTIDFTG SVDGEEFEGG
KASDFVLAMG QGRMIPGFED GIKGHKAGEE FTIDVTFPEE YHAENLKGKA AKFAINLKKV
EERELPELTA EFIKRFGVED GSVEGLRAEV RKNMERELKS AIRNRVKSQA IEGLVKANDI
DVPAALIDSE IDVLRRQAAQ RFGGNEKQAL ELPRELFEEQ AKRRVVVGLL LGEVIRTNEL
KADEERVKGL IEEMASAYED PKEVIEFYSK NKELMDNMRN VALEEQAVEA VLAKAKVTEK
ETTFNELMNQ QA