Gene EcDH1_3261 details


Gene Information

Locus tag: EcDH1_3261
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3503436
End bp: 3504527
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: transcriptional regulator, LacI family
Protein accession: ACX40885
Protein GI: 260450463
COG category:
COG ID:
TIGRFAM ID:
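
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3503436
end_bp = 3504527
gene_length_bp = 1092
protein_length_aa = 363


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions, 3504527 - 3503436 + 1 gives 1092 bp, and 1092 / 3 - 1 gives 363 aa, which agrees with the listed values.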


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000862656
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGAATG TGAAACCAGT AACGTTATAC GATGTCGCAG AGTATGCCGG TGTCTCTTAT 
CAGACCGTTT CCCGCGTGGT GAACCAGGCC AGCCACGTTT CTGCGAAAAC GCGGGAAAAA
GTGGAAGCGG CGATGGCGGA GCTGAATTAC ATTCCCAACC GCGTGGCACA ACAACTGGCG
GGCAAACAGT CGTTGCTGAT TGGCGTTGCC ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG
CAAATTGTCG CGGCGATTAA ATCTCGCGCC GATCAACTGG GTGCCAGCGT GGTGGTGTCG
ATGGTAGAAC GAAGCGGCGT CGAAGCCTGT AAAGCGGCGG TGCACAATCT TCTCGCGCAA
CGCGTCAGTG GGCTGATCAT TAACTATCCG CTGGATGACC AGGATGCCAT TGCTGTGGAA
GCTGCCTGCA CTAATGTTCC GGCGTTATTT CTTGATGTCT CTGACCAGAC ACCCATCAAC
AGTATTATTT TCTCCCATGA AGACGGTACG CGACTGGGCG TGGAGCATCT GGTCGCATTG
GGTCACCAGC AAATCGCGCT GTTAGCGGGC CCATTAAGTT CTGTCTCGGC GCGTCTGCGT
CTGGCTGGCT GGCATAAATA TCTCACTCGC AATCAAATTC AGCCGATAGC GGAACGGGAA
GGCGACTGGA GTGCCATGTC CGGTTTTCAA CAAACCATGC AAATGCTGAA TGAGGGCATC
GTTCCCACTG CGATGCTGGT TGCCAACGAT CAGATGGCGC TGGGCGCAAT GCGCGCCATT
ACCGAGTCCG GGCTGCGCGT TGGTGCGGAT ATCTCGGTAG TGGGATACGA CGATACCGAA
GACAGCTCAT GTTATATCCC GCCGTTAACC ACCATCAAAC AGGATTTTCG CCTGCTGGGG
CAAACCAGCG TGGACCGCTT GCTGCAACTC TCTCAGGGCC AGGCGGTGAA GGGCAATCAG
CTGTTGCCCG TCTCACTGGT GAAAAGAAAA ACCACCCTGG CGCCCAATAC GCAAACCGCC
TCTCCCCGCG CGTTGGCCGA TTCATTAATG CAGCTGGCAC GACAGGTTTC CCGACTGGAA
AGCGGGCAGT GA
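
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content field. The helper names are hypothetical, and the record does not say how its own GC value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, as printed in this record.
first_row = "GTGGTGAATG TGAAACCAGT AACGTTATAC GATGTCGCAG AGTATGCCGG TGTCTCTTAT"
first_row_gc = gc_percent(first_row)
```

Passing the full 1092-base sequence to `gc_percent` in the same way gives a figure that can be compared with the listed GC content.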
 
Protein sequence
MVNVKPVTLY DVAEYAGVSY QTVSRVVNQA SHVSAKTREK VEAAMAELNY IPNRVAQQLA 
GKQSLLIGVA TSSLALHAPS QIVAAIKSRA DQLGASVVVS MVERSGVEAC KAAVHNLLAQ
RVSGLIINYP LDDQDAIAVE AACTNVPALF LDVSDQTPIN SIIFSHEDGT RLGVEHLVAL
GHQQIALLAG PLSSVSARLR LAGWHKYLTR NQIQPIAERE GDWSAMSGFQ QTMQMLNEGI
VPTAMLVAND QMALGAMRAI TESGLRVGAD ISVVGYDDTE DSSCYIPPLT TIKQDFRLLG
QTSVDRLLQL SQGQAVKGNQ LLPVSLVKRK TTLAPNTQTA SPRALADSLM QLARQVSRLE
SGQ
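
The protein sequence follows from the gene sequence under translation table 11, which uses the standard codon assignments and additionally allows alternative start codons such as GTG to be read as methionine at the first position. The sketch below is a minimal illustration of that rule, not the tool that produced this record. The `START_CODONS` set is a deliberately small assumed subset, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; the first codon is read as M if it is a start."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single trailing stop; internal stops stay visible as '*'.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence give the first 10 residues.
head = translate_cds("GTGGTGAATG TGAAACCAGT AACGTTATAC")  # "MVNVKPVTLY"
```

The leading GTG codon would encode valine in an internal position, which is why the protein begins with M rather than V.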