Gene EcDH1_4177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4177 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4526588 
End bp4527784 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID 
ProductHemY protein 
Protein accessionACX41777 
Protein GI260451355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.849055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAG TGTTATTGCT CTTTGTGTTG CTGATTGCGG GGATCGTGGT TGGCCCGATG 
ATTGCCGGCC ATCAGGGTTA TGTGCTGATC CAGACCGACA ACTACAATAT CGAAACCAGC
GTCACGGGCC TGGCGATCAT ATTGATTCTG GCGATGGTAG TGCTGTTTGC CATTGAGTGG
CTACTGCGGC GGATCTTCCG CACTGGCGCG CACACCCGTG GGTGGTTTGT CGGACGTAAG
CGTCGCCGTG CACGTAAGCA GACCGAACAG GCGCTGCTGA AACTGGCGGA AGGCGATTAT
CAGCAAGTTG AAAAGCTGAT GGCGAAAAAT GCCGATCACG CGGAACAACC GGTGGTGAAC
TATCTACTGG CTGCCGAAGC CGCGCAACAA CGTGGTGATG AAGCACGCGC CAACCAACAT
CTGGAACGCG CAGCGGAGCT GGCCGGCAAC GACACCATTC CGGTAGAAAT CACCCGCGTA
CGTCTGCAAC TGGCCCGTAA TGAAAACCAT GCTGCACGCC ACGGCGTGGA TAAGCTGCTG
GAAGTTACGC CACGCCATCC GGAAGTATTA CGTCTGGCGG AACAGGCGTA TATCCGCACA
GGTGCATGGA GTTCGCTGCT GGATATTATC CCATCAATGG CGAAAGCCCA TGTTGGTGAT
GAAGAACATC GTGCAATGCT GGAACAACAG GCATGGATTG GCCTGATGGA TCAGGCGCGT
GCCGATAACG GTAGCGAAGG TTTGCGTAAC TGGTGGAAAA ACCAAAGCCG GAAAACGCGT
CATCAGGTAG CGTTGCAGGT GGCAATGGCG GAACATCTTA TTGAATGTGA CGATCATGAT
ACTGCCCAGC AAATTATCAT CGATGGCCTG AAACGCCAGT ACGACGATCG CCTACTGCTG
CCGATTCCTC GACTGAAAAC AAACAATCCG GAACAGCTGG AAAAAGTGCT GCGCCAGCAA
ATCAAAAACG TCGGCGATCG CCCGCTGTTG TGGAGCACAC TGGGCCAGTC ACTGATGAAG
CACGGAGAAT GGCAGGAAGC ATCGCTCGCC TTCCGCGCAG CGCTGAAACA ACGTCCGGAC
GCCTACGATT ACGCATGGCT TGCCGACGCG CTGGACAGAC TGCACAAGCC GGAAGAAGCT
GCAGCTATGC GTCGCGACGG TTTGATGTTA ACGTTGCAGA ATAACCCGCC ACAGTAG
 
Protein sequence
MLKVLLLFVL LIAGIVVGPM IAGHQGYVLI QTDNYNIETS VTGLAIILIL AMVVLFAIEW 
LLRRIFRTGA HTRGWFVGRK RRRARKQTEQ ALLKLAEGDY QQVEKLMAKN ADHAEQPVVN
YLLAAEAAQQ RGDEARANQH LERAAELAGN DTIPVEITRV RLQLARNENH AARHGVDKLL
EVTPRHPEVL RLAEQAYIRT GAWSSLLDII PSMAKAHVGD EEHRAMLEQQ AWIGLMDQAR
ADNGSEGLRN WWKNQSRKTR HQVALQVAMA EHLIECDDHD TAQQIIIDGL KRQYDDRLLL
PIPRLKTNNP EQLEKVLRQQ IKNVGDRPLL WSTLGQSLMK HGEWQEASLA FRAALKQRPD
AYDYAWLADA LDRLHKPEEA AAMRRDGLML TLQNNPPQ