Gene EcDH1_1062 details


Gene Information

Locus tag: EcDH1_1062
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1133702
End bp: 1134943
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: protein of unknown function DUF21
Protein accession: ACX38736
Protein GI: 260448314
COG category:
COG ID:
TIGRFAM ID:
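
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1242 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1133702, 1134943)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```

Both results agree with the Gene Length and Protein Length fields of this record.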


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 7.90091e-08
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGTCA TTTCAGCCTA TTTTTCCGGG TCCGAAACCG GAATGATGAC CCTCAACCGC 
TATCGTCTGC GACATATGGC GAAACAGGGT AATCGCTCGG CCAAACGCGT CGAAAAATTG
CTGCGTAAGC CAGACCGCCT GATAAGCCTG GTGTTAATCG GCAATAACCT GGTCAATATT
CTTGCCTCCG CGCTCGGCAC TATTGTTGGG ATGCGTTTGT ACGGCGATGC GGGCGTGGCA
ATTGCGACTG GTGTGCTGAC TTTTGTCGTA CTGGTATTTG CTGAGGTATT GCCGAAAACC
ATTGCCGCGC TGTACCCGGA AAAAGTCGCT TATCCGAGTA GTTTTCTGCT GGCTCCGCTG
CAAATTTTGA TGATGCCGCT GGTCTGGTTG CTGAATGCTA TCACCCGTAT GCTGATGCGC
ATGATGGGTA TCAAAACCGA TATCGTGGTT AGCGGCTCTT TGAGCAAAGA AGAGTTGCGC
ACTATCGTGC ACGAATCACG CTCACAAATT TCCCGTCGCA ATCAGGATAT GCTGCTGTCG
GTGCTCGATC TGGAAAAAAT GACCGTTGAT GACATCATGG TGCCGCGCAG TGAAATTATC
GGTATTGATA TCAACGATGA CTGGAAATCG ATTCTGCGCC AACTCTCCCA CTCACCTCAC
GGGCGCATCG TGCTCTACCG TGATTCGCTG GACGACGCCA TCAGTATGCT GCGAGTACGT
GAAGCCTGGC GGTTGATGTC GGAGAAAAAA GAGTTCACCA AAGAAACCAT GCTGCGCGCC
GCGGACGAGA TCTATTTTGT GCCGGAAGGT ACGCCGCTCA GCACGCAGTT GGTAAAGTTT
CAGCGCAACA AAAAGAAAGT CGGCCTGGTC GTCAACGAGT ATGGAGACAT TCAGGGGCTG
GTGACGGTTG AAGATATTCT GGAAGAGATT GTCGGCGATT TCACTACGTC GATGTCGCCA
ACACTTGCCG AAGAGGTCAC GCCGCAAAAC GACGGTTCGG TGATTATCGA TGGCACCGCC
AACGTGCGGG AAATCAACAA AGCCTTTAAC TGGCATCTAC CGGAAGATGA TGCCCGCACG
GTTAATGGCG TCATTCTTGA GGCACTGGAA GAGATCCCTG TCGCAGGCAC CCGCGTGCGT
ATTGGCGAGT ACGATATCGA TATTCTCGAC GTACAGGACA ATATGATTAA GCAGGTAAAA
GTTTTTCCTG TGAAACCGCT GCGCGAGAGT GTGGCGGAGT AA
 
Protein sequence
MVVISAYFSG SETGMMTLNR YRLRHMAKQG NRSAKRVEKL LRKPDRLISL VLIGNNLVNI 
LASALGTIVG MRLYGDAGVA IATGVLTFVV LVFAEVLPKT IAALYPEKVA YPSSFLLAPL
QILMMPLVWL LNAITRMLMR MMGIKTDIVV SGSLSKEELR TIVHESRSQI SRRNQDMLLS
VLDLEKMTVD DIMVPRSEII GIDINDDWKS ILRQLSHSPH GRIVLYRDSL DDAISMLRVR
EAWRLMSEKK EFTKETMLRA ADEIYFVPEG TPLSTQLVKF QRNKKKVGLV VNEYGDIQGL
VTVEDILEEI VGDFTTSMSP TLAEEVTPQN DGSVIIDGTA NVREINKAFN WHLPEDDART
VNGVILEALE EIPVAGTRVR IGEYDIDILD VQDNMIKQVK VFPVKPLRES VAE
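
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks (GC fraction, and whether the nucleotide and protein lengths agree). All function names are hypothetical. The 52% GC content listed in the record is not recomputed here; only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(1 for base in dna if base in "GC") / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """True if dna is one codon per residue plus one terminal stop codon."""
    stop_codons = ("TAA", "TAG", "TGA")
    return len(dna) == 3 * (len(protein) + 1) and dna[-3:] in stop_codons


# First displayed line of the gene sequence in this record.
first_line = join_blocks(
    "ATGGTGGTCA TTTCAGCCTA TTTTTCCGGG TCCGAAACCG GAATGATGAC CCTCAACCGC"
)
print(len(first_line))  # 60
print(first_line[:3])   # ATG
```

To check the full record, pass the complete gene and protein sequences through `join_blocks` and then call `lengths_consistent` on the two results.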