Gene EcDH1_0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0019 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp17489 
End bp19150 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content53% 
IMG OID 
ProductYidE/YbjL duplication 
Protein accessionACX37719 
Protein GI260447297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA TAGCATTAAC GGTCAGTATT CTGGCTTTGG TGGCAGTCGT CGGTTTGTTT 
ATCGGCAACG TCAAATTTCG CGGCATAGGA TTAGGTATTG GCGGCGTGCT GTTTGGTGGG
ATCATCGTCG GCCATTTTGT TTCTCAGGCG GGGATGACAT TAAGTAGCGA TATGCTGCAT
GTTATTCAGG AATTTGGCCT GATCCTGTTC GTTTATACTA TCGGGATTCA GGTAGGGCCG
GGCTTCTTTG CCTCATTGCG CGTCTCCGGA TTACGCCTCA ACCTGTTTGC TGTTCTGATC
GTCATCATCG GTGGTCTGGT TACCGCCATC CTGCATAAAC TGTTTGATAT TCCACTGCCG
GTAGTGCTGG GGATTTTCTC CGGTGCGGTT ACCAATACGC CAGCGCTGGG GGCAGGGCAG
CAGATTTTGC GCGACCTGGG TACACCAATG GAAATGGTCG ATCAGATGGG GATGAGTTAC
GCGATGGCGT ATCCATTCGG CATTTGCGGG ATTTTGTTCA CCATGTGGAT GTTGCGGGTT
ATTTTCCGCG TCAATGTCGA GACAGAAGCT CAGCAGCACG AGTCTTCACG CACCAATGGC
GGCGCGCTGA TCAAGACTAT CAATATTCGC GTTGAGAACC CTAACCTGCA TGATTTAGCC
ATTAAAGATG TACCGATTCT CAACGGCGAC AAAATTATCT GCTCGCGTCT GAAACGCGAA
GAAACCCTAA AAGTTCCTTC GCCAGATACC ATTATCCAAC TGGGCGATTT GCTGCATCTG
GTGGGTCAGC CAGCGGATTT ACATAATGCG CAACTGGTGA TTGGTCAGGA GGTCGATACT
TCGCTGTCCA CGAAAGGCAC TGATTTGCGC GTCGAGCGTG TGGTGGTCAC CAATGAAAAC
GTGCTCGGAA AACGTATTCG CGACCTGCAC TTTAAAGAAC GCTATGACGT TGTTATCTCG
CGCCTGAACC GTGCCGGGGT CGAACTGGTC GCCAGTGGCG ATATCAGCCT GCAGTTCGGC
GATATCCTCA ATCTGGTGGG GCGTCCGTCC GCAATTGATG CCGTTGCCAA TGTGCTGGGG
AATGCGCAGC AAAAACTGCA ACAGGTTCAG ATGCTGCCAG TGTTTATTGG CATCGGGCTA
GGCGTATTGT TAGGTTCTAT TCCCGTCTTT GTGCCAGGAT TCCCGGCCGC GTTGAAACTG
GGGCTGGCGG GCGGACCGCT GATTATGGCG TTGATCCTCG GGCGTATCGG CAGTATCGGC
AAGCTGTACT GGTTTATGCC GCCAAGCGCC AACCTCGCGC TGCGGGAGCT GGGGATCGTG
CTGTTCCTCT CGGTCGTTGG TCTGAAATCT GGTGGGGATT TTGTGAATAC CCTGGTCAAT
GGCGAAGGGC TAAGCTGGAT TGGTTATGGT GCCCTGATCA CCGCCGTTCC GCTGATTACT
GTTGGCATTC TGGCGCGGAT GTTAGCCAAA ATGAATTACC TGACCATGTG CGGGATGCTG
GCAGGTTCCA TGACCGATCC TCCGGCGCTG GCGTTTGCTA ATAATCTTCA TCCAACCAGC
GGTGCGGCGG CGCTCTCTTA CGCCACTGTC TATCCGTTGG TAATGTTCCT GCGCATTATC
ACCCCCCAAT TACTGGCGGT GCTCTTCTGG AGTATCGGTT AA
 
Protein sequence
MSDIALTVSI LALVAVVGLF IGNVKFRGIG LGIGGVLFGG IIVGHFVSQA GMTLSSDMLH 
VIQEFGLILF VYTIGIQVGP GFFASLRVSG LRLNLFAVLI VIIGGLVTAI LHKLFDIPLP
VVLGIFSGAV TNTPALGAGQ QILRDLGTPM EMVDQMGMSY AMAYPFGICG ILFTMWMLRV
IFRVNVETEA QQHESSRTNG GALIKTINIR VENPNLHDLA IKDVPILNGD KIICSRLKRE
ETLKVPSPDT IIQLGDLLHL VGQPADLHNA QLVIGQEVDT SLSTKGTDLR VERVVVTNEN
VLGKRIRDLH FKERYDVVIS RLNRAGVELV ASGDISLQFG DILNLVGRPS AIDAVANVLG
NAQQKLQQVQ MLPVFIGIGL GVLLGSIPVF VPGFPAALKL GLAGGPLIMA LILGRIGSIG
KLYWFMPPSA NLALRELGIV LFLSVVGLKS GGDFVNTLVN GEGLSWIGYG ALITAVPLIT
VGILARMLAK MNYLTMCGML AGSMTDPPAL AFANNLHPTS GAAALSYATV YPLVMFLRII
TPQLLAVLFW SIG