Gene EcDH1_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2274 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2446067 
End bp2449429 
Gene Length3363 bp 
Protein Length1120 aa 
Translation table11 
GC content53% 
IMG OID 
ProductProphage tail fibre domain protein 
Protein accessionACX39921 
Protein GI260449499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.817873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC 
ACAATCCAGC TGAAAGCAAA ACGTAACAGC ACCACGGTGG TGGTGAACAC GCTGGCCTCA
GAAAATCCGG ATGAAGCCGG GCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT
ATTCTGTTGG TGGAAGGATT CCCGCCGTCA CATGCCGGGA CCATTACCGT GTATGAAGAT
TCTCAACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGCCCGTCCG
GAGGCACTGC GCCGTTTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCGGTG
GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCAGTGATG CCAGCACATC AGCCCGTGAG
GCGGCAACCC ATGCGGCTGA TGCTGCGGAC TCAGCACGCG CAGCCAGCAC GTCAGCCGGA
CAGGCCGCGT CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCC
ACTGAAGCAT CAAAAAGTGC TGCCGCTGCA GAGTCCTCAA AAAGCGCGGC GGCCACCAGT
GCCGGTGCGG CGAAAACGTC AGAAACGAAT GCTTCAGCGT CACTACAATC AGCAGCCACA
TCTGCATCCA CCGCGACCAC GAAGGCATCA GAAGCTGCGA CCTCGGCCCG GGATGCGGCG
GCCTCAAAAG AAGCGGCAAA ATCATCAGAA ACGAACGCAT CATCAAGCGC CAGTAGTGCA
GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCGG CAAAAACGTC CGAGACGAAC
GCCAGGTCTT CTGAAACGGC AGCGGGACAG AGCGCCTCGG CTGCGGCAGG CTCAAAAACA
GCGGCTGCGT CGTCTGCCAG TGCAGCGTCA ACAAGTGCCG GGCAGGCCTC AGCCAGTGCC
ACCGCCGCCG GAAAATCGGC AGAAAGCGCC GCATCGTCTG CTTCAACAGC CACAACGAAG
GCTGGCGAAG CCACTGAACA GGCCAGCGCA GCAGCGAGGT CTGCTTCCGC AGCGAAGACA
TCCGAAACGA ACGCGAAAGC GTCGGAAACA AGCGCAGAAT CCTCAAAAAC GGCTGCCGCA
TCGTCAGCCA GTTCGGCGGC GTCATCGGCA TCATCGGCGT CTGCTTCAAA AGATGAGGCG
ACCAGACAAG CGTCAGCAGC GAAGAGCAGC GCCACGACGG CATCCACGAA GGCGACAGAG
GCTGCTGGCA GTGCGACGGC GGCAGCTCAG AGCAAAAGTA CGGCGGAATC CGCGGCAACG
CGCGCCGAGA CAGCAGCTAA ACGGGCAGAG GATATTGCAT CCGCCGTGGC GCTTGAGGAT
GCAAGTACGA CGAAAAAGGG GATAGTACAG CTCAGCAGTG CGACCAACAG TACGTCTGAA
ACGCTGGCGG CAACGCCAAA GGCAGTAAAA TCAGCCTATG ACAATGCAGA GAAACGTCTG
CAGAAAGACC AGAACGGCGC TGATATACCC GATAAGGGAT GCTTCCTGAA CAACATTAAC
GCGGTCAGTA AAACAGACTT TGCTGATAAG CGTGGTATGC GTTATGTGCG GGTTAACGCT
CCTGCAGGTG CAACATCTGG AAAATATTAC CCTGTTGTTG TTATGCGTTC TGCTGGCTCA
GTAAGCGAAC TGGCATCAAG AGTCATTATC ACCACGGCAA CGCGAACCGC AGGCGATCCG
ATGAATAACT GCGAGTTTAA CGGATTTGTT ATGCCTGGTG GCTGGACTGA CAGGGGGCGT
TATGCTTATG GCATGTTCTG GCAATATCAA AACAATGAAC GAGCCATTCA CTCAATAATG
ATGAGTAATA AGGGCGATGA TTTGCGCTCT GTGTTCTATG TTGATGGCGC TGCTTTCCCT
GTTTTTGCGT TTATTGAAGA TGGCCTGTCA ATATCCGCAC CTGGTGCTGA TCTCGTTGTT
AATGATACGA CCTATAAGTT TGGGGCAACA AATCCGGCGA CTGAATGTAT CGCGGCGGAC
GTTATCCTTG ATTTTAAGAG TGGGCGTGGT TTTTATGAGT CTCATTCGTT AATCGTTAAC
GATAACTTGT CGTGCAAAAA ACTTTTTGCC ACAGACGAAA TTGTAGCGCG TGGTGGTAAT
CAGATTCGAA TGATAGGTGG GGAGTATGGT GCATTATGGC GTAATGATGG CGCTAAAACT
TACCTGCTGC TTACCAATCA AGGTGATGTT TATGGTGGCT GGAATACATT AAGACCGTTT
GCTATTGATA ACGCAACCGG CGAACTGGTT ATTGGAACCA AACTGTCCGC AAGTCTGAAC
GGTAATGCAT TAACAGCAAC AAAGCTGCAA ACGCCAAGAC GGGTTTCTGG TGTTGAGTTT
GATGGTTCCA AAGATATTAC TTTAACCGCC GCGCATGTGG CTGCTTTTGC CAGAAGGGCA
ACGGATACAT ATGCCGATGC GGATGGTGGC GTTCCATGGA ATGCCGAATC TGGCGCTTAC
AATGTCACCC GCTCTGGCGA CAGCTATATT CTGGTTAACT TCTATACCGG AGTCGGAAGT
TGCCGGACCC TGCAGATGAA GGCGCATTAC AGAAATGGTG GTCTGTTCTA CCGTTCTTCA
AGAGACGGTT ATGGTTTTGA GGAAGACTGG GCAGAAGTTT ATACCTCGAA AAATCTTCCA
CCAGAAAGCT ACCCAGTCGG CGCACCAATC CCGTGGCCAT CAGATACCGT TCCGTCTGGT
TATGCCCTGA TGCAGGGGCA GGCTTTTGAC AAATCTGCTT ACCCGAAACT TGCAGCCGCT
TATCCGTCAG GCGTGATCCC TGATATGCGT GGCTGGACGA TTAAGGGCAA ACCTGCCAGT
GGTCGGGCCG TATTGTCTCA GGAACAGGAC GGCATTAAAT CGCATACCCA CAGCGCCAGC
GCATCCAGTA CAGATTTGGG GACGAAAACC ACATCGTCGT TTGATTACGG CACTAAATCC
ACGAATAACA CCGGGGCACA TACACACAGT GTGAGCGGCT CTACAAACTC GGCTGGAGCA
CACACACACT CACTAGCCAA CGTGAACACG GCTAGTGCTA ACTCCGGTGC TGGTAGTGCA
TCAACAAGAT TGTCTGTTGT GCATAATCAA AACTATGCAA CATCATCTGC TGGCGCACAT
ACCCACTCAC TGTCCGGCAC TGCTGCAAGC GCAGGTGCAC ACGCGCATAC TGTCGGTATT
GGTGCTCATA CGCACTCCGT TGCGATTGGT TCACATGGAC ACACCATCAC CGTTAACGCT
GCTGGTAACG CGGAAAACAC CGTCAAAAAC ATCGCATTTA ACTATATTGT GAGGCTTGCA
TAA
 
Protein sequence
MAVKISGVLK DGTGKPVQNC TIQLKAKRNS TTVVVNTLAS ENPDEAGRYS MDVEYGQYSV 
ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMTEDDARP EALRRFELMV EEVARNASAV
AQNTAAAKKS ASDASTSARE AATHAADAAD SARAASTSAG QAASSAQSAS SSAGTASTKA
TEASKSAAAA ESSKSAAATS AGAAKTSETN ASASLQSAAT SASTATTKAS EAATSARDAA
ASKEAAKSSE TNASSSASSA ASSATAAGNS AKAAKTSETN ARSSETAAGQ SASAAAGSKT
AAASSASAAS TSAGQASASA TAAGKSAESA ASSASTATTK AGEATEQASA AARSASAAKT
SETNAKASET SAESSKTAAA SSASSAASSA SSASASKDEA TRQASAAKSS ATTASTKATE
AAGSATAAAQ SKSTAESAAT RAETAAKRAE DIASAVALED ASTTKKGIVQ LSSATNSTSE
TLAATPKAVK SAYDNAEKRL QKDQNGADIP DKGCFLNNIN AVSKTDFADK RGMRYVRVNA
PAGATSGKYY PVVVMRSAGS VSELASRVII TTATRTAGDP MNNCEFNGFV MPGGWTDRGR
YAYGMFWQYQ NNERAIHSIM MSNKGDDLRS VFYVDGAAFP VFAFIEDGLS ISAPGADLVV
NDTTYKFGAT NPATECIAAD VILDFKSGRG FYESHSLIVN DNLSCKKLFA TDEIVARGGN
QIRMIGGEYG ALWRNDGAKT YLLLTNQGDV YGGWNTLRPF AIDNATGELV IGTKLSASLN
GNALTATKLQ TPRRVSGVEF DGSKDITLTA AHVAAFARRA TDTYADADGG VPWNAESGAY
NVTRSGDSYI LVNFYTGVGS CRTLQMKAHY RNGGLFYRSS RDGYGFEEDW AEVYTSKNLP
PESYPVGAPI PWPSDTVPSG YALMQGQAFD KSAYPKLAAA YPSGVIPDMR GWTIKGKPAS
GRAVLSQEQD GIKSHTHSAS ASSTDLGTKT TSSFDYGTKS TNNTGAHTHS VSGSTNSAGA
HTHSLANVNT ASANSGAGSA STRLSVVHNQ NYATSSAGAH THSLSGTAAS AGAHAHTVGI
GAHTHSVAIG SHGHTITVNA AGNAENTVKN IAFNYIVRLA