Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2274 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2446067 |
End bp | 2449429 |
Gene Length | 3363 bp |
Protein Length | 1120 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | Prophage tail fibre domain protein |
Protein accession | ACX39921 |
Protein GI | 260449499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.817873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC ACAATCCAGC TGAAAGCAAA ACGTAACAGC ACCACGGTGG TGGTGAACAC GCTGGCCTCA GAAAATCCGG ATGAAGCCGG GCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGATT CCCGCCGTCA CATGCCGGGA CCATTACCGT GTATGAAGAT TCTCAACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGCCCGTCCG GAGGCACTGC GCCGTTTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCGGTG GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCAGTGATG CCAGCACATC AGCCCGTGAG GCGGCAACCC ATGCGGCTGA TGCTGCGGAC TCAGCACGCG CAGCCAGCAC GTCAGCCGGA CAGGCCGCGT CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCC ACTGAAGCAT CAAAAAGTGC TGCCGCTGCA GAGTCCTCAA AAAGCGCGGC GGCCACCAGT GCCGGTGCGG CGAAAACGTC AGAAACGAAT GCTTCAGCGT CACTACAATC AGCAGCCACA TCTGCATCCA CCGCGACCAC GAAGGCATCA GAAGCTGCGA CCTCGGCCCG GGATGCGGCG GCCTCAAAAG AAGCGGCAAA ATCATCAGAA ACGAACGCAT CATCAAGCGC CAGTAGTGCA GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCGG CAAAAACGTC CGAGACGAAC GCCAGGTCTT CTGAAACGGC AGCGGGACAG AGCGCCTCGG CTGCGGCAGG CTCAAAAACA GCGGCTGCGT CGTCTGCCAG TGCAGCGTCA ACAAGTGCCG GGCAGGCCTC AGCCAGTGCC ACCGCCGCCG GAAAATCGGC AGAAAGCGCC GCATCGTCTG CTTCAACAGC CACAACGAAG GCTGGCGAAG CCACTGAACA GGCCAGCGCA GCAGCGAGGT CTGCTTCCGC AGCGAAGACA TCCGAAACGA ACGCGAAAGC GTCGGAAACA AGCGCAGAAT CCTCAAAAAC GGCTGCCGCA TCGTCAGCCA GTTCGGCGGC GTCATCGGCA TCATCGGCGT CTGCTTCAAA AGATGAGGCG ACCAGACAAG CGTCAGCAGC GAAGAGCAGC GCCACGACGG CATCCACGAA GGCGACAGAG GCTGCTGGCA GTGCGACGGC GGCAGCTCAG AGCAAAAGTA CGGCGGAATC CGCGGCAACG CGCGCCGAGA CAGCAGCTAA ACGGGCAGAG GATATTGCAT CCGCCGTGGC GCTTGAGGAT GCAAGTACGA CGAAAAAGGG GATAGTACAG CTCAGCAGTG CGACCAACAG TACGTCTGAA ACGCTGGCGG CAACGCCAAA GGCAGTAAAA TCAGCCTATG ACAATGCAGA GAAACGTCTG CAGAAAGACC AGAACGGCGC TGATATACCC GATAAGGGAT GCTTCCTGAA CAACATTAAC GCGGTCAGTA AAACAGACTT TGCTGATAAG CGTGGTATGC GTTATGTGCG GGTTAACGCT CCTGCAGGTG CAACATCTGG AAAATATTAC CCTGTTGTTG TTATGCGTTC TGCTGGCTCA GTAAGCGAAC TGGCATCAAG AGTCATTATC ACCACGGCAA CGCGAACCGC AGGCGATCCG ATGAATAACT GCGAGTTTAA CGGATTTGTT ATGCCTGGTG GCTGGACTGA CAGGGGGCGT TATGCTTATG GCATGTTCTG GCAATATCAA AACAATGAAC GAGCCATTCA CTCAATAATG ATGAGTAATA AGGGCGATGA TTTGCGCTCT GTGTTCTATG TTGATGGCGC TGCTTTCCCT GTTTTTGCGT TTATTGAAGA TGGCCTGTCA ATATCCGCAC CTGGTGCTGA TCTCGTTGTT AATGATACGA CCTATAAGTT TGGGGCAACA AATCCGGCGA CTGAATGTAT CGCGGCGGAC GTTATCCTTG ATTTTAAGAG TGGGCGTGGT TTTTATGAGT CTCATTCGTT AATCGTTAAC GATAACTTGT CGTGCAAAAA ACTTTTTGCC ACAGACGAAA TTGTAGCGCG TGGTGGTAAT CAGATTCGAA TGATAGGTGG GGAGTATGGT GCATTATGGC GTAATGATGG CGCTAAAACT TACCTGCTGC TTACCAATCA AGGTGATGTT TATGGTGGCT GGAATACATT AAGACCGTTT GCTATTGATA ACGCAACCGG CGAACTGGTT ATTGGAACCA AACTGTCCGC AAGTCTGAAC GGTAATGCAT TAACAGCAAC AAAGCTGCAA ACGCCAAGAC GGGTTTCTGG TGTTGAGTTT GATGGTTCCA AAGATATTAC TTTAACCGCC GCGCATGTGG CTGCTTTTGC CAGAAGGGCA ACGGATACAT ATGCCGATGC GGATGGTGGC GTTCCATGGA ATGCCGAATC TGGCGCTTAC AATGTCACCC GCTCTGGCGA CAGCTATATT CTGGTTAACT TCTATACCGG AGTCGGAAGT TGCCGGACCC TGCAGATGAA GGCGCATTAC AGAAATGGTG GTCTGTTCTA CCGTTCTTCA AGAGACGGTT ATGGTTTTGA GGAAGACTGG GCAGAAGTTT ATACCTCGAA AAATCTTCCA CCAGAAAGCT ACCCAGTCGG CGCACCAATC CCGTGGCCAT CAGATACCGT TCCGTCTGGT TATGCCCTGA TGCAGGGGCA GGCTTTTGAC AAATCTGCTT ACCCGAAACT TGCAGCCGCT TATCCGTCAG GCGTGATCCC TGATATGCGT GGCTGGACGA TTAAGGGCAA ACCTGCCAGT GGTCGGGCCG TATTGTCTCA GGAACAGGAC GGCATTAAAT CGCATACCCA CAGCGCCAGC GCATCCAGTA CAGATTTGGG GACGAAAACC ACATCGTCGT TTGATTACGG CACTAAATCC ACGAATAACA CCGGGGCACA TACACACAGT GTGAGCGGCT CTACAAACTC GGCTGGAGCA CACACACACT CACTAGCCAA CGTGAACACG GCTAGTGCTA ACTCCGGTGC TGGTAGTGCA TCAACAAGAT TGTCTGTTGT GCATAATCAA AACTATGCAA CATCATCTGC TGGCGCACAT ACCCACTCAC TGTCCGGCAC TGCTGCAAGC GCAGGTGCAC ACGCGCATAC TGTCGGTATT GGTGCTCATA CGCACTCCGT TGCGATTGGT TCACATGGAC ACACCATCAC CGTTAACGCT GCTGGTAACG CGGAAAACAC CGTCAAAAAC ATCGCATTTA ACTATATTGT GAGGCTTGCA TAA
|
Protein sequence | MAVKISGVLK DGTGKPVQNC TIQLKAKRNS TTVVVNTLAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMTEDDARP EALRRFELMV EEVARNASAV AQNTAAAKKS ASDASTSARE AATHAADAAD SARAASTSAG QAASSAQSAS SSAGTASTKA TEASKSAAAA ESSKSAAATS AGAAKTSETN ASASLQSAAT SASTATTKAS EAATSARDAA ASKEAAKSSE TNASSSASSA ASSATAAGNS AKAAKTSETN ARSSETAAGQ SASAAAGSKT AAASSASAAS TSAGQASASA TAAGKSAESA ASSASTATTK AGEATEQASA AARSASAAKT SETNAKASET SAESSKTAAA SSASSAASSA SSASASKDEA TRQASAAKSS ATTASTKATE AAGSATAAAQ SKSTAESAAT RAETAAKRAE DIASAVALED ASTTKKGIVQ LSSATNSTSE TLAATPKAVK SAYDNAEKRL QKDQNGADIP DKGCFLNNIN AVSKTDFADK RGMRYVRVNA PAGATSGKYY PVVVMRSAGS VSELASRVII TTATRTAGDP MNNCEFNGFV MPGGWTDRGR YAYGMFWQYQ NNERAIHSIM MSNKGDDLRS VFYVDGAAFP VFAFIEDGLS ISAPGADLVV NDTTYKFGAT NPATECIAAD VILDFKSGRG FYESHSLIVN DNLSCKKLFA TDEIVARGGN QIRMIGGEYG ALWRNDGAKT YLLLTNQGDV YGGWNTLRPF AIDNATGELV IGTKLSASLN GNALTATKLQ TPRRVSGVEF DGSKDITLTA AHVAAFARRA TDTYADADGG VPWNAESGAY NVTRSGDSYI LVNFYTGVGS CRTLQMKAHY RNGGLFYRSS RDGYGFEEDW AEVYTSKNLP PESYPVGAPI PWPSDTVPSG YALMQGQAFD KSAYPKLAAA YPSGVIPDMR GWTIKGKPAS GRAVLSQEQD GIKSHTHSAS ASSTDLGTKT TSSFDYGTKS TNNTGAHTHS VSGSTNSAGA HTHSLANVNT ASANSGAGSA STRLSVVHNQ NYATSSAGAH THSLSGTAAS AGAHAHTVGI GAHTHSVAIG SHGHTITVNA AGNAENTVKN IAFNYIVRLA
|
| |