Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3736 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4026487 |
End bp | 4027566 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | permease YjgP/YjgQ family protein |
Protein accession | ACX41341 |
Protein GI | 260450919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00233017 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGGAGA CGCTCAAAAG CCAGCTGGCG ATACTCTTCA TCTTGCTTTT GATCTTCTTC TGTCAAAAGT TAGTGAGGAT CCTCGGCGCA GCGGTTGACG GCGATATTCC GGCGAATCTG GTGCTCTCCC TTCTCGGGTT GGGCGTGCCG GAAATGGCGC AGCTTATCCT GCCATTAAGC CTGTTCCTCG GGCTGCTGAT GACGCTGGGC AAACTGTATA CCGAAAGTGA AATTACGGTA ATGCATGCCT GCGGCCTGAG CAAAGCGGTT CTGGTGAAAG CGGCAATGAT CCTTGCGGTA TTCACGGCAA TCGTAGCGGC GGTTAACGTG ATGTGGGCGG GACCGTGGTC ATCGCGTCAT CAGGATGAAG TGTTAGCAGA AGCGAAAGCG AACCCTGGCA TGGCGGCGCT GGCGCAAGGG CAATTCCAGC AAGCGACTAA TGGCAGCTCG GTGCTGTTCA TCGAAAGCGT TGACGGCAGC GATTTCAAAG ATGTGTTCCT CGCGCAAATT CGACCAAAAG GTAATGCACG TCCTTCTGTG GTGGTGGCCG ATTCCGGACA TTTAACCCAG CTGCGCGACG GCTCCCAGGT CGTCACTCTC AACCAGGGAA CGCGCTTCGA AGGCACTGCA ATGTTACGTG ATTTCCGCAT TACGGACTTC CAGGATTATC AGGCGATCAT TGGTCACCAG GCGGTGGCGC TCGACCCGAA CGATACCGAC CAGATGGACA TGCGCACATT GTGGAACACT GACACCGATC GTGCTCGCGC AGAACTGAAC TGGTGTATCA CGTTGGTATT CACCGTGTTT ATGATGGCAC TTATGGTCGT ACCGCTGAGC GTGGTTAACC CACGTCAGGG ACGCGTACTG TCGATGCTGC CAGCCATGCT GCTGTATCTA CTTTTCTTCC TGATCCAGAC CTCCCTGAAA TCGAACGGCG GTAAAGGTAA GCTGGACCCG ACGCTGTGGA TGTGGACCGT TAACCTGATT TATCTGGCTT TAGCGATTGT TCTCAACCTT TGGGACACCG TGCCGGTCCG CCGCCTGCGC GCCAGTTTTT CGCGTAAAGG AGCGGTGTGA
|
Protein sequence | MRETLKSQLA ILFILLLIFF CQKLVRILGA AVDGDIPANL VLSLLGLGVP EMAQLILPLS LFLGLLMTLG KLYTESEITV MHACGLSKAV LVKAAMILAV FTAIVAAVNV MWAGPWSSRH QDEVLAEAKA NPGMAALAQG QFQQATNGSS VLFIESVDGS DFKDVFLAQI RPKGNARPSV VVADSGHLTQ LRDGSQVVTL NQGTRFEGTA MLRDFRITDF QDYQAIIGHQ AVALDPNDTD QMDMRTLWNT DTDRARAELN WCITLVFTVF MMALMVVPLS VVNPRQGRVL SMLPAMLLYL LFFLIQTSLK SNGGKGKLDP TLWMWTVNLI YLALAIVLNL WDTVPVRRLR ASFSRKGAV
|
| |