Gene Information |
Locus tag | EcDH1_2160 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2315997 |
End bp | 2317547 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | ACX39812 |
Protein GI | 260449390 |
COG category | |
COG ID | |
TIGRFAM ID | |
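
The coordinate and length fields above are consistent with each other, and the check can be sketched as follows. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon while Protein Length does not. The function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag EcDH1_2160.
start_bp, end_bp = 2315997, 2317547

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1551, matching "Gene Length"
print(protein_length(length_bp))  # 516, matching "Protein Length"
```

Under these assumptions, 1551 bp corresponds to 517 codons: 516 amino acids plus one stop codon.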
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0024579 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAAATTTC CCGGTTGCGC ACGCCGCCGT ACCAAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT TATCAGCGGC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTTTCTTTT GAGCGGCTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AACCATTCGC ACCGTTCCTC TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTCTTAAA GGAACATGCA GCGGATGATG CTCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCAGGCAAT AAACCGAACT TCAAACGGGT ATCGGTAAAA ATTATTGGTG AAAGTGCCTC CCGTCGCCTG CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC CTGAAGCAGG AAAATAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT TACCTATCTG TATCTGAATA ACAGCAAAGC GCCTCTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG TCTACCGATT ATCAGGGCAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAGATGCGC GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT CTCTACTCCG ATAACGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC AACAAGCTGG GCATCATTGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT GTGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
Protein sequence | MKRSISFRPT LLALVLATNF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK DNAKFADGTP VTAEAVKLSF ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF LYSDNDPNWE PIALATQSSL NKLGIIVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK
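
Both sequences are displayed in space-separated groups of ten characters, so the spaces have to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two groups of the gene sequence are used to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display groups of the gene sequence above, not the full gene.
fragment = clean_sequence("ATGAAGAGAT CGATATCGTT")

print(len(fragment))               # number of bases after removing spaces
print(fragment.startswith("ATG"))  # the CDS begins with an ATG start codon
print(gc_fraction(fragment))       # GC fraction of this fragment only
```

Applying the same two functions to the full gene sequence should give a length equal to the "Gene Length" field and a GC fraction close to the reported 50%, assuming the reported value is rounded to a whole percent.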