Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2170 |
Symbol | |
ID | 6067665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2381393 |
End bp | 2382943 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641601577 |
Product | extracellular solute-binding protein |
Protein accession | YP_001725136 |
Protein GI | 170020182 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.233404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAACTTTC CCGGTTGCGC ACGCCGCCGT ACCGAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT TATCAGCGGC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTCTCTTTT GAGCGGTTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT GATACTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AACCATTCGC ACCGTTCCTC TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTGTTAAA GGAACATGCA GCGGATGATG CTCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTA AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCCGGCAAT AAACCGAACT TTAAGCGAGT ATCGGTAAAA ATTATCGGTG AAAGTGCCTC CCGTCGCCTG CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC CTGAAGCAGG AAAATAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTATCTG TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG TCTACCGATT ATCAGGGCAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAGATGCGC GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT CTCTACTCCG ATAACGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT GTGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
|
Protein sequence | MKRSISFRPT LLALVLATTF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK DNAKFADGTP VTAEAVKLSF ERLLKIGQGP AEAFPKDLKI DTPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK
|
| |