Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1572 |
Symbol | |
ID | 5592255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1577743 |
End bp | 1579293 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920725 |
Product | putative ABC transporter periplasmic-binding protein yddS precursor |
Protein accession | YP_001458281 |
Protein GI | 157160963 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAAATTTC CCGGTTGCGC ACGCCGCCGT ACCAAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT TATCAGCGGC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTTTCTTTT GAGCGGCTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AGCCATTCGC ACCGTTCCTC TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTGTTAAA GGAACATGCA GCGGATGATG CCCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCTGGCAAT AAACCGAACT TTAAGCGAGT ATCGGTAAAA ATTATTGGTG AAAGTGCCTC CCGTCGCCTG CAGCTCTCCC GTGGTGATAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTACCTG TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG TCTACCGATT ACCAGGGAAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAGATGCGC GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC GAAACGAAAG CCAAAGCTGA ATGGGATAAA GTGACGAACA AACCCACCAG CCTGACGTTT CTCTATTCTG ATAATGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT ATGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGGGGTGAA AGGCTTTGTG TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
|
Protein sequence | MKRSISFRPT LLALVLATNF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK DNAKFADGTP VTAEAVKLSF ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTNKPTSLTF LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY MYLFQKNYQL AMNKGVKGFV FNPMLEQVFN INTMSK
|
| |