Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4481 |
Symbol | |
ID | 5593899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4485157 |
End bp | 4486113 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923579 |
Product | putative sugar ABC transporter, periplasmic sugar-binding protein |
Protein accession | YP_001461020 |
Protein GI | 157163702 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAAAC GCTTACTTGT AGTCTCTGCA GTCTCGGCAG CCATGTCGTC TATGGCGTTG GCCGCTCCAT TAACCGTAGG ATTTTCGCAG GTCGGATCGG AATCCGGCTG GCGCGCTGCA GAAACCAATG TGGCGAAAAG TGAGGCCGAA AAACGCGGAA TTACGTTGAA AATTGCCGAT GGTCAGCAAA AGCAGGAAAA CCAGATTAAA GCGGTACGTT CCTTCGTTGC ACAAGGGGTG GATGCGATCT TTATCGCTCC AGTGGTTGCG ACGGGTTGGG AACCGGTATT AAAAGAGGCG AAAGATGCCG AAATCCCGGT CTTCTTGCTT GACCGTTCCA TCGATGTGAA AGACAAATCT CTCTATATGA CCACCGTCAC TGCCGACAAC ATCCTCGAAG GCAAGTTGAT TGGTGACTGG CTGGTAAAAG AAGTGAATGG CAAACCATGC AACGTGGTGG AACTGCAGGG CACCGTTGGG GCCAGCGTCG CCATTGACCG TAAGAAAGGC TTTGCCGAAG CCATTAAGAA TGCGCCAAAT ATCAAAATCA TCCGCTCGCA GTCAGGTGAC TTCACCCGCA GTAAAGGCAA AGAAGTTATG GAGAGCTTTA TCAAAGCGGA AAACAACGGC AAAAACATCT GCATGGTTTA CGCCCATAAC GATGACATGG TGATTGGTGC AATTCAGGCA ATTAAAGAAG CGGGCCTGAA ACCGGGCAAA GATATCCTCA CGGGTTCCAT TGACGGCGTA CCGGATATCT ATAAAGCGAT GATTGATGGC GAAGCGAACG CCAGCGTTGA ACTGACGCCG AACATGGCAG GCCCCGCTTT TGACGCGCTG GAGAAATACA AAAAAGACGG CACCATGCCT GAAAAGCTGA CCCTGACCAA ATCCACCCTT TATCTGCCTG ATACCGCAAA AGAAGAGTTA GAGAAGAAGA AAAATATGGG GTATTGA
|
Protein sequence | MWKRLLVVSA VSAAMSSMAL AAPLTVGFSQ VGSESGWRAA ETNVAKSEAE KRGITLKIAD GQQKQENQIK AVRSFVAQGV DAIFIAPVVA TGWEPVLKEA KDAEIPVFLL DRSIDVKDKS LYMTTVTADN ILEGKLIGDW LVKEVNGKPC NVVELQGTVG ASVAIDRKKG FAEAIKNAPN IKIIRSQSGD FTRSKGKEVM ESFIKAENNG KNICMVYAHN DDMVIGAIQA IKEAGLKPGK DILTGSIDGV PDIYKAMIDG EANASVELTP NMAGPAFDAL EKYKKDGTMP EKLTLTKSTL YLPDTAKEEL EKKKNMGY
|
| |