Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2315 |
Symbol | |
ID | 5594745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2314542 |
End bp | 2316356 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921441 |
Product | ABC transporter, peripllasmic solute-binding proteins |
Protein accession | YP_001458977 |
Protein GI | 157161659 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTGC GCATACTGCT GCTGTTTATC GCCCTGTTCA CCTTCGGTGC GCAGGCGCAG GCTATCAAGG AAAGCTATGC CTTTGCCGTG CTGGGCGAAC CCCGGTACGC ATTTAATTTC AACCATTTTG ATTATGTGAA CCCCGCCGCG CCAAAAGGTG GGCAGATAAC GTTGTCAGCC CTCGGCACCT TCGATAATTT CAACCGCTAT GCACTGCGCG GCAACCCGGG CGCACGCACC GAGCAGTTGT ACGACACGCT ATTTACGACT TCCGATGACG AACCAGGCAG TTATTACCCG CTGATTGCTG AAAGTGCACG CTATGCTGAC GATTATTCCT GGGTGGAGGT CGCTATTAAT CCGCGCGCCC GTTTTCATGA TGGTTCGCCC ATTACTGCCC GCGATGTAGA GTTTACTTTT CAAAAATTTA TGACCGAAGG CGTGCCGCAA TTTCGTCTGG TCTACAAAGG CACCACCGTC AAAGCCATTG CACCGTTAAC CGTGCGCATT GAGTTAGCTA AACCCGGCAA AGAAGATATG CTGAGTCTGT TTTCGCTGCC GGTATTTCCA GAAAAGTACT GGAAGGATCA CAAACTTAGC GACCCGCTCG CCACGCCTCC GCTTGCCAGT GGTCCGTACC GCGTTACGTC CTGGAAAATG GGGCAAAATA TTGTCTATTC CCGTGTGAAA GATTACTGGG CAGCAAACTT ACCGGTAAAC CGTGGACGCT GGAATTTCGA CACCATTCGC TACGATTATT ACCTCGATGA TAATGTCGCC TTTGAAGCGT TTAAAGCAGG TGCCTTTGAT TTGCGTATGG AAAACGACGC CAAAAACTGG GCCACGCGTT ATACCGGTAA AAATTTCGAT AAAAAATACA TCATCAAAGA TGAGCAAAAG AACGAATCAG CCCAGGATAC GCGTTGGCTG GCGTTTAATA TCCAACGTCC GGTATTCAGC GATCGCCGGG TCCGAGAAGC TATCACTCTC GCCTTTGACT TTGAATGGAT GAACAAGGCG TTGTTTTACA ATGCCTGGAG TCGCACGAAC AGTTATTTTC AGAATACCGA ATACGCGGCC AGAAATTACC CCGACGCCGC GGAGCTGGTG CTTCTGGCAC CAATGAAAAA AGATCTACCG TCAGAAGTCT TCACACAAAT CTACCAGCCG CCGGTATCCA AAGGCGATGG CTACGATCGT GACAACCTGT TAAAAGCCGA CAAACTTCTC AACGAAGCGG GCTGGGTGCT GAAGGGTCAG CAACGCGTTA ATGCCACAAC GGGTCAGCCA CTCAGCTTTG AATTATTGCT TCCCGCAAGC AGCAATAGTC AGTGGGTATT GCCGTTCCAG CACAGCCTGC AACGGCTGGG TATCAACATG GACATTCGCA AGGTGGATAA CTCGCAAATC ACTAACCGCA TGCGCAGTCG CGACTATGAC ATGATGCCGC GCGTATGGCG GGCGATGCCG TGGCCCAGTT CCGATTTACA GATTTCCTGG TCATCGGAAT ATATCAATTC CACTTATAAT GCCCCCGGCG TGCAAAGCCC GGTTATCGAC TCGCTGATCA ACCAAATTAT TGCCGCGCAG GGAAATAAAG AAAAATTACT GCCGTTGGGG CGAGCACTGG ATCGCGTATT AACGTGGAAT TATTACATGC TGCCAATGTG GTACATGGCG GAAGACCGTC TCGCCTGGTG GGATAAATTC TCCCAGCCGG CCGTGCGCCC CATCTATAGC CTCGGTATCG ATACCTGGTG GTATGACGTC AATAAAGCGG CCAAACTGCC GTCCGCCAGC AAACAGGGAG AGTAG
|
Protein sequence | MIVRILLLFI ALFTFGAQAQ AIKESYAFAV LGEPRYAFNF NHFDYVNPAA PKGGQITLSA LGTFDNFNRY ALRGNPGART EQLYDTLFTT SDDEPGSYYP LIAESARYAD DYSWVEVAIN PRARFHDGSP ITARDVEFTF QKFMTEGVPQ FRLVYKGTTV KAIAPLTVRI ELAKPGKEDM LSLFSLPVFP EKYWKDHKLS DPLATPPLAS GPYRVTSWKM GQNIVYSRVK DYWAANLPVN RGRWNFDTIR YDYYLDDNVA FEAFKAGAFD LRMENDAKNW ATRYTGKNFD KKYIIKDEQK NESAQDTRWL AFNIQRPVFS DRRVREAITL AFDFEWMNKA LFYNAWSRTN SYFQNTEYAA RNYPDAAELV LLAPMKKDLP SEVFTQIYQP PVSKGDGYDR DNLLKADKLL NEAGWVLKGQ QRVNATTGQP LSFELLLPAS SNSQWVLPFQ HSLQRLGINM DIRKVDNSQI TNRMRSRDYD MMPRVWRAMP WPSSDLQISW SSEYINSTYN APGVQSPVID SLINQIIAAQ GNKEKLLPLG RALDRVLTWN YYMLPMWYMA EDRLAWWDKF SQPAVRPIYS LGIDTWWYDV NKAAKLPSAS KQGE
|
| |