Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1408 |
Symbol | sapA |
ID | 5592605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1402921 |
End bp | 1404564 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640920563 |
Product | peptide ABC transporter, periplasmic peptide-binding protein |
Protein accession | YP_001458122 |
Protein GI | 157160804 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCAGG TATTATCGTC TCTTTTGGTG ATTGCTGGAC TTGTGAGTGG TCAGGCAATC GCCGCGCCTG AATCTCCCCC GCATGCTGAT ATCCGCGACA GCGGTTTTGT CTATTGCGTC AGCGGGCAAG TCAACACCTT TAACCCATCC AAAGCGAGCA GTGGGTTAAT TGTCGATACC CTTGCCGCCC AGTTTTATGA TCGACTGCTG GATGTCGATC CCTATACCTA TCGCCTGATG CCGGAACTTG CCGAAAGCTG GGAAGTACTC GACAACGGCG CGACCTATCG CTTCCACCTG CGTCGCGATG TTCCGTTTCA AAAAACCGAC TGGTTTACTC CCACTCGTAA AATGAATGCC GACGATGTGG TGTTTACCTT CCAGCGAATT TTTGACCGCA ACAACCCGTG GCATAACGTC AACGGCAGCA ACTTCCCCTA CTTCGACAGC CTGCAATTTG CCGATAACGT GAAAAGCGTC CGCAAACTGG ATAATCATAC CGTTGAGTTC CGACTCGCCC AGCCGGATGC TTCTTTTTTG TGGCACCTCG CAACCCATTA TGCTTCGGTC ATGTCGGCAG AATATGCCCG GAAGTTAGAG AAAGAAGATC GCCAGGAGCA ACTCGACCGT CAACCGGTCG GCACCGGTCC GTATCAGTTG TCGGAATACC GCGCCGGGCA ATTTATTCGC CTACAACGTC ATGATGACTT CTGGCGCGGT AAACCGTTAA TGCCGCAGGT AGTGGTGGAT TTAGGCTCCG GCGGCACCGG ACGTCTGTCG AAACTCCTGA CCGGGGAATG CGACGTTCTG GCCTGGCCTG CTGCCAGCCA GCTATCCATT TTGCGTGACG ACCCGCGCTT GCGTTTAACG CTGCGTCCTG GGATGAACGT CGCCTATCTG GCATTTAACA CCGCCAAACC GCCGCTAAAT AATCCCGCTG TCCGCCATGC GCTGGCACTG GCGATTAATA ACCAGCGCCT GATGCAATCC ATCTATTATG GTACGGCTGA AACGGCGGCC TCTATTTTAC CGCGCGCCTC GTGGGCCTAT GACAACGAGG CTAAAATTAC TGAATACAAT CCGGCGAAAT CGCGCGAACA GTTGAAGTCG TTGGGGCTGG AAAATTTAAC GCTGAAACTG TGGGTGCCCA CACGTTCGCA GGCGTGGAAC CCCAGTCCAC TGAAAACTGC CGAACTGATT CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTGATTG TGCCGGTAGA AGGTCGCTTT CAGGAGGCGC GGTTGATGGA TATGAGCCAT GATCTGACGT TATCCGGTTG GGCGACGGAC AGTAACGACC CGGACAGTTT CTTCCGTCCT TTACTGAGCT GCGCGGCAAT TCATTCACAG ACCAACCTCG CCCACTGGTG CGATCCGAAA TTCGACAGCG TGTTGCGTAA GGCGCTCTCC TCGCAGCAGC TGGCGGCGCG TATTGAAGCC TATGACGAAG CGCAGAGTAT TCTGGCGCAG GAATTGCCCA TTTTGCCGCT GGCGTCGTCA TTGCGTTTGC AGGCCTATCG GTACGATATC AAAGGTCTGG TACTTAGCCC GTTTGGTAAC GCCTCCTTTG CTGGGGTGTA TCGCGAGAAA CAGGATGAGG TGAAAAAACC ATGA
|
Protein sequence | MRQVLSSLLV IAGLVSGQAI AAPESPPHAD IRDSGFVYCV SGQVNTFNPS KASSGLIVDT LAAQFYDRLL DVDPYTYRLM PELAESWEVL DNGATYRFHL RRDVPFQKTD WFTPTRKMNA DDVVFTFQRI FDRNNPWHNV NGSNFPYFDS LQFADNVKSV RKLDNHTVEF RLAQPDASFL WHLATHYASV MSAEYARKLE KEDRQEQLDR QPVGTGPYQL SEYRAGQFIR LQRHDDFWRG KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLSI LRDDPRLRLT LRPGMNVAYL AFNTAKPPLN NPAVRHALAL AINNQRLMQS IYYGTAETAA SILPRASWAY DNEAKITEYN PAKSREQLKS LGLENLTLKL WVPTRSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF QEARLMDMSH DLTLSGWATD SNDPDSFFRP LLSCAAIHSQ TNLAHWCDPK FDSVLRKALS SQQLAARIEA YDEAQSILAQ ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVYREK QDEVKKP
|
| |