Gene Information |
Locus tag | EcE24377A_1501 |
Symbol | sapA |
ID | 5590636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1487194 |
End bp | 1488837 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640925193 |
Product | peptide ABC transporter, periplasmic peptide-binding protein |
Protein accession | YP_001462598 |
Protein GI | 157158938 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
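The coordinates and lengths listed above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. Both are assumptions that the listed values happen to satisfy, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1487194, 1488837)
print(length)                  # 1644
print(protein_length(length))  # 547
```

Because the gene lies on the minus strand, the start codon sits at the higher coordinate (End bp), but the length calculation does not depend on strand.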
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCAGG TATTATCGTC TCTTTTGGTG ATTGCTGGAC TTGTGAGTGG TCAGGCAATC GCCGCGCCTG AATCCCCCCC GCATGCTGAT ATCCGCGACA GCGGTTTTGT CTATTGCGTC AGCGGGCAAG TCAACACCTT TAACCCATCC AAAGCGAGCA GTGGGTTAAT TGTCGATACC CTTGCCGCCC AGTTTTATGA TCGACTGCTG GATGTCGATC CCTATACCTA TCGCCTGATG CCGGAACTTG CCGAAAGCTG GGAAGTACTC GACAACGGCG CGACCTATCG CTTCCACCTG CGTCGCGATG TTCCGTTTCA AAAAACCGAC TGGTTTACTC CCACTCGTAA AATGAATGCC GACGATGTGG TGTTTACCTT CCAGCGAATT TTTGACCGCA ACAACCCGTG GCATAACGTC AACGGCAGCA ACTTCCCCTA CTTCGACAGC CTGCAATTTG CCGATAACGT GAAAAGCGTC CGCAAACTGG ATAATCATAC CGTTGAGTTC CGACTCGCCC AGCCGGATGC TTCTTTTTTG TGGCACCTCG CAACCCATTA TGCTTCGGTC ATGTCGGCAG AATATGCCCG GAAGTTAGAG AAAGAAGATC GCCAGGAGCA ACTCGACCGT CAACCGGTCG GCACCGGTCC GTATCAGTTG TCGGAATACC GCGCCGGGCA ATTTATTCGC CTACAACGTC ATGATGACTT CTGGCGCGGT AAACCGTTAA TGCCGCAGGT AGTGGTGGAT TTAGGCTCCG GCGGCACCGG ACGTCTGTCG AAACTCCTGA CCGGGGAATG CGACGTTCTG GCCTGGCCTG CTGCCAGCCA GCTATCCATT TTGCGTGACG ATCCGCGCTT GCGTTTAACG CTGCGTCCTG GGATGAACGT CGCCTATCTG GCATTTAACA CCGCCAAACC GCCGCTAAAT AATCCCGCTG TCCGCCATGC ACTGGCACTG GCGATTAATA ACCAGCGCCT GATGCAATCC ATCTATTATG GTACGGCTGA AACGGCGGCC TCTATTTTAC CGCGCGCCTC GTGGGCCTAT GACAACGAGG CTAAAATTAC TGAATACAAT CCGGCGAAAT CGCGCGAACA GTTGAAGGCG TTGGGGCTGG AAAACTTAAC GCTGAAACTG TGGGTGCCCA CACGTTCGCA GGCGTGGAAC CCCAGTCCAC TGAAAACTGC CGAACTGATT CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTGATTG TGCCGGTAGA AGGTCGCTTT CAGGAGGCGC GGTTGATGGA TATGAGCCAT GATCTGACGT TATCCGGTTG GGCGACGGAC AGTAACGACC CGGACAGTTT CTTCCGTCCG TTACTGAGCT GCGCGGCAAT TCATTCACAG ACCAACCTCG CCCACTGGTG CGATCCGAAA TTCGACAGCG TGTTGCGTAA GGCGCTCTCC TCGCAGCAGC TGGCAGCGCG TATTGAAGCC TATGACGAAG CGCAGAGTAT TCTGGCGCAG GAATTACCTA TTCTGCCGCT GGCGTCGTCA TTGCGTTTGC AGGCCTATCG GTACGATATC AAAGGTCTGG TACTTAGCCC GTTTGGTAAC GCCTCCTTTG CTGGGGTGTA TCGCGAGAAA CAGGATGAGG TGAAAAAACC ATGA
|
Protein sequence | MRQVLSSLLV IAGLVSGQAI AAPESPPHAD IRDSGFVYCV SGQVNTFNPS KASSGLIVDT LAAQFYDRLL DVDPYTYRLM PELAESWEVL DNGATYRFHL RRDVPFQKTD WFTPTRKMNA DDVVFTFQRI FDRNNPWHNV NGSNFPYFDS LQFADNVKSV RKLDNHTVEF RLAQPDASFL WHLATHYASV MSAEYARKLE KEDRQEQLDR QPVGTGPYQL SEYRAGQFIR LQRHDDFWRG KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLSI LRDDPRLRLT LRPGMNVAYL AFNTAKPPLN NPAVRHALAL AINNQRLMQS IYYGTAETAA SILPRASWAY DNEAKITEYN PAKSREQLKA LGLENLTLKL WVPTRSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF QEARLMDMSH DLTLSGWATD SNDPDSFFRP LLSCAAIHSQ TNLAHWCDPK FDSVLRKALS SQQLAARIEA YDEAQSILAQ ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVYREK QDEVKKP
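The protein sequence above is the gene sequence read codon by codon under translation table 11. The following is a rough sketch of that translation plus a GC-content helper, assuming the gene sequence is shown in coding orientation (it starts with ATG and ends with TGA) and that whitespace between the 10-base groups should be ignored. The names `translate` and `gc_percent` are hypothetical. Table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons, so the standard codon-to-residue mapping is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGCGCCAGG TATTATCGTC TCTTTTGGTG"))  # MRQVLSSLLV
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` on the same string can be compared against the GC content field.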