Gene ECH74115_1935 details

Gene Information

Locus tag: ECH74115_1935
Symbol: sapA
ID: 6972225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1830415
End bp: 1832034
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 54%
IMG OID: 643385865
Product: peptide ABC transporter, periplasmic peptide-binding protein
Protein accession: YP_002270354
Protein GI: 209399749
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
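
The start, end, and length values in this record are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is an illustration only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Coordinates copied from the Gene Information values in this record.
START_BP = 1830415
END_BP = 1832034


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1620 539
```

Under those assumptions the result matches the listed Gene Length (1620 bp) and Protein Length (539 aa).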


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.101699
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTGATTG CTGGACTTGT GAGTGGTCAG GCAATCGCCG CGCCTGAATC CCCCCCGCAT 
GCTGATATCC GCGACAGCGG TTTTGTCTAT TGCGTCAGCG GGCAAGTCAA CACCTTTAAC
CCATCCAAAG CGAGCAGTGG GTTAATTGTC GATACCCTTG CCGCCCAGTT TTATGATCGA
CTGCTGGATG TCGATCCCTA TACCTATCGC CTGATGCCAG AACTTGCCGA AAGCTGGGAA
GTACTCGACA ACGGCGCGAC CTATCGCTTC CACCTGCGTC GCGATGTCCC GTTTCAAAAA
ACCGCCTGGT TTACTCCCAC TCGCAAAATG AATGCCGACG ATGTGGTGTT TACCTTCCAG
CGAATTTTTG ACCGCAACAA CCCGTGGCAT AACGTCAACG GCAGCAACTT CCCCTACTTC
GACAGCCTGC AATTTGCCGA TAACGTGAAA AGCGTCCGTA AACTGGATAA TCATACCGTT
GAGTTCCGTC TGGCTCAGCC GGATGCTTCT TTTTTGTGGC ACCTCGCAAC CCATTATGCT
TCGGTCATGT CGGCAGAATA TGCCCGGAAG TTAGAGAAAG AAGATCGCCA GGAGCAACTC
GACCGTCAAC CGGTCGGCAC TGGACCGTAT CAGTTGTCGG AATACCGCGC CGGGCAATTT
ATTCGCCTAC AACGTCATGA TGACTTCTGG CGCGGTAAAC CGTTAATGCC GCAGGTGGTG
GTGGATTTAG GCTCCGGCGG CACCGGACGT CTGTCGAAAC TCCTGACCGG GGAATGCGAC
GTTCTGGCCT GGCCTGCTGC CAGCCAGCTA TCCATTTTGC GTGACGACCC GCGCTTGCGT
TTAACGCTGC GTCCTGGGAT GAACGTCGCC TATCTGGCAT TTAACACCGC CAAACCGCCG
CTAAATAATC CCGCTGTCCG CCATGCGCTG GCACTGGCGA TTAATAACCA GCGCCTGATG
CAATCCATCT ATTATGGTAC GGCTGAAACG GCGGCCTCTA TTTTACCGCG CGCCTCGTGG
GCCTATGACA ACGAGGCTAA AATTACTGAA TACAATCCGG CGAAATCGCG CGAACAGTTG
AAGTTGTTGG GGCTGGAAAA TTTAACGCTG AAACTGTGGG TGCCCACACG TTCGCAGGCG
TGGAACCCCA GTCCACTGAA AACTGCCGAA CTGATTCAGG CGGATATGGC GCAGGTTGGC
GTAAAAGTGG TGATTGTGCC GGTAGAAGGT CGCTTTCAGG AGGCGCGGTT GATGGATATG
AGCCATGATC TGACGTTATC CGGTTGGGCG ACGGACAGTA ACGACCCGGA CAGTTTCTTC
CGTCCGTTAC TGAGCTGCGC GGCAATTCAT TCACAGACCA ACCTCGCCCA CTGGTGCGAT
CCGAAATTCG ACAGCGTGTT GCGTAAGACG CTCTCCTCGC AGCAGCTGGC GGCGCGTATT
GAAGCCTATG ACGAAGCGCA GAGTATTCTG GCGCAGGAAT TGCCCATTTT GCCGCTGGCG
TCGTCATTGC GTTTGCAGGC CTATCGGTAC GATATCAAAG GTCTGGTACT TAGCCCGTTT
GGTAACGCCT CCTTTGCTGG GGTGTATCGC GAGAAACAGG ATGAGGTGAA AAAACCATGA
 
Protein sequence
MVIAGLVSGQ AIAAPESPPH ADIRDSGFVY CVSGQVNTFN PSKASSGLIV DTLAAQFYDR 
LLDVDPYTYR LMPELAESWE VLDNGATYRF HLRRDVPFQK TAWFTPTRKM NADDVVFTFQ
RIFDRNNPWH NVNGSNFPYF DSLQFADNVK SVRKLDNHTV EFRLAQPDAS FLWHLATHYA
SVMSAEYARK LEKEDRQEQL DRQPVGTGPY QLSEYRAGQF IRLQRHDDFW RGKPLMPQVV
VDLGSGGTGR LSKLLTGECD VLAWPAASQL SILRDDPRLR LTLRPGMNVA YLAFNTAKPP
LNNPAVRHAL ALAINNQRLM QSIYYGTAET AASILPRASW AYDNEAKITE YNPAKSREQL
KLLGLENLTL KLWVPTRSQA WNPSPLKTAE LIQADMAQVG VKVVIVPVEG RFQEARLMDM
SHDLTLSGWA TDSNDPDSFF RPLLSCAAIH SQTNLAHWCD PKFDSVLRKT LSSQQLAARI
EAYDEAQSIL AQELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVYR EKQDEVKKP
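
The gene sequence begins with TTG while the protein begins with M. That is consistent with translation table 11, in which TTG can act as an initiator codon and is then read as methionine rather than leucine. The sketch below shows one way to reproduce the protein from the gene sequence in plain Python. It is an illustration under assumptions, not the method used to build this record: the function name `translate_cds` is hypothetical, and the code expects a complete, in-frame CDS containing only A, C, G, and T.

```python
# Standard codon-to-amino-acid mapping (shared by NCBI tables 1 and 11),
# written in the conventional T, C, A, G ordering, one row per first base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above, spacing kept as displayed.
print(translate_cds("TTGGTGATTG CTGGACTTGT GAGTGGTCAG"))  # MVIAGLVSGQ
```

Passing the full gene sequence block to the same function should yield the 539-residue protein sequence, since the final codon TGA is a stop.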