Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0118 |
Symbol | aroP |
ID | 6970126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 124648 |
End bp | 126018 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384195 |
Product | aromatic amino acid transporter |
Protein accession | YP_002268718 |
Protein GI | 209397094 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0360235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG CTTATCGCGC TGGGTGGCGC GATAGGGACA GGGTTATTCC TGGGTAGCGC CTCCGTAATA CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC TTTGCTTATA AATACTGGGG CAGCTTTGCT GGCTTCGCTT CTGGCTGGAA CTACTGGGTA CTGTACGTTT TAGTTGCCAT GGCAGAGCTG ACTGCTGTGG GTAAATACAT TCAGTTCTGG TATCCGGAAA TCCCAACCTG GGTTTCTGCC GCCGTGTTCT TTGTGGTGAT TAACGCCATC AACCTGACCA ACGTAACAGT GTTTGGTGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT ATTGCGGTAG TAGCGATGAT CATCTTCGGC GGCTGGCTGC TGTTCAGTGG TAACGGCGGT CCGCAGGCAA GCGTTAGCAA CCTGTGGGAT CAGGGCGGTT TCCTGCCGCA CGGCTTCACC GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATCC CGAAAGCAAC TAACCAGGTT ATCTACCGCA TCCTGATTTT CTATATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG TGGACCCGCG TTACCGCCGA TACCAGTCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT ACCTTTGTGG CGAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG TAACGCGCCA AAAGCGCTGG CGTCTGTCGA TAAACGCGGC GTACCGGTTA ACACCATTCT GGTGTCTGCG CTGGTTACAG CATTGTGCGT ATTGATTAAC TATCTTGCTC CGGAATCCGC ATTTGGCCTG TTAATGGCAC TGGTGGTATC CGCACTGGTG ATCAACTGGG CGATGATCAG TCTGGCGCAT ATGAAGTTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG TGGTACTGGT GATTATGCTG ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGGT GGTGTTAGGT ATCGGCTATC TGTTTAAAGA GAAAACCGCA AAAGCCGTAA AAGCACATTA A
|
Protein sequence | MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW YPEIPTWVSA AVFFVVINAI NLTNVTVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG PQASVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAVVLVIML MTPGMAISVY LIPVWLVVLG IGYLFKEKTA KAVKAH
|
| |