Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0115 |
Symbol | aroP |
ID | 6144209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 126694 |
End bp | 128064 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615016 |
Product | aromatic amino acid transporter |
Protein accession | YP_001742232 |
Protein GI | 170680377 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.329047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG CTTATCGCGC TGGGTGGCGC GATAGGGACC GGGTTATTCC TGGGTAGCGC CTCCGTAATA CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC TTTGCTTATA AATACTGGGG CAGTTTTGCC GGTTTCGCCT CTGGCTGGAA CTACTGGGTA CTGTACGTTT TAGTTGCCAT GGCAGAGCTG ACTGCCGTGG GTAAATACAT TCAGTTCTGG TATCCGGAAA TCCCAACCTG GGTTTCTGCC GCCGTGTTCT TTGTGGTGAT CAACGCCATC AACCTGACAA ACGTAAAAGT GTTTGGCGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT ATCGCGGTGG TAGCGATGAT CATCTTCGGC GGCTGGCTGC TATTCAGTGG CAACGGCGGC CCGCAGGCGA CCGTTAGCAA CCTGTGGGAT CAGGGTGGTT TCCTGCCGCA CGGCTTCACC GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATTC CAAAAGCGAC TAACCAGGTT ATCTACCGCA TCCTGATTTT CTACATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG TGGACACGCG TTACCGCTGA CACCAGCCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT ACCTTTGTGG CAAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG CAACGCGCCA AAAGCGCTGG CGTCTGTCGA CAAACGCGGT GTACCGGTTA ATACTATTCT GGTGTCTGCG CTGGTTACGG CATTGTGCGT GCTGATTAAC TACCTTGCAC CGGAATCCGC ATTCGGCCTG TTAATGGCGC TGGTGGTATC TGCACTGGTC ATCAACTGGG CGATGATTAG CCTGGCGCAT ATGAAATTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG CGGTATTGGT GATTATGCTG ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGAT CGTGTTAGGT ATCGGCTATC TGTTTAAAGA GAAAACCGCC AAAGCCGTAA AAGCGCATTA A
|
Protein sequence | MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW YPEIPTWVSA AVFFVVINAI NLTNVKVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG PQATVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAAVLVIML MTPGMAISVY LIPVWLIVLG IGYLFKEKTA KAVKAH
|
| |