Gene ECH74115_0118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0118 
SymbolaroP 
ID6970126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp124648 
End bp126018 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID643384195 
Productaromatic amino acid transporter 
Protein accessionYP_002268718 
Protein GI209397094 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0360235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG 
CTTATCGCGC TGGGTGGCGC GATAGGGACA GGGTTATTCC TGGGTAGCGC CTCCGTAATA
CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG
ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC
TTTGCTTATA AATACTGGGG CAGCTTTGCT GGCTTCGCTT CTGGCTGGAA CTACTGGGTA
CTGTACGTTT TAGTTGCCAT GGCAGAGCTG ACTGCTGTGG GTAAATACAT TCAGTTCTGG
TATCCGGAAA TCCCAACCTG GGTTTCTGCC GCCGTGTTCT TTGTGGTGAT TAACGCCATC
AACCTGACCA ACGTAACAGT GTTTGGTGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT
ATTGCGGTAG TAGCGATGAT CATCTTCGGC GGCTGGCTGC TGTTCAGTGG TAACGGCGGT
CCGCAGGCAA GCGTTAGCAA CCTGTGGGAT CAGGGCGGTT TCCTGCCGCA CGGCTTCACC
GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG
ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATCC CGAAAGCAAC TAACCAGGTT
ATCTACCGCA TCCTGATTTT CTATATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG
TGGACCCGCG TTACCGCCGA TACCAGTCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT
ACCTTTGTGG CGAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC
AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG TAACGCGCCA
AAAGCGCTGG CGTCTGTCGA TAAACGCGGC GTACCGGTTA ACACCATTCT GGTGTCTGCG
CTGGTTACAG CATTGTGCGT ATTGATTAAC TATCTTGCTC CGGAATCCGC ATTTGGCCTG
TTAATGGCAC TGGTGGTATC CGCACTGGTG ATCAACTGGG CGATGATCAG TCTGGCGCAT
ATGAAGTTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT
TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG TGGTACTGGT GATTATGCTG
ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGGT GGTGTTAGGT
ATCGGCTATC TGTTTAAAGA GAAAACCGCA AAAGCCGTAA AAGCACATTA A
 
Protein sequence
MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL 
IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW
YPEIPTWVSA AVFFVVINAI NLTNVTVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG
PQASVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV
IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN
SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL
LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAVVLVIML
MTPGMAISVY LIPVWLVVLG IGYLFKEKTA KAVKAH