Gene EcSMS35_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0115 
SymbolaroP 
ID6144209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp126694 
End bp128064 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID641615016 
Productaromatic amino acid transporter 
Protein accessionYP_001742232 
Protein GI170680377 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.329047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG 
CTTATCGCGC TGGGTGGCGC GATAGGGACC GGGTTATTCC TGGGTAGCGC CTCCGTAATA
CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG
ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC
TTTGCTTATA AATACTGGGG CAGTTTTGCC GGTTTCGCCT CTGGCTGGAA CTACTGGGTA
CTGTACGTTT TAGTTGCCAT GGCAGAGCTG ACTGCCGTGG GTAAATACAT TCAGTTCTGG
TATCCGGAAA TCCCAACCTG GGTTTCTGCC GCCGTGTTCT TTGTGGTGAT CAACGCCATC
AACCTGACAA ACGTAAAAGT GTTTGGCGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT
ATCGCGGTGG TAGCGATGAT CATCTTCGGC GGCTGGCTGC TATTCAGTGG CAACGGCGGC
CCGCAGGCGA CCGTTAGCAA CCTGTGGGAT CAGGGTGGTT TCCTGCCGCA CGGCTTCACC
GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG
ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATTC CAAAAGCGAC TAACCAGGTT
ATCTACCGCA TCCTGATTTT CTACATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG
TGGACACGCG TTACCGCTGA CACCAGCCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT
ACCTTTGTGG CAAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC
AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG CAACGCGCCA
AAAGCGCTGG CGTCTGTCGA CAAACGCGGT GTACCGGTTA ATACTATTCT GGTGTCTGCG
CTGGTTACGG CATTGTGCGT GCTGATTAAC TACCTTGCAC CGGAATCCGC ATTCGGCCTG
TTAATGGCGC TGGTGGTATC TGCACTGGTC ATCAACTGGG CGATGATTAG CCTGGCGCAT
ATGAAATTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT
TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG CGGTATTGGT GATTATGCTG
ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGAT CGTGTTAGGT
ATCGGCTATC TGTTTAAAGA GAAAACCGCC AAAGCCGTAA AAGCGCATTA A
 
Protein sequence
MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL 
IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW
YPEIPTWVSA AVFFVVINAI NLTNVKVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG
PQATVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV
IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN
SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL
LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAAVLVIML
MTPGMAISVY LIPVWLIVLG IGYLFKEKTA KAVKAH