Gene ECH74115_0008 details


Gene Information

Locus tag: ECH74115_0008
Symbol: agcS
ID: 6972052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 6546
End bp: 7976
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 53%
IMG OID: 643384092
Product: amino acid carrier protein
Protein accession: YP_002268615
Protein GI: 209398740
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1115] Na+/alanine symporter
TIGRFAM ID: [TIGR00835] amino acid carrier protein
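
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes a single terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 6546
end_bp = 7976
gene_length_bp = 1431
protein_length_aa = 476

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codon_count = span_bp // 3
residue_count = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("residues match protein length:", residue_count == protein_length_aa)
```

Under those assumptions the three fields agree with each other: 7976 - 6546 + 1 = 1431 bp, which is 477 codons, or 476 residues plus a stop.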


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.248678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG 
CTCTTCGGCG CAGGTTGTTG GTTCACTTTT CGCACCGGAT TTGTGCAGTT TCGCTACATC
CGCCAGTTTG GCAAAAGTCT TAAAAATAGC ATTCATCCAC AGCCAGGCGG TTTAACCTCA
TTTCAGTCAT TGTGTACCAG TCTTGCGGCG CGCGTGGGTA GCGGCAACCT GGCCGGCGTT
GCGCTGGCTA TTACCGCCGG TGGACCTGGA GCCGTCTTTT GGATGTGGGT TGCCGCGTTT
ATCGGCATGG CGACCTCGTT TGCCGAATGT TCCCTTGCAC AACTTTATAA AGAACGTGAC
GTCAATGGGC AGTTTCGTGG CGGACCGGCA TGGTATATGG CGCGCGGGCT GGGGATGCGC
TGGATGGGCG TTCTGTTCGC CGTCTTTTTG CTCATCGCCT ACGGCATAAT TTTCAGCGGA
GTTCAGGCAA ACGCTGTTGC CCGAGCTCTG AGTTTTTCTT TTGATTTTCC CCCGCTGGTG
ACAGGCATTA TTCTTGCTGT CTTTGCTCTG CTGGCAATCA TTCGCGGTCT TCATGGCGTC
GCCCGGCTCA TGCAGGGGTT TGTCCCGTTG ATGGCGATAA TCTGGGTACT GACCAGCCTG
GTGATTTGCG TAATAAATAT CGGGCAACTT CCCCACGTCA TTTGGTCTAT TTTTGAGAGT
GCTTTTGGCT GGCAGGAAGC GGCAGGCGGC GCGGCGGGAT ATACCTTAAG CCAGGCGATT
ACTAACGGTT TTCAGCGCAG TATGTTTTCC AATGAGGCGG GAATGGGGTC GACGCCAAAC
GCGGCAGCGG CAGCGGCGTC CTGGCCTCCG CATCCGGCAG CGCAAGGAAT TGTCCAGATG
ATTGGCATTT TTATCGACAC CCTGGTCATC TGTACGGCAA GCGCCATGCT GATATTACTG
GCGGGTAACG GCACAACCTA CATGCCGCTG GAAGGTATTC AGCTTATCCA GAAGGCGATG
CGGGTGCTAA TGGGTTCCTG GGGTGCTGAA TTTGTTACCC TCGTGGTTAT TCTGTTTGCC
TTCAGCTCCA TCGTTGCCAA CTACATTTAT GCCGAAAACA ATCTCTTCTT TTTACGCCTG
AACAACCCTA AAGCGATCTG GTGTTTGCGG ATCTGCACCT TCGCAACGGT CATCGGCGGC
ACCTTGCTAA GTCTTCCGCT GATGTGGCAA CTGGCAGATA TCATAATGGC CTGCATGGCT
ATTACCAATT TGACTGCCAT ATTACTGCTC TCGCCTGTGG TTTATACCAT TGCCAGTGAT
TATCTACGCC AGCGTAAACT CGGCGTGCGC CCGGTGTTTG ATCCGTTGCG TTATCCGGAT
ATCGGTCGCC AGCTTTCTCG GGACGCGTGG GATGATGTTT CGCAGGAGTA A
 
Protein sequence
MPDFFSFINS VLWGSVMIYL LFGAGCWFTF RTGFVQFRYI RQFGKSLKNS IHPQPGGLTS 
FQSLCTSLAA RVGSGNLAGV ALAITAGGPG AVFWMWVAAF IGMATSFAEC SLAQLYKERD
VNGQFRGGPA WYMARGLGMR WMGVLFAVFL LIAYGIIFSG VQANAVARAL SFSFDFPPLV
TGIILAVFAL LAIIRGLHGV ARLMQGFVPL MAIIWVLTSL VICVINIGQL PHVIWSIFES
AFGWQEAAGG AAGYTLSQAI TNGFQRSMFS NEAGMGSTPN AAAAAASWPP HPAAQGIVQM
IGIFIDTLVI CTASAMLILL AGNGTTYMPL EGIQLIQKAM RVLMGSWGAE FVTLVVILFA
FSSIVANYIY AENNLFFLRL NNPKAIWCLR ICTFATVIGG TLLSLPLMWQ LADIIMACMA
ITNLTAILLL SPVVYTIASD YLRQRKLGVR PVFDPLRYPD IGRQLSRDAW DDVSQE
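
The gene and protein sequences can be spot-checked against each other by translating a few codons. The sketch below is a minimal example, not the tool that produced this record. It builds a codon lookup from the standard amino acid assignments, which translation table 11 shares with the standard code for internal codons (the two differ in permitted start codons). The `translate` helper and the variable names are hypothetical. Only the first and last lines of the gene sequence are used; the last line starts in frame because every preceding line holds 60 bases.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCAGATT TTTTCTCCTT TATTAACAGC GTCCTTTGGG GATCGGTAAT GATTTACCTG"
last_line = "ATCGGTCGCC AGCTTTCTCG GGACGCGTGG GATGATGTTT CGCAGGAGTA A"

print(translate(first_line))  # MPDFFSFINSVLWGSVMIYL
print(translate(last_line))   # IGRQLSRDAWDDVSQE
```

Both outputs match the start and end of the listed protein sequence, and the final codon of the gene, TAA, is a stop.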