Gene ECH74115_5674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5674 
Symbol 
ID6966796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5313277 
End bp5314821 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content47% 
IMG OID643389307 
Productamino acid permease family protein 
Protein accessionYP_002273703 
Protein GI209396517 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.270283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.215143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGCGT TTTATACACG CGCTGAAATG AAGGATGGTT TCATGCCTCA CACGATAAAA 
AAGATGAGTC TGATAGGACT CATATTGATG ATCTTTACTT CCGTATTTGG ATTTGCCAAT
AGCCCATCGG CTTATTACTT AATGGGTTAT AGTGCGATTC CCTTTTATAT ATTTTCTGCA
TTGTTATTCT TTATTCCATT CGCCTTAATG ATGGCTGAAA TGGGAGCTGC TTATCGCAAA
GAAGAAGGCG GTATCTATTC CTGGATGAAT AATAGTGTCG GACCACGTTT TGCCTTCATT
GGTACGTTTA TGTGGTTTTC CTCTTATATC ATCTGGATGG TGAGCACCTC CGCGAAAGTT
TGGGTACCGT TCTCAACATT CCTCTATGGT AGCGACATGA CCCAGCACTG GCGTATTGCC
GGACTGGAGC CTACGCAGGT GGTTGGTCTG CTGGCAGTGG CATGGATGAT TCTGGTCACC
GTCGTTGCTT CAAAGGGGAT TAATAAAATT GCCCGCATTA CTGCGGTGGG CGGTATTGCA
GTAATGTGTC TGAATTTAGT ATTGCTGTTA GTAAGCATTA CTATTTTGTT ATTAAATGGT
GGGCATTTCG CGCAGGATAT TAATTTCCTT GCATCACCGA ACCCAGGTTA TCAGTCCGGT
CTGGCAATGC TATCGTTTGT GGTATTTGCT ATTTTTGCCT ATGGCGGAAT TGAAGCGGTT
GGTGGTCTGG TCGATAAAAC GGAAAATCCA GAAAAGAACT TTGCCAAAGG TATTGTTTTT
GCCGCTATTG TTATTTCAAT CGGTTATTCG CTGGCAATAT TTTTATGGGG CGTCAGCACA
AACTGGCAGC AGGTATTAAG TAATGGTTCC GTTAACCTCG GCAATATTAC CTATGTGCTG
ATGAAGAGCC TCGGGGTGAC GCTGGGTAAC GCACTGCATT TGTCACCTGA AGCGTCATTG
TCGCTGGGCG TATGGTTTGC GCGTATTACC GGACTTTCGA TGTTCCTCGC TTATACCGGT
GCGTTCTTTA CGCTTTGCTA TTCACCGCTG AAAGCCATCA TCCAGGGGAC GCCGAAAGCG
TTGTGGCCGG AACCGATGAC GCGCCTGAAT GCGATGGGGA TGCCGTCTAT CGCTATGTGG
ATGCAGTGCG GGTTGGTTAC TGTCTTCATC CTGCTGGTTT CGTTTGGTGG CGGTACCGCA
TCGGCGTTCT TTAACAAGCT GACGCTGATG GCGAACGTGT CTATGACGCT TCCTTACCTG
TTCCTCGCGC TGGCTTTCCC ATTCTTTAAA GCACGTCAGG ATCTCGACAG ACCATTTGTG
ATTTTCAAAA CGCGTATGTC GGCAATGATT GCGACGGTGG TTGTCGTTCT GGTGGTGACA
TTTGCGAACG TCTTCACCAT TATTCAGCCT GTGGTTGAAG CCGGAGACTG GGACAGCACA
TTGTGGATGA TTGGCGGCCC TGTCTTCTTC TCGCTGTTAG CGATGGCGAT TTACCAGAAC
TATTGCAGTC GCATGGCGAA TAAACCTGAG TTAGCTCTCG ACTGA
 
Protein sequence
MGAFYTRAEM KDGFMPHTIK KMSLIGLILM IFTSVFGFAN SPSAYYLMGY SAIPFYIFSA 
LLFFIPFALM MAEMGAAYRK EEGGIYSWMN NSVGPRFAFI GTFMWFSSYI IWMVSTSAKV
WVPFSTFLYG SDMTQHWRIA GLEPTQVVGL LAVAWMILVT VVASKGINKI ARITAVGGIA
VMCLNLVLLL VSITILLLNG GHFAQDINFL ASPNPGYQSG LAMLSFVVFA IFAYGGIEAV
GGLVDKTENP EKNFAKGIVF AAIVISIGYS LAIFLWGVST NWQQVLSNGS VNLGNITYVL
MKSLGVTLGN ALHLSPEASL SLGVWFARIT GLSMFLAYTG AFFTLCYSPL KAIIQGTPKA
LWPEPMTRLN AMGMPSIAMW MQCGLVTVFI LLVSFGGGTA SAFFNKLTLM ANVSMTLPYL
FLALAFPFFK ARQDLDRPFV IFKTRMSAMI ATVVVVLVVT FANVFTIIQP VVEAGDWDST
LWMIGGPVFF SLLAMAIYQN YCSRMANKPE LALD