Gene ECH74115_4392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4392 
Symbol 
ID6972075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4068848 
End bp4070281 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content49% 
IMG OID643388114 
Productamino acid permease family protein 
Protein accessionYP_002272551 
Protein GI209400233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA CCAAACGTAA TACAATCGGC AAATTCGGCT TGCTCTCGCT GACTTTTGCC 
GCCGTTTACA GCTTTAACAA CGTTATCAAT AATAATATTG AGCTTGGACT GGCCTCGGCA
CCGATGTTTT TCCTCGCGAC GATTTTTTAT TTTATTCCCT TCTGTCTGAT CATCGCAGAA
TTTGTTTCGT TAAATAAAAA CTCAGAAGCC GGTGTCTACG CGTGGGTAAA AAGTTCGCTG
GGCGGACGTT GGGCATTTAT TACTGCCTAT ACCTACTGGT TCGTAAACCT GTTCTTTTTC
ACCTCGCTGT TGCCGCGCGT TATTGCTTAT GCTTCGTATG CCTTCCTCGG CTATGAATAT
ATTATGACGC CGGTTGCCAC CACCATTATC AGTATGGTGC TGTTCGCCTT CTCCACCTGG
GTTTCCACCA ACGGGGCGAA AATGTTGGGG CCAATTACCT CCGTCACTTC AACGCTGATG
CTGCTGTTAA CGCTCTCCTA CATTTTACTG GCAGGTACGG CGCTGGTTGG CGGCGTACAG
CCTGCTGACC CCATCACCGT TGACGCGATG ATCCCGAACT TCAACTGGGC GTTCCTCGGC
GTTACCACCT GGATCTTTAT GGCCGCAGGT GGCGCGGAGT CCGTCGCTGT GTACGTTAAC
GACGTCAAAG GCGGTTCGAA ATCGTTCGTT AAAGTGATCA TCCTCGCCGG GATTTTTATC
GGTGTACTAT ATTCCGTCTC CTCGGTGCTG ATTAACGTCT TCGTCAGCAG CAAAGAGTTG
AAATTTACTG GCGGATCGGT GCAGGTATTC CACGGCATGG CGGCGTATTT TGGTCTGCCG
GAAGCGTTGA TGAATCGCTT TGTCGGTCTG GTGTCCTTTA CCGCAATGTT CGGTTCCCTG
CTGATGTGGA CCGCAACGCC GGTGAAAATT TTCTTCTCCG AAATCCCGGA AGGCATCTTT
GGTAAGAAAA CCGTCGAACT TAACGAAAAC GGCGTTCCGG CGCGCGCAGC GTGGATCCAG
TTCCTGATCG TCATCCCGCT GATGATTATC CCGATGCTCG GTTCCAATAC TGTGCAGGAT
CTGATGAATA CTATTATTAA TATGACCGCC GCAGCGTCCA TGCTTCCGCC GTTATTCATC
ATGCTGGCTT ACCTGAATTT ACGCGCCAAA TTAGATCACC TGCCACGCGA TTTCCGTATG
GGCTCCCGCC GCACCGGTAT TATCGTTGTT TCAATGCTGA TTGCGATATT TGCCGTAGGG
TTTGTCGCTT CGACATTCCC GACTGGCGCG AATATTCTGA CCATCATTTT TTATAACGTC
GGCGGTATTG TTATATTCCT TGGCTTTGCG TGGTGGAAAT ACAGTAAATA TATAAAGGGA
TTAACGGCTG AAGAGCGCCA TATTGAAGCG ACGCCAGCCA GCAATGTTGA TTAA
 
Protein sequence
MSDTKRNTIG KFGLLSLTFA AVYSFNNVIN NNIELGLASA PMFFLATIFY FIPFCLIIAE 
FVSLNKNSEA GVYAWVKSSL GGRWAFITAY TYWFVNLFFF TSLLPRVIAY ASYAFLGYEY
IMTPVATTII SMVLFAFSTW VSTNGAKMLG PITSVTSTLM LLLTLSYILL AGTALVGGVQ
PADPITVDAM IPNFNWAFLG VTTWIFMAAG GAESVAVYVN DVKGGSKSFV KVIILAGIFI
GVLYSVSSVL INVFVSSKEL KFTGGSVQVF HGMAAYFGLP EALMNRFVGL VSFTAMFGSL
LMWTATPVKI FFSEIPEGIF GKKTVELNEN GVPARAAWIQ FLIVIPLMII PMLGSNTVQD
LMNTIINMTA AASMLPPLFI MLAYLNLRAK LDHLPRDFRM GSRRTGIIVV SMLIAIFAVG
FVASTFPTGA NILTIIFYNV GGIVIFLGFA WWKYSKYIKG LTAEERHIEA TPASNVD