Gene ECH74115_4470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4470 
Symbol 
ID6966609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4141363 
End bp4142403 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content58% 
IMG OID643388185 
Productputative permease 
Protein accessionYP_002272622 
Protein GI209396489 
COG category[R] General function prediction only 
COG ID[COG0701] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGTC AGTCTTCATC TCAGGCGGCA ACGCCCATTC AGTGGTGGAA ACCCGCGCTT 
TTCTTTCTCG TTGTCATTGC CGGTCTCTGG TATGTGAAAT GGCAACCTTA CTACGGCAAA
GCGTTTACTG CTGCCGAAAC CCACAGTATC GGTAAATCTA TCCTTGCGCA GGCGGATGCT
AATCCATGGC AGGCGGCGTT GGATTACGCG ATGATCTATT TCCTCGCGGT ATGGAAAGCG
GCGGTGCTGG GGGTGATCCT CGGTTCGTTG ATTCAGGTGC TGATCCCGCG TGACTGGTTG
TTGCGTACGC TTGGGCAATC GCGCTTTCGC GGCACGCTGC TGGGAACGCT GTTTTCGCTG
CCAGGCATGA TGTGTACCTG CTGTGCGGCT CCGGTCGCGG CGGGAATGCG TCGCCAACAG
GTGTCGATGG GCGGTGCGCT GGCATTCTGG ATGGGCAATC CGGTGTTAAA CCCGGCGACG
CTGGTGTTTA TGGGGTTTGT CCTCGGCTGG GGTTTTGCGG CGATTCGTCT GGTGGCCGGG
CTGGTGATGG TGTTGCTGAT TGCGACGCTG GTGCAAAAAT GGGTGCGTGA AACACCGCAA
ACGCAAGCAC CGGTCGAAAT TGACATACCG GAAGCACAGG GCGGATTTTT TAGTCGCTGG
GGCAGGGCGC TATGGACGCT TTTCTGGAGT ACGATCCCGG TTTACATCCT TGCAGTACTG
GTGTTGGGTG CCGCTCGCGT CTGGTTATTC CCCCATGCCG ATGGTGCTGT CGATAACAGC
CTGATGTGGG TGGTGGCGAT GGCGGTAGCA GGGTGCTTGT TTGTCATTCC CACGGCAGCA
GAAATTCCGA TTGTACAAAC GATGATGCTG GCAGGTATGG GAACTGCTCC GGCGCTGGCA
TTGTTGATGA CGCTCCCGGC GGTGAGTTTG CCGTCACTGA TTATGCTGCG CAAAGCGTTC
CCGGCGAAAG CCTTATGGCT GACGGGGGCG ATGGTGGCAG TGTCAGGCGT GATTGTCGGC
GGGCTGGCGC TGTTATTCTG A
 
Protein sequence
MTGQSSSQAA TPIQWWKPAL FFLVVIAGLW YVKWQPYYGK AFTAAETHSI GKSILAQADA 
NPWQAALDYA MIYFLAVWKA AVLGVILGSL IQVLIPRDWL LRTLGQSRFR GTLLGTLFSL
PGMMCTCCAA PVAAGMRRQQ VSMGGALAFW MGNPVLNPAT LVFMGFVLGW GFAAIRLVAG
LVMVLLIATL VQKWVRETPQ TQAPVEIDIP EAQGGFFSRW GRALWTLFWS TIPVYILAVL
VLGAARVWLF PHADGAVDNS LMWVVAMAVA GCLFVIPTAA EIPIVQTMML AGMGTAPALA
LLMTLPAVSL PSLIMLRKAF PAKALWLTGA MVAVSGVIVG GLALLF