Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3462 |
Symbol | |
ID | 6972110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3206747 |
End bp | 3207925 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643387269 |
Product | hypothetical protein |
Protein accession | YP_002271732 |
Protein GI | 209400286 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTG TAAGCCAAAC CGAAACACGA TCTTCTGCCA ATTTTTCGCT CTTCCGCATC GCTTTTGCGG TTTTTCTCAC CTACATGACC GTAGGGTTGC CGTTGCCGGT TATCCCGTTG TTTGTTCATC ATGATCTGGG CTATGGCAAT ACCATGGTCG GCATTGCCGT CGGCATTCAG TTTCTGGCTA CGGTGCTGAC GCGTGGCTAT GCCGGGCGAC TGGCCGATCA ATATGGCGCA AAACGTTCGG CGCTTCAGGG GATGTTAGCT TGTGGTCTGG CTGGCGGCGC GTTGCTGCTG GCGGCGATTT TGCCTGTCTC CGCACCGTTC AAATTTGCCC TGTTGGTCAT CGGGCGTTTG ATTCTTGGGT TTGGTGAAAG CCAGTTACTG ACAGGCGCTC TGACCTGGGG GTTAGGCATC GTAGGGCCAA AACACTCTGG CAAAGTGATG TCATGGAATG GAATGGCGAT TTACGGTGCC CTCGCTGTTG GTGCTCCGCT TGGCCTGTTG ATTCATAGCC ATTACGGTTT TGCCGCACTG GCGCTCACCA CAATGGCATT ACCCTTACTG GCGTGGGCCT GTAACGGCAC AGTGCGCAAA GTACCAGCCC TGGCGGGAGA ACGTCCATCT CTGTGGAGCG TTGTCGGACT TATCTGGAAA CCAGGGTTAG GTCTGGCACT ACAAGGCGTT GGTTTTGCGG TTATCGGGAC TTTCGTTTCG CTCTACTTTG CCAGCAAAGG ATGGGCGATG GCGGGCTTTA CTCTTACCGC GTTTGGCGGC GCATTTGTCG TGATGCGCGT CATGTTTGGC TGGATGCCGG ACCGTTTTGG CGGCGTGAAA GTGGCGATTG TCTCTCTGCT TGTAGAAACG GTGGGCTTGT TGCTGCTCTG GCAAGCCCCA GGGGCGTGGG TCGCATTAGC GGGCGCGGCG TTAACCGGAG CCGGATGTTC GCTTATCTTT CCTGCGCTGG GTGTGGAGGT GGTTAAACGC GTCCCCTCAC AAGTTCGCGG CACCGCACTG GGCGGTTACG CCGCGTTTCA GGATATCGCC CTCGGCGTCT CCGGGCCGCT TGCGGGAATG CTGGCGACCA CGTTTGGTTA CTCTTCGGTA TTTCTTGCCG GGGCGATCTC TGCGGTGCTG GGAATTATTG TCACGATACT GTCGTTTCGT CGGGGTTAA
|
Protein sequence | MTAVSQTETR SSANFSLFRI AFAVFLTYMT VGLPLPVIPL FVHHDLGYGN TMVGIAVGIQ FLATVLTRGY AGRLADQYGA KRSALQGMLA CGLAGGALLL AAILPVSAPF KFALLVIGRL ILGFGESQLL TGALTWGLGI VGPKHSGKVM SWNGMAIYGA LAVGAPLGLL IHSHYGFAAL ALTTMALPLL AWACNGTVRK VPALAGERPS LWSVVGLIWK PGLGLALQGV GFAVIGTFVS LYFASKGWAM AGFTLTAFGG AFVVMRVMFG WMPDRFGGVK VAIVSLLVET VGLLLLWQAP GAWVALAGAA LTGAGCSLIF PALGVEVVKR VPSQVRGTAL GGYAAFQDIA LGVSGPLAGM LATTFGYSSV FLAGAISAVL GIIVTILSFR RG
|
| |