Gene Information |
Locus tag | ECH74115_5115 |
Symbol | |
ID | 6967871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4758700 |
End bp | 4760385 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388788 |
Product | hypothetical protein |
Protein accession | YP_002273214 |
Protein GI | 209396187 |
COG category | [R] General function prediction only |
COG ID | [COG2985] Predicted permease |
TIGRFAM ID | [TIGR01625] AspT/YidE/YbjL antiporter duplication domain |
|
|
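The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative only.
start_bp = 4758700
end_bp = 4760385

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported 1686 bp and 561 aa. The strand field does not enter the calculation, because the span covers the same number of bases on either strand.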
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.44003 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATCTC AAGAGAAATG GACGATGAGT GATATAGCAT TAACGGTCAG TATTTTGGCT TTGGTGGCAG TCGTCGGTTT GTTTATCGGC AACGTCAAAT TTCGCGGCAT AGGATTAGGT ATTGGCGGCG TGCTGTTTGG TGGGATCATC GTCGGCCATT TTGTTTCTCA GGCGGGGATG ACATTAAGTA GCGATATGCT GCATGTTATT CAGGAATTTG GCCTGATCCT GTTCGTTTAT ACCATCGGGA TTCAGGTAGG GCCGGGCTTC TTTGCCTCAT TGCGCGTCTC CGGATTACGC CTCAACCTGT TTGCTGTTCT GATCGTCATC ATCGGTGGTC TGGTTACCGC CATCCTGCAT AAACTGTTTG ATATTCCACT GCCGGTAGTG CTGGGGATTT TCTCCGGTGC GGTAACCAAT ACGCCAGCGC TGGGGGCAGG GCAGCAGATT TTGCGCGACC TGGGTACACC AATGGAAATG GTCGATCAGA TGGGGATGAG TTACGCGATG GCGTATCCAT TCGGCATTTG CGGGATATTG TTCACCATGT GGATGTTGCG GGTTATTTTC CGCGTCAATG TCGAGACAGA AGCCCAGCAG CACGAGTCTT CACGCACCAA TGGCGGCGCG CTGATCAAGA CTATCAATAT TCGCGTTGAG AACCCTAACC TGCATGATTT AGCCATTAAA GATGTACCGA TTCTCAACGG CGACAAAATT ATCTGCTCGC GTCTGAAACG CGAAGAAACC CTAAAAGTTC CTTCGCCAGA TACCATTATC CAACTGGGCG ATTTGCTGCA TCTGGTGGGT CAGCCAGCGG ATTTACATAA TGCGCAACTG GTGATTGGTC AGGAGGTCGA TACTTCGCTG TCCACGAAAG GCACTGATTT GCGCGTCGAG CGTGTGGTGG TCACCAATGA AAACGTGCTT GGTAAGCGTA TTCGCGACCT GCATTTTAAA GAACGCTATG ACGTTGTTAT CTCGCGCCTG AACCGTGCCG GGGTCGAACT GGTCGCCAGT GGCGATATCA GCCTGCAGTT CGGCGATATC CTCAACCTGG TGGGGCGTCC GTCCGCAATT GATGCCGTTG CCAATGTGCT GGGGAATGCG CAGCAAAAAC TGCAACAGGT TCAGATGCTG CCAGTGTTTA TTGGCATCGG GCTTGGCGTA TTGTTAGGTT CTATTCCCGT CTTTGTGCCA GGATTCCCGG CCGCGTTGAA ACTGGGGCTG GCGGGCGGTC CGCTGATTAT GGCGTTGATC CTCGGGCGTA TCGGCAGTAT CGGCAAGCTG TACTGGTTTA TGCCGCCAAG TGCCAACCTC GCGCTGCGGG AACTGGGGAT CGTGCTGTTC CTCTCGGTCG TTGGTCTGAA ATCTGGTGGG GATTTTGTGA ATACCCTGGT CAATGGCGAA GGGCTAAGCT GGATTGGTTA TGGTGCCCTG ATCACCGCCG TTCCGCTGAT TACTGTTGGC ATTCTGGCGC GGATGTTAGC CAAAATGAAT TACCTGACCA TGTGCGGGAT GCTGGCAGGT TCCATGACCG ATCCTCCGGC GCTGGCGTTT GCTAATAATC TTCATCCAAC CAGCGGTGCG GCGGCGCTCT CTTACGCCAC TGTCTATCCG TTAGTGATGT TCCTGCGCAT TATCACCCCC CAATTACTGG CGGTGCTCTT CTGGAGTATC GGTTAA
|
Protein sequence | MLSQEKWTMS DIALTVSILA LVAVVGLFIG NVKFRGIGLG IGGVLFGGII VGHFVSQAGM TLSSDMLHVI QEFGLILFVY TIGIQVGPGF FASLRVSGLR LNLFAVLIVI IGGLVTAILH KLFDIPLPVV LGIFSGAVTN TPALGAGQQI LRDLGTPMEM VDQMGMSYAM AYPFGICGIL FTMWMLRVIF RVNVETEAQQ HESSRTNGGA LIKTINIRVE NPNLHDLAIK DVPILNGDKI ICSRLKREET LKVPSPDTII QLGDLLHLVG QPADLHNAQL VIGQEVDTSL STKGTDLRVE RVVVTNENVL GKRIRDLHFK ERYDVVISRL NRAGVELVAS GDISLQFGDI LNLVGRPSAI DAVANVLGNA QQKLQQVQML PVFIGIGLGV LLGSIPVFVP GFPAALKLGL AGGPLIMALI LGRIGSIGKL YWFMPPSANL ALRELGIVLF LSVVGLKSGG DFVNTLVNGE GLSWIGYGAL ITAVPLITVG ILARMLAKMN YLTMCGMLAG SMTDPPALAF ANNLHPTSGA AALSYATVYP LVMFLRIITP QLLAVLFWSI G
|
| |
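The gene and protein sequences above are shown in space-separated blocks of ten, and the record lists translation table 11. The following is a minimal sketch of how the two sequences relate, assuming the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA). The codon-to-amino-acid assignments of table 11 are the same as those of the standard code; the tables differ in which start codons are permitted, which this sketch does not handle. The function names are hypothetical, and only the first 30 bases and first 10 residues from the record are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: alternative start codons (allowed by table 11) are NOT
# remapped to M here; this gene starts with ATG, so it does not matter.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGTTATCTC AAGAGAAATG GACGATGAGT"
protein_prefix = "MLSQEKWTMS"

print(translate(gene_prefix) == protein_prefix)
print(round(gc_fraction(gene_prefix), 3))
```

Passing the full gene sequence to `translate` would also yield the terminal `*` for the stop codon, which the protein field omits. The GC fraction of this short prefix is not expected to match the 53% reported for the whole gene.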