Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0337 |
Symbol | |
ID | 6966998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 343095 |
End bp | 345620 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384398 |
Product | hypothetical protein |
Protein accession | YP_002268913 |
Protein GI | 209400111 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.365908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.874038 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTAC GACGGTTCTC CCCAGGACTG AAAGCCCAGT TTGCCTTCGG CATGGTCTTT TTGTTCGTTC AGCCCGATGC CAGCGCTGCT GACATAAGTG CGCAGCAAAT AGGTGGGGTG ATTATTCCGC AGGCCTTCAG TCAGGCGCTT CAGGACGGCA TGAGCGTCCC GCTCTATATT CATCTCGCCG GTAGCCAGGG TCGCCAGGAC GATCAGCGAA TCGGCAGCGC TTTTATCTGG TTGGATGATG GACAGCTACG CATCCGGAAA ATACAGCTGG AAGAGAGTGA AGATAACGCC AGTGTCAGCG AACAAACTCG ACAGCAGCTG ATGGCTCTGG CGAACGCCCC GTTCAATGAG GCCCTTACCA TCCCCCTGAC TGACAACGCG CAGCTGGATC TCAGCTTGCG CCAACTGCTG CTGCAGCTGG TGGTCAAGCG CGAAGCGCTG GGCACTGTAC TACGCTCACG TAGCGAAGAC ATCGGGCAGT CCAGTGTTAA CACCCTCAGC AGTAATCTGA GCTATAACTT CGGCATCTAT AACAACCAGT TGCGTAACGG CGGGAGCAAC ACATCCAGCT ATCTGTCGCT GAATAACGTT ACTGCACTGC GCGAACATCA TGTGGTGCTC GACGGCTCGC TGTACGGGAT CGGTAGCGGT CAACAGGACA GTGAATTATA TAAAGCGATG TATGAACGCG ATTTTGCCGG TCACCGATTT GCCGGTGGAA CGCTCGACAC CTGGAACTTG CAGTCCTTAG GGCCGATGAC CGCCATTTCA GCAGGGAAGA TTTACGGCCT TTCCTGGGGA AACCAGGCCA GCTCCACCAT CTTCGACAGC AGCCAGTCAG CCACGCCAGT GATCGCCTTT TTACCGGCGG CGGGTGAAGT ACATCTCACC CGTGATGGGC GGTTACTAAG CGTTCAGAAC TTCACCATGG GCAATCATGA AGTGGATACC CGGGGTCTAC CATACGGTAT TTACGATGTG GAAGTTGAGG TGATCGTTAA CGGTCGCGTG ATCAGCAAAC GCACCCAGCG GGTCAATAAG CTGTTTAGCC GGGGGCGCGG CGTCGGTGCA CCACTGGCGT GGCAGGTATG GGGCGGTAGC TTTCATATGG ATCGCTGGTC GGAAAACGGG AAAAAGACGC GACCAGCTAA AGAGAGTTGG CTGGCAGGTG CCTCGACCTC CGGCTCACTG AGTACGCTTA GCTGGGCGGC AACGGGATAT GGATACGATA ATCAGGCGGT GGGTGAAACC CGTCTGACGC TGCCGCTTGG GGGAGCGATC AACGTTAACC TGCAAAATAT GCTGGCCAGT GACAGCTCAT GGAGCAGCAT CGGCAGCATC AGCGCCACTC TACCGGGAGG CTTTAGTTCG CTGTGGGTTA ATCAGGAAAA AACCCGCATT GGCAATCAAT TGCGACGTAG CGATGCCGAC AACCGTGCTA TCGGCGGCAC ACTCAACCTG AACTCACTGT GGTCGAAGCT GGGCACATTC AGCATCAGCT ACAATGATGA CCGCCGTTAC AACAGCCATT ATTACACGGC AGATTACTAT CAAAATGTCT ACAGCGGTAC CTTTGGTTCG CTTGGCCTGC GGGCCGGTAT TCAGCGCTAT AACAACGGCG ACAGCAACGC CAATACAGGG AAATATATCG CTCTCGATCT CTCGCTACCA CTGGGCAACT GGTTTAGCGC AGGGATGACT CATCAAAACG GCTACACCAT GGCAAACCTG TCAGCACGCA AGCAGTTTGA TGAAGGGACC ATTCGCACTG TTGGTGCCAA TCTGTCACGA GCCATCTCCG GCGATACCGG TGATGACAAA ACTCTCAGTG GTGGGGCGTA TGCACAGTTC GACGCTCGCT ACGCCAGCGG AACGCTGAAC GTCAATAGCG CGGCGGACGG CTACGTCAAT ACCAATTTAA CCGCCAATGG CAGCGTCGGC TGGCAGGGTA AAAACATTGC TGCCAGCGGG CGGACTGATG GCAACGCTGG GGTGATATTC AACACCGGGC TGGAGGACGA CGGTCAGATC AGCGCCAAAA TCAACGGGCG GATTTTCCCG CTTAACGGCA AGCGTAACTA TCTCCCGCTC TCTCCCTATG GAAGATATGA GGTGGAGTTA CAGAACAGCA AAAACTCACT CGACAGTTAC GATATCGTCA GCGGTCGCAA AAGTCATCTG ACTCTCTATC CAGGCAATGT CGCTGTCATT GAGCCAGAGG TGAAGCAGAT GGTTACCGTC TCCGGTCGTA TCCGTGCGGA AGACGGCACA CTGCTGGCTA ACGCACGGAT TAACAACCAT ATCGGCCGAA CCCGAACCGA TGAAAACGGC GAGTTTGTCA TGGACGTGGA TAAGAAATAC CCCACTATCG ATTTTCGCTA CAGTGGCAAT AAAACCTGCG AAGTGGCACT GGAACTCAAC CAGGCGCGCG GTGCCGTCTG GGTCGGTGAT GTGGTCTGCA GCGGCCTCTC ATCGTGGGCG GCGGTGACGC AGACAGGAGA AGAGAATGAG AGTTAA
|
Protein sequence | MPLRRFSPGL KAQFAFGMVF LFVQPDASAA DISAQQIGGV IIPQAFSQAL QDGMSVPLYI HLAGSQGRQD DQRIGSAFIW LDDGQLRIRK IQLEESEDNA SVSEQTRQQL MALANAPFNE ALTIPLTDNA QLDLSLRQLL LQLVVKREAL GTVLRSRSED IGQSSVNTLS SNLSYNFGIY NNQLRNGGSN TSSYLSLNNV TALREHHVVL DGSLYGIGSG QQDSELYKAM YERDFAGHRF AGGTLDTWNL QSLGPMTAIS AGKIYGLSWG NQASSTIFDS SQSATPVIAF LPAAGEVHLT RDGRLLSVQN FTMGNHEVDT RGLPYGIYDV EVEVIVNGRV ISKRTQRVNK LFSRGRGVGA PLAWQVWGGS FHMDRWSENG KKTRPAKESW LAGASTSGSL STLSWAATGY GYDNQAVGET RLTLPLGGAI NVNLQNMLAS DSSWSSIGSI SATLPGGFSS LWVNQEKTRI GNQLRRSDAD NRAIGGTLNL NSLWSKLGTF SISYNDDRRY NSHYYTADYY QNVYSGTFGS LGLRAGIQRY NNGDSNANTG KYIALDLSLP LGNWFSAGMT HQNGYTMANL SARKQFDEGT IRTVGANLSR AISGDTGDDK TLSGGAYAQF DARYASGTLN VNSAADGYVN TNLTANGSVG WQGKNIAASG RTDGNAGVIF NTGLEDDGQI SAKINGRIFP LNGKRNYLPL SPYGRYEVEL QNSKNSLDSY DIVSGRKSHL TLYPGNVAVI EPEVKQMVTV SGRIRAEDGT LLANARINNH IGRTRTDENG EFVMDVDKKY PTIDFRYSGN KTCEVALELN QARGAVWVGD VVCSGLSSWA AVTQTGEENE S
|
| |