Gene ECH74115_0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0337 
Symbol 
ID6966998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp343095 
End bp345620 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content55% 
IMG OID643384398 
Producthypothetical protein 
Protein accessionYP_002268913 
Protein GI209400111 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.365908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.874038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTAC GACGGTTCTC CCCAGGACTG AAAGCCCAGT TTGCCTTCGG CATGGTCTTT 
TTGTTCGTTC AGCCCGATGC CAGCGCTGCT GACATAAGTG CGCAGCAAAT AGGTGGGGTG
ATTATTCCGC AGGCCTTCAG TCAGGCGCTT CAGGACGGCA TGAGCGTCCC GCTCTATATT
CATCTCGCCG GTAGCCAGGG TCGCCAGGAC GATCAGCGAA TCGGCAGCGC TTTTATCTGG
TTGGATGATG GACAGCTACG CATCCGGAAA ATACAGCTGG AAGAGAGTGA AGATAACGCC
AGTGTCAGCG AACAAACTCG ACAGCAGCTG ATGGCTCTGG CGAACGCCCC GTTCAATGAG
GCCCTTACCA TCCCCCTGAC TGACAACGCG CAGCTGGATC TCAGCTTGCG CCAACTGCTG
CTGCAGCTGG TGGTCAAGCG CGAAGCGCTG GGCACTGTAC TACGCTCACG TAGCGAAGAC
ATCGGGCAGT CCAGTGTTAA CACCCTCAGC AGTAATCTGA GCTATAACTT CGGCATCTAT
AACAACCAGT TGCGTAACGG CGGGAGCAAC ACATCCAGCT ATCTGTCGCT GAATAACGTT
ACTGCACTGC GCGAACATCA TGTGGTGCTC GACGGCTCGC TGTACGGGAT CGGTAGCGGT
CAACAGGACA GTGAATTATA TAAAGCGATG TATGAACGCG ATTTTGCCGG TCACCGATTT
GCCGGTGGAA CGCTCGACAC CTGGAACTTG CAGTCCTTAG GGCCGATGAC CGCCATTTCA
GCAGGGAAGA TTTACGGCCT TTCCTGGGGA AACCAGGCCA GCTCCACCAT CTTCGACAGC
AGCCAGTCAG CCACGCCAGT GATCGCCTTT TTACCGGCGG CGGGTGAAGT ACATCTCACC
CGTGATGGGC GGTTACTAAG CGTTCAGAAC TTCACCATGG GCAATCATGA AGTGGATACC
CGGGGTCTAC CATACGGTAT TTACGATGTG GAAGTTGAGG TGATCGTTAA CGGTCGCGTG
ATCAGCAAAC GCACCCAGCG GGTCAATAAG CTGTTTAGCC GGGGGCGCGG CGTCGGTGCA
CCACTGGCGT GGCAGGTATG GGGCGGTAGC TTTCATATGG ATCGCTGGTC GGAAAACGGG
AAAAAGACGC GACCAGCTAA AGAGAGTTGG CTGGCAGGTG CCTCGACCTC CGGCTCACTG
AGTACGCTTA GCTGGGCGGC AACGGGATAT GGATACGATA ATCAGGCGGT GGGTGAAACC
CGTCTGACGC TGCCGCTTGG GGGAGCGATC AACGTTAACC TGCAAAATAT GCTGGCCAGT
GACAGCTCAT GGAGCAGCAT CGGCAGCATC AGCGCCACTC TACCGGGAGG CTTTAGTTCG
CTGTGGGTTA ATCAGGAAAA AACCCGCATT GGCAATCAAT TGCGACGTAG CGATGCCGAC
AACCGTGCTA TCGGCGGCAC ACTCAACCTG AACTCACTGT GGTCGAAGCT GGGCACATTC
AGCATCAGCT ACAATGATGA CCGCCGTTAC AACAGCCATT ATTACACGGC AGATTACTAT
CAAAATGTCT ACAGCGGTAC CTTTGGTTCG CTTGGCCTGC GGGCCGGTAT TCAGCGCTAT
AACAACGGCG ACAGCAACGC CAATACAGGG AAATATATCG CTCTCGATCT CTCGCTACCA
CTGGGCAACT GGTTTAGCGC AGGGATGACT CATCAAAACG GCTACACCAT GGCAAACCTG
TCAGCACGCA AGCAGTTTGA TGAAGGGACC ATTCGCACTG TTGGTGCCAA TCTGTCACGA
GCCATCTCCG GCGATACCGG TGATGACAAA ACTCTCAGTG GTGGGGCGTA TGCACAGTTC
GACGCTCGCT ACGCCAGCGG AACGCTGAAC GTCAATAGCG CGGCGGACGG CTACGTCAAT
ACCAATTTAA CCGCCAATGG CAGCGTCGGC TGGCAGGGTA AAAACATTGC TGCCAGCGGG
CGGACTGATG GCAACGCTGG GGTGATATTC AACACCGGGC TGGAGGACGA CGGTCAGATC
AGCGCCAAAA TCAACGGGCG GATTTTCCCG CTTAACGGCA AGCGTAACTA TCTCCCGCTC
TCTCCCTATG GAAGATATGA GGTGGAGTTA CAGAACAGCA AAAACTCACT CGACAGTTAC
GATATCGTCA GCGGTCGCAA AAGTCATCTG ACTCTCTATC CAGGCAATGT CGCTGTCATT
GAGCCAGAGG TGAAGCAGAT GGTTACCGTC TCCGGTCGTA TCCGTGCGGA AGACGGCACA
CTGCTGGCTA ACGCACGGAT TAACAACCAT ATCGGCCGAA CCCGAACCGA TGAAAACGGC
GAGTTTGTCA TGGACGTGGA TAAGAAATAC CCCACTATCG ATTTTCGCTA CAGTGGCAAT
AAAACCTGCG AAGTGGCACT GGAACTCAAC CAGGCGCGCG GTGCCGTCTG GGTCGGTGAT
GTGGTCTGCA GCGGCCTCTC ATCGTGGGCG GCGGTGACGC AGACAGGAGA AGAGAATGAG
AGTTAA
 
Protein sequence
MPLRRFSPGL KAQFAFGMVF LFVQPDASAA DISAQQIGGV IIPQAFSQAL QDGMSVPLYI 
HLAGSQGRQD DQRIGSAFIW LDDGQLRIRK IQLEESEDNA SVSEQTRQQL MALANAPFNE
ALTIPLTDNA QLDLSLRQLL LQLVVKREAL GTVLRSRSED IGQSSVNTLS SNLSYNFGIY
NNQLRNGGSN TSSYLSLNNV TALREHHVVL DGSLYGIGSG QQDSELYKAM YERDFAGHRF
AGGTLDTWNL QSLGPMTAIS AGKIYGLSWG NQASSTIFDS SQSATPVIAF LPAAGEVHLT
RDGRLLSVQN FTMGNHEVDT RGLPYGIYDV EVEVIVNGRV ISKRTQRVNK LFSRGRGVGA
PLAWQVWGGS FHMDRWSENG KKTRPAKESW LAGASTSGSL STLSWAATGY GYDNQAVGET
RLTLPLGGAI NVNLQNMLAS DSSWSSIGSI SATLPGGFSS LWVNQEKTRI GNQLRRSDAD
NRAIGGTLNL NSLWSKLGTF SISYNDDRRY NSHYYTADYY QNVYSGTFGS LGLRAGIQRY
NNGDSNANTG KYIALDLSLP LGNWFSAGMT HQNGYTMANL SARKQFDEGT IRTVGANLSR
AISGDTGDDK TLSGGAYAQF DARYASGTLN VNSAADGYVN TNLTANGSVG WQGKNIAASG
RTDGNAGVIF NTGLEDDGQI SAKINGRIFP LNGKRNYLPL SPYGRYEVEL QNSKNSLDSY
DIVSGRKSHL TLYPGNVAVI EPEVKQMVTV SGRIRAEDGT LLANARINNH IGRTRTDENG
EFVMDVDKKY PTIDFRYSGN KTCEVALELN QARGAVWVGD VVCSGLSSWA AVTQTGEENE
S