Gene ECH74115_0642 details


Gene Information

Locus tag: ECH74115_0642
Symbol: nfrB
ID: 6968533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 665374
End bp: 667611
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 52%
IMG OID: 643384680
Product: bacteriophage N4 adsorption protein B
Protein accession: YP_002269193
Protein GI: 209400631
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
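
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function and constant names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 665374
END_BP = 667611


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2238 bp) and Protein Length (745 aa) fields.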


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTATG GCTTAAAAGT AATCGCGATA 
ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC
TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC
CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC GTGGAATGAA
ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT
CATATCTTTG TTGGCACCTA CCCCAACGAC CCGGACACTC AGCGTGATGT TGACGAAGTG
TGCGCTCGCT TCCCGAATGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA
GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAGT TTGAGCGTAG CGCCAATTTC
GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT
CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCGGTGTA TCCGTTTGAA
CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA GCTGCATGGC
AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAAGTGC CCAGCGCAGG CGTCGGCACC
TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC
GTGCAGAGTC TGACTGAAGA TTACGACATT GGCTTCCGCC TGAAAGAAAA AGGTATGACG
GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT
TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT
TCGACTGCGG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA
ACCCATAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG
GCAATCAGTA ACTTTGTCAG CTTCCTCGCA ATGCTGGTGA TGATCCAGCT TTTGCTGTTG
CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT TAGCGGCAGC
GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGCTTGA TGGTTAACCG CATCGTGCAG
CGGGTGATTT TCGTCACTGG CTACTACGGC CTGACGCAGG GGCTACTATC TGTCCTGCGT
CTTTTCTGGG GCAACCTGAT TAACTTTATG GCCAACTGGC GCGCGTTAAA ACAGGTACTT
CAACACGGCG ATCCACGTCG CGTCGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTC
ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATTC TGCTGGAAAA TCAGGTCATC
ACTGAAGAAC AGCTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT
TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAGCTGG CACAGGCGCT GGCAGAGCAA
AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CGTCCTCGCT GATTGCCGAA
ATGCCGGCCT CCGTGGCGCT GCATTATGCG GTACTGCCGC TGCGTCTGGA AAATGATGAG
TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCGGTTTCGC TGGCGGCCCT GACGCGTAAA
GTCGGACGCA AAGTGCGTTA TGTCATTGTT CTGCGGGGAC AAATTGTCAC GGGGTTACGT
CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCGGTTCAG
CATCAGTGGC TCACGGAACA GCAGGCCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG
TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG
TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC
GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG
ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAC AGGTTGCGCA ACTGGAGTCC
GAAAATGAAG GAGAATAA
 
Protein sequence
MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY 
RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV
CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR
LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT
CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF
LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG
AISNFVSFLA MLVMIQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ
RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV
TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ
NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLENDE LIVGSEDGID PVSLAALTRK
VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQAG EIWRQYVPHQ
FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS
MQSLLLKAGL NTEQVAQLES ENEGE
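
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiator codon. The sketch below illustrates this on the first line of the gene sequence. It is an illustration under stated assumptions, not the database's own translation method: the codon table is built from the standard NCBI amino-acid string (which table 11 shares), the set of start codons is only a subset of those table 11 allows, and all names (`translate_cds`, `FIRST_LINE`, and so on) are hypothetical.

```python
# Codon table built in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}

# First line of the gene sequence, copied with its spacing intact.
FIRST_LINE = (
    "GTGGACTGGC TTCTTGATGT TTTTGCTACC "
    "TGGCTCTATG GCTTAAAAGT AATCGCGATA"
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An initiator codon in first position is read as methionine.
        if index == 0 and codon in START_CODONS:
            residue = "M"
        residues.append(residue)
    return "".join(residues)


print(translate_cds(FIRST_LINE))
```

The printed residues can be compared with the first two blocks of the protein sequence (MDWLLDVFAT WLYGLKVIAI). Passing the full gene sequence to `translate_cds` would allow the same comparison over all 745 residues.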