Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0642 |
Symbol | nfrB |
ID | 6968533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 665374 |
End bp | 667611 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384680 |
Product | bacteriophage N4 adsorption protein B |
Protein accession | YP_002269193 |
Protein GI | 209400631 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTATG GCTTAAAAGT AATCGCGATA ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC GTGGAATGAA ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT CATATCTTTG TTGGCACCTA CCCCAACGAC CCGGACACTC AGCGTGATGT TGACGAAGTG TGCGCTCGCT TCCCGAATGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAGT TTGAGCGTAG CGCCAATTTC GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCGGTGTA TCCGTTTGAA CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA GCTGCATGGC AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAAGTGC CCAGCGCAGG CGTCGGCACC TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC GTGCAGAGTC TGACTGAAGA TTACGACATT GGCTTCCGCC TGAAAGAAAA AGGTATGACG GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT TCGACTGCGG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA ACCCATAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG GCAATCAGTA ACTTTGTCAG CTTCCTCGCA ATGCTGGTGA TGATCCAGCT TTTGCTGTTG CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT TAGCGGCAGC GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGCTTGA TGGTTAACCG CATCGTGCAG CGGGTGATTT TCGTCACTGG CTACTACGGC CTGACGCAGG GGCTACTATC TGTCCTGCGT CTTTTCTGGG GCAACCTGAT TAACTTTATG GCCAACTGGC GCGCGTTAAA ACAGGTACTT CAACACGGCG ATCCACGTCG CGTCGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTC ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATTC TGCTGGAAAA TCAGGTCATC ACTGAAGAAC AGCTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAGCTGG CACAGGCGCT GGCAGAGCAA AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CGTCCTCGCT GATTGCCGAA ATGCCGGCCT CCGTGGCGCT GCATTATGCG GTACTGCCGC TGCGTCTGGA AAATGATGAG TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCGGTTTCGC TGGCGGCCCT GACGCGTAAA GTCGGACGCA AAGTGCGTTA TGTCATTGTT CTGCGGGGAC AAATTGTCAC GGGGTTACGT CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCGGTTCAG CATCAGTGGC TCACGGAACA GCAGGCCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAC AGGTTGCGCA ACTGGAGTCC GAAAATGAAG GAGAATAA
|
Protein sequence | MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG AISNFVSFLA MLVMIQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLENDE LIVGSEDGID PVSLAALTRK VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQAG EIWRQYVPHQ FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS MQSLLLKAGL NTEQVAQLES ENEGE
|
| |