Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0586 |
Symbol | nfrB |
ID | 6143790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 593550 |
End bp | 595787 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615478 |
Product | bacteriophage N4 adsorption protein B |
Protein accession | YP_001742684 |
Protein GI | 170680338 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.437214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTACG GCTTAAAAGT AATCGCGATA ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC GTGGAATGAA ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT CATATCTTTG TTGGCACCTA CCCCAACGAC CCCGATACTC AGCGTGATGT TGACGAAGTG TGCGCTCGCT TCCCGAACGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAAT TTGAACGTAG CGCCAATTTC GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCAGTGTA TCCGTTCGAA CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA ACTGCATGGC AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAGGTGC CCAGCGCAGG CGTCGGCACC TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC GTGCAGAGTC TGACTGAAGA TTACGATATT GGCTTCCGCC TGAAAGAAAA AGGTATGACG GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT TCGACTGCGG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA ACCCACAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG GCAATCAGTA ACTTTGTCAG CTTCCTCGCG ATGCTGGTGA TGATCCAGCT TTTGCTGTTA CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT CAGCGGCAGC GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGTTTGA TGGTTAACCG CATCGTGCAG CGGGTGATTT TTGTCACTGG CTACTACGGT CTGACGCAGG GGCTGCTATC TGTCCTGCGT CTTTTCTGGG GTAACCTGAT TAACTTCATG GCCAACTGGC GCGCGTTAAA ACAGGTACTT CAACACGGCG ATCCGCGTCG AGTGGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTG ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATCC TGCTGGAAAA TCAGGTCATC ACTGAAGAAC AACTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAACTGG CACAGGCGCT GGCAGAGCAA AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CTTCCTCGCT GATTGCCGAA ATGCCGGCCT CCGTGGCGCT GCATTATGCG GTACTGCCGC TGCGTCTGGA CAATGATGAG TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCGGTTTCGC TGGCGGCCCT GACGCGTAAA GTCGGACGCA AAGTGCGTTA CGTCATTGTT CTGCGGGGAC AAATTGTCAC CGGATTACGC CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCAGTTCAG CATCAGTGGC TCACGGAACA GCAGGCCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAA AGGTTGCGCA ACTGGAGTCC GAAAATGAAG GAGAATAA
|
Protein sequence | MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG AISNFVSFLA MLVMIQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLDNDE LIVGSEDGID PVSLAALTRK VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQAG EIWRQYVPHQ FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS MQSLLLKAGL NTEKVAQLES ENEGE
|
| |