Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3083 |
Symbol | nfrB |
ID | 6066206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3374517 |
End bp | 3376754 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602499 |
Product | bacteriophage N4 adsorption protein B |
Protein accession | YP_001726034 |
Protein GI | 170021080 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTACG GCTTAAAAGT AATCGCGATA ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC ATGGAATGAA ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT CATATCTTTG TTGGCACCTA CCCCAACGAC CCCGATACTC AGCGTGATGT TGACGAAGTG TGCGCTCGCT TCCCGAATGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAAT TTGAGCGTAG CGCCAATTTC GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCGGTGTA TCCGTTCGAA CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA GCTGCATGGC AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAAGTGC CCAGCGCAGG CGTCGGCACC TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC GTGCAGAGTC TGACTGAAGA TTACGACATT GGTTTCCGCC TGAAAGAGAA AGGTATGACG GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT TCGACTGCAG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA ACCCACAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG GCAATCAGTA ACTTTGTCAG CTTCCTCGCG ATGCTGGTGA TGCTCCAGCT TTTGCTGTTG CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT TAGCGGCAGC GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGCTTGA TGGTTAACCG CATCGTGCAG CGGGTGATTT TCGTCACTGG CTACTACGGT CTGACGCAGG GGCTGCTATC TGTCCTGCGT CTTTTCTGGG GCAACCTGAT TAACTTTATG GCCAACTGGC GCGCGCTAAA ACAGGTACTT CAACACGGCG ATCCACGTCG CGTCGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTC ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATTC TGCTGGAAAA TCAGGTCATC ACTGAAGAAC AACTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAGCTGG CACAGGCGCT GGCAGAGCAA AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CTTCCTCGCT GATTGCCGAA ATGCCGGCCT CCGTGGCGCT GCATTATGCA GTACTGCCGC TGCGTCTGGA AAATGATGAG TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCTGTTTCGC TGGCGGCCCT GACGCGTAAA GTCGGACGCA AAGTGCGTTA CGTCATTGTT CTGCGGGGAC AAATTGTCAC GGGGTTACGT CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCGGTTCAG CATCAGTGGC TCACGGAACA GCAGACCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAC AGGTTGCGCA ACTGGAGTCC GAAAATGAAG GAGAATAA
|
Protein sequence | MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG AISNFVSFLA MLVMLQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLENDE LIVGSEDGID PVSLAALTRK VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQTG EIWRQYVPHQ FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS MQSLLLKAGL NTEQVAQLES ENEGE
|
| |