Gene EcSMS35_0586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0586 
SymbolnfrB 
ID6143790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp593550 
End bp595787 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content52% 
IMG OID641615478 
Productbacteriophage N4 adsorption protein B 
Protein accessionYP_001742684 
Protein GI170680338 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.437214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTACG GCTTAAAAGT AATCGCGATA 
ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC
TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC
CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC GTGGAATGAA
ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT
CATATCTTTG TTGGCACCTA CCCCAACGAC CCCGATACTC AGCGTGATGT TGACGAAGTG
TGCGCTCGCT TCCCGAACGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA
GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAAT TTGAACGTAG CGCCAATTTC
GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT
CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCAGTGTA TCCGTTCGAA
CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA ACTGCATGGC
AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAGGTGC CCAGCGCAGG CGTCGGCACC
TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC
GTGCAGAGTC TGACTGAAGA TTACGATATT GGCTTCCGCC TGAAAGAAAA AGGTATGACG
GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT
TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT
TCGACTGCGG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA
ACCCACAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG
GCAATCAGTA ACTTTGTCAG CTTCCTCGCG ATGCTGGTGA TGATCCAGCT TTTGCTGTTA
CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT CAGCGGCAGC
GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGTTTGA TGGTTAACCG CATCGTGCAG
CGGGTGATTT TTGTCACTGG CTACTACGGT CTGACGCAGG GGCTGCTATC TGTCCTGCGT
CTTTTCTGGG GTAACCTGAT TAACTTCATG GCCAACTGGC GCGCGTTAAA ACAGGTACTT
CAACACGGCG ATCCGCGTCG AGTGGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTG
ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATCC TGCTGGAAAA TCAGGTCATC
ACTGAAGAAC AACTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT
TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAACTGG CACAGGCGCT GGCAGAGCAA
AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CTTCCTCGCT GATTGCCGAA
ATGCCGGCCT CCGTGGCGCT GCATTATGCG GTACTGCCGC TGCGTCTGGA CAATGATGAG
TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCGGTTTCGC TGGCGGCCCT GACGCGTAAA
GTCGGACGCA AAGTGCGTTA CGTCATTGTT CTGCGGGGAC AAATTGTCAC CGGATTACGC
CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCAGTTCAG
CATCAGTGGC TCACGGAACA GCAGGCCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG
TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG
TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC
GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG
ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAA AGGTTGCGCA ACTGGAGTCC
GAAAATGAAG GAGAATAA
 
Protein sequence
MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY 
RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV
CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR
LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT
CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF
LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG
AISNFVSFLA MLVMIQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ
RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV
TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ
NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLDNDE LIVGSEDGID PVSLAALTRK
VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQAG EIWRQYVPHQ
FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS
MQSLLLKAGL NTEKVAQLES ENEGE