Gene EcolC_3083 details

Gene Information

Locus tag: EcolC_3083
Symbol: nfrB
ID: 6066206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3374517
End bp: 3376754
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 52%
IMG OID: 641602499
Product: bacteriophage N4 adsorption protein B
Protein accession: YP_001726034
Protein GI: 170021080
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
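
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3374517
end_bp = 3376754

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the reported gene length of 2238 bp and protein length of 745 aa.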


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACTGGC TTCTTGATGT TTTTGCTACC TGGCTCTACG GCTTAAAAGT AATCGCGATA 
ACGTTAGCGG TCATCATGTT CATCAGCGGG CTGGACGATT TTTTTATTGA TGTCGTCTAC
TGGGTACGCC GCATTAAACG CAAGTTGAGT GTTTATCGCC GCTACCCGCG AATGAGTTAC
CGCGAACTGT ATAAACCAGA TGAAAAACCG TTAGCGATTA TGGTTCCGGC ATGGAATGAA
ACGGGCGTCA TCGGCAATAT GGCCGAGCTG GCGGCGACCA CGCTCGACTA CGAAAACTAT
CATATCTTTG TTGGCACCTA CCCCAACGAC CCCGATACTC AGCGTGATGT TGACGAAGTG
TGCGCTCGCT TCCCGAATGT GCATAAGGTA GTCTGCGCGC GTCCTGGCCC CACCAGCAAA
GCCGACTGTC TGAACAACGT GCTGGACGCC ATCACCCAAT TTGAGCGTAG CGCCAATTTC
GCTTTTGCTG GTTTTATTCT GCATGACGCC GAAGATGTGA TTTCACCGAT GGAATTGCGT
CTGTTCAACT ATCTGGTCGA GCGTAAAGAT CTGATTCAGA TCCCGGTGTA TCCGTTCGAA
CGCGAATGGA CGCACTTCAC CAGCATGACT TACATTGATG AGTTTTCAGA GCTGCATGGC
AAAGATGTTC CGGTGCGTGA AGCCCTCGCC GGACAAGTGC CCAGCGCAGG CGTCGGCACC
TGTTTCAGCC GCCGCGCCGT GACCGCTCTG TTAGCTGACG GTGACGGTAT TGCTTTCGAC
GTGCAGAGTC TGACTGAAGA TTACGACATT GGTTTCCGCC TGAAAGAGAA AGGTATGACG
GAAATTTTTG TCCGTTTTCC GGTGGTGGAC GAAGCCAAAG AACGCGAGCA GCGTAAATTT
TTACAGCACG CACGGACGTC AAACATGATC TGCGTGCGCG AATATTTCCC CGATACCTTT
TCGACTGCAG TTCGACAAAA ATCTCGCTGG ATCATCGGCA TTGTTTTCCA GGGCTTTAAA
ACCCACAAAT GGACCTCCAG CCTGACGCTG AACTACTTTC TCTGGCGCGA CCGCAAAGGG
GCAATCAGTA ACTTTGTCAG CTTCCTCGCG ATGCTGGTGA TGCTCCAGCT TTTGCTGTTG
CTGGCGTATG AAAGTTTGTG GCCCGATGCC TGGCATTTCC TTTCTATTTT TAGCGGCAGC
GCATGGTTAA TGACCCTGCT GTGGCTAAAC TTTGGCTTGA TGGTTAACCG CATCGTGCAG
CGGGTGATTT TCGTCACTGG CTACTACGGT CTGACGCAGG GGCTGCTATC TGTCCTGCGT
CTTTTCTGGG GCAACCTGAT TAACTTTATG GCCAACTGGC GCGCGCTAAA ACAGGTACTT
CAACACGGCG ATCCACGTCG CGTCGCGTGG GATAAAACAA CGCATGACTT CCCCAGCGTC
ACTGGCGATA CCCGCTCGTT GCGCCCGTTA GGTCAAATTC TGCTGGAAAA TCAGGTCATC
ACTGAAGAAC AACTCGATAC AGCACTGCGT AATCGCGTCG AAGGTCTACG CCTGGGCGGT
TCAATGCTGA TGCAGGGGCT GATTAGCGCC GAGCAGCTGG CACAGGCGCT GGCAGAGCAA
AACGGCGTGG CGTGGGAATC CATCGATGCC TGGCAGATCC CTTCCTCGCT GATTGCCGAA
ATGCCGGCCT CCGTGGCGCT GCATTATGCA GTACTGCCGC TGCGTCTGGA AAATGATGAG
TTAATTGTCG GCAGTGAAGA TGGTATTGAC CCTGTTTCGC TGGCGGCCCT GACGCGTAAA
GTCGGACGCA AAGTGCGTTA CGTCATTGTT CTGCGGGGAC AAATTGTCAC GGGGTTACGT
CACTGGTATG CACGCCGACG CGGTCACGAT CCGCGGGCAA TGTTGTACAA TGCGGTTCAG
CATCAGTGGC TCACGGAACA GCAGACCGGT GAAATCTGGC GGCAATATGT GCCGCATCAG
TTCCTGTTCG CCGAAATACT GACCACGCTC GGTCATATTA ATCGTTCAGC AATTAACGTG
TTGTTATTGC GCCATGAACG CAGTTCTCTG CCGCTCGGCA AGTTTTTGGT CACCGAAGGC
GTTATCAGCC AGGAAACGTT GGATCGCGTC CTGACAATTC AACGCGAATT ACAAGTTTCG
ATGCAATCAC TATTACTCAA AGCAGGTTTA AACACAGAAC AGGTTGCGCA ACTGGAGTCC
GAAAATGAAG GAGAATAA
 
Protein sequence
MDWLLDVFAT WLYGLKVIAI TLAVIMFISG LDDFFIDVVY WVRRIKRKLS VYRRYPRMSY 
RELYKPDEKP LAIMVPAWNE TGVIGNMAEL AATTLDYENY HIFVGTYPND PDTQRDVDEV
CARFPNVHKV VCARPGPTSK ADCLNNVLDA ITQFERSANF AFAGFILHDA EDVISPMELR
LFNYLVERKD LIQIPVYPFE REWTHFTSMT YIDEFSELHG KDVPVREALA GQVPSAGVGT
CFSRRAVTAL LADGDGIAFD VQSLTEDYDI GFRLKEKGMT EIFVRFPVVD EAKEREQRKF
LQHARTSNMI CVREYFPDTF STAVRQKSRW IIGIVFQGFK THKWTSSLTL NYFLWRDRKG
AISNFVSFLA MLVMLQLLLL LAYESLWPDA WHFLSIFSGS AWLMTLLWLN FGLMVNRIVQ
RVIFVTGYYG LTQGLLSVLR LFWGNLINFM ANWRALKQVL QHGDPRRVAW DKTTHDFPSV
TGDTRSLRPL GQILLENQVI TEEQLDTALR NRVEGLRLGG SMLMQGLISA EQLAQALAEQ
NGVAWESIDA WQIPSSLIAE MPASVALHYA VLPLRLENDE LIVGSEDGID PVSLAALTRK
VGRKVRYVIV LRGQIVTGLR HWYARRRGHD PRAMLYNAVQ HQWLTEQQTG EIWRQYVPHQ
FLFAEILTTL GHINRSAINV LLLRHERSSL PLGKFLVTEG VISQETLDRV LTIQRELQVS
MQSLLLKAGL NTEQVAQLES ENEGE
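
The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first and last blocks of the gene sequence. It is a hedged, self-contained example: the function `translate_cds` and the `START_CODONS` subset are hypothetical names introduced here, and only three of the table 11 start codons are listed.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiation codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two 10-base blocks of the gene sequence above.
first_block = "GTGGACTGGC TTCTTGATGT TTTTGCTACC"
last_block = "GAAAATGAAG GAGAATAA"

print(translate_cds(first_block))  # MDWLLDVFAT
print(translate_cds(last_block))   # ENEGE
```

The two outputs match the first ten and last five residues of the protein sequence listed above. For whole-genome work, an established library with a complete table 11 definition would be a safer choice than this hand-built table.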