Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1460 |
Symbol | flgJ |
ID | 6969934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1440064 |
End bp | 1441005 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385433 |
Product | flagellar rod assembly protein/muramidase FlgJ |
Protein accession | YP_002269927 |
Protein GI | 209396103 |
COG category | [M] Cell wall/membrane/envelope biogenesis [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3951] Rod binding protein |
TIGRFAM ID | [TIGR02541] flagellar rod assembly protein/muramidase FlgJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.969398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000147806 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA CTAAAGGCGA AAGCGAGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGTCAGGTG GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCGTTACC AAAAGATGGC CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA CAGATGACGA CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAGCA GATGACGCCA GAACAACCAT TGCCAGAGGA GTCCACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACC GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC TACGATGATT CGCTGCCAGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCGCTG GAATCTGGCT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTCA CTGAAATCAC CACGACTGAA TATGAAAATG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG GAAGCATTGT CGGATTACGT TGGGCTGTTA ACACGTAACC CGCGCTACGC CGCCGTGACG ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT CCTCACTATG CCCGTAAACT CACCAACATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
|
Protein sequence | MISDSKLLAS AAWDAQSLNE LKAKASEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG LFSSEHTRLY TSMYDQQIAQ QMTTGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK VSKTYSMNID NLF
|
| |