Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2047 |
Symbol | flgJ |
ID | 6142949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2069019 |
End bp | 2069960 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616923 |
Product | flagellar rod assembly protein/muramidase FlgJ |
Protein accession | YP_001744099 |
Protein GI | 170683831 |
COG category | [M] Cell wall/membrane/envelope biogenesis [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3951] Rod binding protein |
TIGRFAM ID | [TIGR02541] flagellar rod assembly protein/muramidase FlgJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.277131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.106816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCACAATC GCTCAACGAA CTTAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGCCAGGTG GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA CAGATGACGG CGGGCAAAGG TCTGGGGCTG GCAGAGATGA TGGTTAAACA GATGACGCCA GAACAACCAT TGCCAGAGGA GTCCATGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACC GTGGTGCGTT ATCAAAATCA GACGCTTTCG CAGCTGGTGC AAAAGGCCGT ACCACGTAAC TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GTTGCCCGCC CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCGCTG GAATCTGGCT GGGGACAACG GCAAATCCGC CGTGAAAACG GCGAGCCGAG CTATAACCTG TTTGGCGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTCA CTGAAATCAC CACGACTGAA TATGAAAATG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG GAAGCATTGT CGGATTACGT TGGGCTGTTA ACGCGTAACC CGCGCTACGC CGCCGTGACG ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT CCTCACTATG CCCGCAAACT CACCAGCATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
|
Protein sequence | MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESMP AAPMKFPLET VVRYQNQTLS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTSM IQQMKSISDK VSKTYSMNID NLF
|
| |