Gene ECH74115_1460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1460 
SymbolflgJ 
ID6969934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1440064 
End bp1441005 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content55% 
IMG OID643385433 
Productflagellar rod assembly protein/muramidase FlgJ 
Protein accessionYP_002269927 
Protein GI209396103 
COG category[M] Cell wall/membrane/envelope biogenesis
[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3951] Rod binding protein 
TIGRFAM ID[TIGR02541] flagellar rod assembly protein/muramidase FlgJ 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.969398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000147806 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA 
CTAAAGGCGA AAGCGAGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGTCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCGTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGA CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAGCA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCCACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACC
GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC
TACGATGATT CGCTGCCAGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCGCTG
GAATCTGGCT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTCA CTGAAATCAC CACGACTGAA
TATGAAAATG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG
GAAGCATTGT CGGATTACGT TGGGCTGTTA ACACGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT
CCTCACTATG CCCGTAAACT CACCAACATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKASEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTTGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET
VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK
VSKTYSMNID NLF