Gene EcSMS35_2047 details


Gene Information

Locus tag: EcSMS35_2047
Symbol: flgJ
ID: 6142949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2069019
End bp: 2069960
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 55%
IMG OID: 641616923
Product: flagellar rod assembly protein/muramidase FlgJ
Protein accession: YP_001744099
Protein GI: 170683831
COG category: [M] Cell wall/membrane/envelope biogenesis
              [N] Cell motility
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3951] Rod binding protein
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
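
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2069019, 2069960)
print(length)                  # 942
print(protein_length(length))  # 313
```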


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.277131
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.106816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCACAATC GCTCAACGAA 
CTTAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGCCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGG CGGGCAAAGG TCTGGGGCTG GCAGAGATGA TGGTTAAACA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCCATGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACC
GTGGTGCGTT ATCAAAATCA GACGCTTTCG CAGCTGGTGC AAAAGGCCGT ACCACGTAAC
TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GTTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCGCTG
GAATCTGGCT GGGGACAACG GCAAATCCGC CGTGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGCGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTCA CTGAAATCAC CACGACTGAA
TATGAAAATG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG
GAAGCATTGT CGGATTACGT TGGGCTGTTA ACGCGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT
CCTCACTATG CCCGCAAACT CACCAGCATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
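
A GC content figure such as the one reported for this gene can be recomputed from the sequence block. The following is a minimal sketch that assumes the sequence is pasted in as text with the 10-base spacing and line breaks shown here; `gc_percent` is a hypothetical helper name, and how the database itself rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input to show the behaviour; whitespace is stripped before counting.
print(gc_percent("AT GC"))  # 50.0
```

Passing the full gene sequence as one string gives a value that can be compared with the reported percentage.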
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESMP AAPMKFPLET
VVRYQNQTLS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTSM IQQMKSISDK
VSKTYSMNID NLF
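
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration, not the tool the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop spacing and line breaks
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATCAGCG ACAGCAAACT ACTGGCAAGT"))  # MISDSKLLAS
```

Translating the full 942 bp sequence the same way should give a 313-residue protein, with the final TGA codon read as the stop.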