Gene EcE24377A_1204 details


Gene Information

Locus tag: EcE24377A_1204
Symbol: flgJ
ID: 5586381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1214801
End bp: 1215742
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 55%
IMG OID: 640924903
Product: flagellar rod assembly protein/muramidase FlgJ
Protein accession: YP_001462315
Protein GI: 157158487
COG category: [M] Cell wall/membrane/envelope biogenesis
              [N] Cell motility
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3951] Rod binding protein
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
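
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(1214801, 1215742)
print(length_bp)                  # 942
print(protein_length(length_bp))  # 313
```

Both results agree with the Gene Length and Protein Length fields of the record.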


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.592055
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA 
CTAAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGCCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGTCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGG CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAACA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCCACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACC
GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC
TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCACTG
GAATCTGGTT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTTA CTGAAATCAC CACGACTGAA
TATGAAAACG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GCGTCTACAG CTCGTATCTG
GAAGCCTTGT CGGATTACGT TGGGCTGTTA ACGCGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAACA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT
CCTCACTATG CCCGCAAACT CACCAACATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
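
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming GC content means the percentage of G and C bases over all bases after the block spacing is removed; `gc_percent` is a hypothetical helper, and its result for the full sequence is not asserted here.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: 2 of 4 bases are G or C.
print(gc_percent("AT GC"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` and rounding the result would allow a comparison with the reported GC content value.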
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET
VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK
VSKTYSMNID NLF
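
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this, assuming that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the source record.

```python
from itertools import product

# Codons enumerated in TCAG order, matched with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA"
print(translate(first_line))  # MISDSKLLASAAWDAQSLNE
```

The output matches the first 20 residues of the protein sequence listed above.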