Gene EcolC_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2519 
SymbolflgJ 
ID6067379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2771596 
End bp2772537 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content54% 
IMG OID641601925 
Productflagellar rod assembly protein/muramidase FlgJ 
Protein accessionYP_001725477 
Protein GI170020523 
COG category[M] Cell wall/membrane/envelope biogenesis
[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3951] Rod binding protein 
TIGRFAM ID[TIGR02541] flagellar rod assembly protein/muramidase FlgJ 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00319368 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA 
CTAAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGTCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGG CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAACA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCTACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACT
GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC
TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCACTG
GAATCTGGTT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTTA CTGAAATCAC CACGACTGAA
TATGAAAACG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG
GAAGCATTGT CGGATTACGT TGGGCTGTTA ACACGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGGTA TGCTACCGAT
CCTCACTATG CCCGTAAACT CACCAACATG ATCCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET
VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK
VSKTYSMNID NLF