Gene B21_01085 details


Gene Information

Locus tag: B21_01085
Symbol: flgJ
ID: 8115487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1139347
End bp: 1140288
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 55%
IMG OID: 644847343
Product: hypothetical protein
Protein accession: YP_002998916
Protein GI: 251784612
COG category: [M] Cell wall/membrane/envelope biogenesis; [N] Cell motility; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3951] Rod binding protein
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
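
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(1139347, 1140288)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.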


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA 
CTAAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGTCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGG CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAACA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCCACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACT
GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC
TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCACTG
GAATCTGGTT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTTA CTGAAATCAC CACGACTGAA
TATGAAAACG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GCGTCTACAG CTCGTATCTG
GAAGCCTTGT CGGATTACGT TGGGCTGTTA ACGCGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAACA GGGGGCGCAG GCCCTACAGG ACGCGGGCTA TGCCACCGAT
CCTCACTATG CCCGCAAACT CACCAACATG ATTCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGTAT GAACATTGAT AATCTGTTCT GA
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET
VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK
VSKTYSMNID NLF
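
The sequences above are printed in space-separated blocks of ten, so they need to be joined before use. The following is a minimal sketch of how one might strip the blocks, translate a row, and compute a GC fraction. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons (alternative start codons are not handled here; this gene starts with ATG).

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA"
last_row = "GTGAGCAAAA CCTACAGTAT GAACATTGAT AATCTGTTCT GA"

print(translate(join_blocks(first_row)))  # MISDSKLLASAAWDAQSLNE
print(translate(join_blocks(last_row)))   # VSKTYSMNIDNLF
```

Both rows translate to the matching start and end of the protein sequence. Passing the full gene listing to `gc_fraction(join_blocks(...))` gives a value that can be compared with the GC content field.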