Gene EcHS_A1204 details


Gene Information

Locus tag: EcHS_A1204
Symbol: flgJ
ID: 5595088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1201051
End bp: 1201992
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 54%
IMG OID: 640920363
Product: flagellar rod assembly protein/muramidase FlgJ
Protein accession: YP_001457926
Protein GI: 157160608
COG category:
    [M] Cell wall/membrane/envelope biogenesis
    [N] Cell motility
    [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3951] Rod binding protein
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
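
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1201051
end_bp = 1201992

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length)     # 942
print(leftover)        # 0
print(protein_length)  # 313
```

Both derived values agree with the Gene Length and Protein Length fields of the record.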


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.0662676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA 
CTAAAGGCGA AAGCGGGCGA AGATCCGGCG GCAAATATCC GTCCGGTGGC CCGTCAGGTG
GAAGGGATGT TCGTGCAGAT GATGTTGAAA AGCATGCGCG ACGCTTTACC AAAAGATGGC
CTGTTCAGCA GCGAGCACAC TCGCCTGTAT ACCAGTATGT ATGACCAGCA GATTGCCCAA
CAGATGACGG CGGGCAAAGG TCTGGGGCTT GCAGAGATGA TGGTTAAACA GATGACGCCA
GAACAACCAT TGCCAGAGGA GTCTACGCCA GCAGCACCGA TGAAATTCCC GCTCGAAACT
GTGGTGCGTT ATCAAAATCA GGCGCTTTCG CAGCTGGTGC AAAAGGCCGT GCCACGTAAC
TACGATGATT CGCTGCCGGG TGACAGTAAA GCATTCCTCG CGCAACTCTC GCTGCCCGCC
CAACTGGCAA GCCAGCAAAG CGGTGTGCCA CATCATTTGA TCCTCGCTCA GGCGGCACTG
GAATCTGGTT GGGGGCAACG GCAAATCCGC CGCGAAAACG GCGAGCCGAG CTATAACCTG
TTTGGTGTCA AAGCCTCTGG CAACTGGAAA GGGCCAGTTA CTGAAATCAC CACGACTGAA
TATGAAAACG GCGAAGCGAA GAAAGTAAAA GCGAAGTTTC GGGTCTACAG CTCGTATCTG
GAAGCATTGT CGGATTACGT TGGGCTGTTA ACACGTAACC CGCGCTACGC CGCCGTGACG
ACCGCCGCGA GTGCGGAGCA GGGGGCGCAG GCCCTACAGG ACGCGGGGTA TGCTACCGAT
CCTCACTATG CCCGTAAACT CACCAACATG ATCCAGCAGA TGAAATCGAT AAGCGACAAG
GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA
 
Protein sequence
MISDSKLLAS AAWDAQSLNE LKAKAGEDPA ANIRPVARQV EGMFVQMMLK SMRDALPKDG 
LFSSEHTRLY TSMYDQQIAQ QMTAGKGLGL AEMMVKQMTP EQPLPEESTP AAPMKFPLET
VVRYQNQALS QLVQKAVPRN YDDSLPGDSK AFLAQLSLPA QLASQQSGVP HHLILAQAAL
ESGWGQRQIR RENGEPSYNL FGVKASGNWK GPVTEITTTE YENGEAKKVK AKFRVYSSYL
EALSDYVGLL TRNPRYAAVT TAASAEQGAQ ALQDAGYATD PHYARKLTNM IQQMKSISDK
VSKTYSMNID NLF
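
The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (table 11 differs in which codons may serve as starts). Alternative start codons are not special-cased, which does not matter here because the gene begins with ATG. Only the first and last blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI "TCAG" order.
# Assumption: internal codons are read as in the standard code,
# which also holds for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display blocks of the gene sequence above.
first_block = "ATGATCAGCG ACAGCAAACT ACTGGCAAGT GCGGCCTGGG ATGCGCAATC ACTCAACGAA"
last_block = "GTGAGCAAAA CCTACAGCAT GAACATTGAT AATCTGTTCT GA"

print(translate(first_block))  # MISDSKLLASAAWDAQSLNE
print(translate(last_block))   # VSKTYSMNIDNLF
```

The two results match the start and end of the protein sequence listed above. Passing the complete gene sequence to `gc_percent` would give a value to compare with the GC content field.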