Gene EcSMS35_4644 details


Gene Information

Locus tag: EcSMS35_4644
Symbol: hflX
ID: 6145417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4745454
End bp: 4746734
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 54%
IMG OID: 641619460
Product: putative GTPase HflX
Protein accession: YP_001746568
Protein GI: 170681964
COG category: [R] General function prediction only
COG ID: [COG2262] GTPases
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR03156] GTP-binding protein HflX
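
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above
length_bp = gene_length(4745454, 4746734)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both results agree with the Gene Length and Protein Length fields of the record.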


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0602066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTTGACC GTTATGATGC TGGTGAGCAG GCGGTACTGG TACACATCTA TTTTACGCAA 
GACAAAGATA TGGAAGACCT CCAGGAGTTT GAATCTCTGG TCTCTTCCGC CGGTGTCGAA
GCATTGCAGG TGATTACCGG TAGCCGTAAA GCGCCGCACC CAAAGTATTT TGTAGGTGAA
GGTAAAGCAG TTGAAATTGC GGAAGCCGTA AAAGCGACGG GAGCTTCGGT CGTTCTTTTT
GACCATGCCC TGAGCCCGGC GCAGGAGCGT AACCTGGAGC GTTTGTGCGA GTGTCGTGTC
ATCGACCGCA CCGGCCTTAT TTTAGATATT TTCGCCCAAC GTGCGCGTAC CCATGAGGGT
AAGTTGCAGG TTGAGCTGGC GCAGTTGCGC CATCTGGCTA CGCGCCTGGT GCGTGGCTGG
ACCCACCTTG AAAGACAGAA AGGCGGGATA GGTTTGCGTG GTCCGGGTGA AACCCAGCTC
GAAACCGACC GTCGTTTGTT GCGTAATCGC ATCGTGCAGA TACAGTCGCG CCTGGAAAGA
GTTGAAAAGC AGCGTGAGCA GGGGCGGCAA TCGCGTATCA AAGCCGACGT TCCTACTGTT
TCGCTGGTGG GATATACCAA CGCCGGTAAA TCTACCCTTT TCAATCGCAT CACCGAAGCG
CGGGTCTACG CGGCAGACCA GTTGTTTGCC ACCCTCGACC CGACGTTGCG GCGTATTGAC
GTTGCAGATG TCGGTGAAAC CGTACTTGCA GATACCGTAG GGTTTATTCG CCACCTGCCG
CACGATCTGG TGGCGGCATT TAAAGCCACG TTACAAGAGA CGCGGCAAGC CACATTGCTG
CTGCACGTCA TTGATGCGGC GGATGTACGT GTACAAGAAA ACATCGAAGC GGTGAATACG
GTTCTTGAAG AGATCGACGC TCACGAGATC CCAACCCTGC TGGTGATGAA CAAGATCGAT
ATGCTGGAAG ATTTCGAACC GCGTATTGAT CGGGACGAAG AGAACAAACC GATCCGTGTC
TGGCTTTCCG CACAGACCGG AGCGGGGATA CCACAGCTTT TTCAGGCTTT GACGGAGCGG
CTTTCCGGCG AGGTGGCGCA GCATACATTG CGTCTGCCAC CGCAGGAAGG GCGTCTGAGA
AGTCGTTTTT ATCAGCTTCA GGCAATAGAA AAAGAGTGGA TGGAGGAGGA CGGCAGCGTA
AGTCTGCAAG TTCGTATGCC GATCGTTGAC TGGCGTCGCC TCTGTAAACA AGAACCGGCG
TTGATCGATT ACCTGATCTA A
 
Protein sequence
MFDRYDAGEQ AVLVHIYFTQ DKDMEDLQEF ESLVSSAGVE ALQVITGSRK APHPKYFVGE 
GKAVEIAEAV KATGASVVLF DHALSPAQER NLERLCECRV IDRTGLILDI FAQRARTHEG
KLQVELAQLR HLATRLVRGW THLERQKGGI GLRGPGETQL ETDRRLLRNR IVQIQSRLER
VEKQREQGRQ SRIKADVPTV SLVGYTNAGK STLFNRITEA RVYAADQLFA TLDPTLRRID
VADVGETVLA DTVGFIRHLP HDLVAAFKAT LQETRQATLL LHVIDAADVR VQENIEAVNT
VLEEIDAHEI PTLLVMNKID MLEDFEPRID RDEENKPIRV WLSAQTGAGI PQLFQALTER
LSGEVAQHTL RLPPQEGRLR SRFYQLQAIE KEWMEEDGSV SLQVRMPIVD WRRLCKQEPA
LIDYLI
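
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It is a hedged example, not the database's own pipeline: it builds the codon table programmatically, treats the start codons listed for NCBI translation table 11 as methionine when they occur at the first position, and stops at the first stop codon. The helper name translate and its first_is_start parameter are hypothetical. Only the first and last lines of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard code and differs in its set of start codons.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons listed for NCBI translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


head = "TTGTTTGACC GTTATGATGC TGGTGAGCAG GCGGTACTGG TACACATCTA TTTTACGCAA"
tail = "TTGATCGATT ACCTGATCTA A"
print(translate(head))                        # MFDRYDAGEQAVLVHIYFTQ
print(translate(tail, first_is_start=False))  # LIDYLI
```

The gene begins with TTG rather than ATG, which is why the start-codon handling is needed to reproduce the leading M of the protein sequence.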