Gene EcolC_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3840 
Symbol 
ID6064497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4195124 
End bp4196404 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content54% 
IMG OID641603252 
Productputative GTPase HflX 
Protein accessionYP_001726771 
Protein GI170021817 
COG category[R] General function prediction only 
COG ID[COG2262] GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03156] GTP-binding protein HflX 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.209609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000230034 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTTTGACC GTTATGATGC TGGTGAGCAG GCGGTACTGG TACACATCTA TTTTACGCAA 
GACAAAGATA TGGAAGACCT CCAGGAGTTT GAATCTCTGG TCTCTTCCGC CGGTGTCGAA
GCATTGCAGG TGATTACCGG TAGCCGTAAA GCGCCGCACC CAAAGTATTT TGTAGGTGAA
GGTAAAGCAG TTGAAATTGC GGAAGCTGTC AAAGCGACGG GTGCTTCGGT CGTTCTTTTT
GACCATGCCC TGAGCCCGGC GCAAGAGCGT AACCTGGAGC GTTTGTGCGA GTGTCGTGTT
ATCGACCGCA CCGGCCTTAT TTTAGATATT TTCGCCCAAC GTGCGCGTAC CCATGAGGGT
AAGTTGCAGG TTGAGCTGGC GCAGCTGCGC CATCTGGCTA CGCGCCTGGT GCGTGGCTGG
ACCCACCTTG AAAGACAGAA AGGCGGGATA GGTTTGCGTG GTCCGGGTGA AACCCAGCTC
GAAACCGACC GTCGTTTGTT GCGTAATCGC ATCGTGCAGA TACAGTCGCG CCTGGAAAGA
GTTGAAAAGC AGCGTGAGCA GGGGCGGCAA TCGCGTATCA AAGCCGACGT TCCTACTGTT
TCGCTGGTGG GATATACCAA CGCCGGTAAA TCTACCCTTT TCAATCGCAT CACCGAAGCG
CGGGTCTACG CGGCAGACCA GTTGTTTGCC ACCCTCGACC CGACGTTGCG GCGTATTGAC
GTCGCGGATG TCGGTGAAAC CGTACTTGCA GATACCGTAG GGTTTATTCG CCATCTGCCG
CACGATCTGG TGGCGGCATT TAAAGCCACG TTACAAGAGA CGCGGCAAGC CACATTACTG
CTGCACGTCA TTGATGCGGC GGATGTGCGT GTACAAGAAA ACATCGAAGC GGTGAATACG
GTTCTTGAAG AGATCGACGC TCACGAGATC CCAACCCTGC TGGTGATGAA CAAGATCGAT
ATGCTGGAAG ATTTCGAACC GCGTATTGAT CGGGACGAAG AGAACAAACC GATCCGTGTC
TGGCTTTCCG CACAGACCGG AGCGGGGATA CCACAGCTTT TTCAGGCTTT GACGGAGCGG
CTTTCCGGCG AGGTGGCGCA GCATACATTG CGTCTGCCAC CGCAGGAAGG GCGTCTGAGA
AGTCGTTTTT ATCAGCTTCA GGCAATAGAA AAAGAGTGGA TGGAGGAGGA CGGCAGCGTA
AGTCTGCAAG TTCGTATGCC GATCGTTGAC TGGCGTCGCC TCTGTAAACA AGAACCGGCG
TTGATCGATT ACCTGATCTA A
 
Protein sequence
MFDRYDAGEQ AVLVHIYFTQ DKDMEDLQEF ESLVSSAGVE ALQVITGSRK APHPKYFVGE 
GKAVEIAEAV KATGASVVLF DHALSPAQER NLERLCECRV IDRTGLILDI FAQRARTHEG
KLQVELAQLR HLATRLVRGW THLERQKGGI GLRGPGETQL ETDRRLLRNR IVQIQSRLER
VEKQREQGRQ SRIKADVPTV SLVGYTNAGK STLFNRITEA RVYAADQLFA TLDPTLRRID
VADVGETVLA DTVGFIRHLP HDLVAAFKAT LQETRQATLL LHVIDAADVR VQENIEAVNT
VLEEIDAHEI PTLLVMNKID MLEDFEPRID RDEENKPIRV WLSAQTGAGI PQLFQALTER
LSGEVAQHTL RLPPQEGRLR SRFYQLQAIE KEWMEEDGSV SLQVRMPIVD WRRLCKQEPA
LIDYLI