Gene Hore_16800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_16800 
Symbol 
ID7313198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1800821 
End bp1802380 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content45% 
IMG OID643612128 
Productflagellar hook protein FlgE 
Protein accessionYP_002509425 
Protein GI220932517 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR02489] flagellar hook protein FlgE, epsilon proteobacterial
[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00000289081 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGTT CCATGTATGC CGGTGTTTCC GGTTTAAAAG CCCATCAAAC TAGGATGGAT 
GTCATTGGTA ATAATATTGC CAATGTGAAT ACAGTTGGTT ATAAAAGCAG TAGTGTTACT
TTCAAAGAGA TGTTAAGTCA GACCCTGCGC GGTGCTAAGG CACCACAGGG AGGCCGCGGG
GGAGTTAACC CCATGCAGGT CGGCCTCGGG GTAGGAGTTG GCAGTATAAC TGTTGACCAT
ACCCAGGGCA ATCTGCAGCC AACCGGTGTT ACTACTGATC TTGCCATTCA GGGTAATGGA
TATTTTGTTG TCAATAATGG TCAGAAAAAC CTGTTCACCA GGGCCGGTAG GCTAAACCTC
GACGATAATG GATATCTTGT TAATGTTTCT AACGGTTTTG TAATTCAGGG CTGGATGGCC
GACATCAGTA CTGGTACCAT TAATTCCGGG CAGGACCCCG AGAATATCCA TATTACCGAT
GAATATTCAA TTATGAATGC CCGGGCCACC GGGAATGCCA CTATAAGTGG TAATCTTGAT
AGTCAATTTG GTGGGACCAG GGAAATTACT GTTGATGTCC TTGATTCCCT GGGGGAAATG
CATACTGTAA CCCTGTCCTT TACAAAAAGG GTACCTGAGC TTACCACAAC AATTGGTGGC
TGGGATCTAA ATTTTAGAGC AACTGAACCA GATATTAATT TAAATAATTT GACAATTGAT
TTTCTTGCTG ATGATGATAG TCAGATAAAT GCAAGTTATA ATAGTGGTAC CAATACTGTA
ACTGTAGCTG CTGACTGGGA TAACAGCAGT ACAAATGCTC CTGCTGATTT AGAAGCTATT
GAAAACGCTA TAAATGATGC CTTAAATGCC AATGGGCTTG CCTCTGTAGA TATTACGGCT
ACCACAACCG GGGCTATGAC AGATTTTGAT GGAGCTGGAG CTATAACCCT TTCCAATCCA
CCCAGCAATA CCTGGGACTG GAATTTAGTA GATGTAACTG ATGCTAATCT TCCTGCTTCG
CCTGCATCAG GAACTATAAC CTTCAATCCA GACGGGACAA TAAATAGTGG GTCCAATGGT
AGTATTGCCT TTGACCCGAC CAATGGTGCC GCTACCGGTC AGACCATAAA TCTTGATTTT
TCAGCCCTGA CCCAGCTGGC TGAAGGGTTT GACTTCAAAA TAAATTCTGA TGGATATGAA
ACCGGTGCTT TAGAAGGATT TACCATAGAT GATGGTGGTG TCATCACCGG TAGTTATTCT
AATGGACTTG TAAGGCCCAT CGGGCAGATT GCCATTGCCT ATTTTGTCAA TCCGTCCGGG
TTAATGAAGG AAGGAGAAAC CCTCTTTTCA CCATCAGAGA ACTCAGGTGA TCCCCAGATA
GGGGAAGCAG GTAGTGGAGG TCGTGGCAAG ATATCGGTCG GTAATCTGGA AATGTCCAAT
GTGGACCTTG CAGAACAGTT TACGGACATG ATTACGACCC AGCGTGGTTT CCAGGCCAAT
TCCAAGATTA TTACTACCAC TGACCAGATG CTCCAGGACC TGGTTAACCT CAAGAGATAA
 
Protein sequence
MMRSMYAGVS GLKAHQTRMD VIGNNIANVN TVGYKSSSVT FKEMLSQTLR GAKAPQGGRG 
GVNPMQVGLG VGVGSITVDH TQGNLQPTGV TTDLAIQGNG YFVVNNGQKN LFTRAGRLNL
DDNGYLVNVS NGFVIQGWMA DISTGTINSG QDPENIHITD EYSIMNARAT GNATISGNLD
SQFGGTREIT VDVLDSLGEM HTVTLSFTKR VPELTTTIGG WDLNFRATEP DINLNNLTID
FLADDDSQIN ASYNSGTNTV TVAADWDNSS TNAPADLEAI ENAINDALNA NGLASVDITA
TTTGAMTDFD GAGAITLSNP PSNTWDWNLV DVTDANLPAS PASGTITFNP DGTINSGSNG
SIAFDPTNGA ATGQTINLDF SALTQLAEGF DFKINSDGYE TGALEGFTID DGGVITGSYS
NGLVRPIGQI AIAYFVNPSG LMKEGETLFS PSENSGDPQI GEAGSGGRGK ISVGNLEMSN
VDLAEQFTDM ITTQRGFQAN SKIITTTDQM LQDLVNLKR