Gene Moth_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0789 
Symbol 
ID3831026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp821839 
End bp822963 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content63% 
IMG OID637828720 
Productflagellar biosynthetic protein FlhB 
Protein accessionYP_429650 
Protein GI83589641 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1377] Flagellar biosynthesis pathway, component FlhB 
TIGRFAM ID[TIGR00328] flagellar biosynthetic protein FlhB
[TIGR00789] flhB C-terminus-related protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000340873 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAC ATAACCTGTT GGCATTTTAC GGCGACCAGG ACGTGACGGG TAGACACTTA 
ATCAACCTGC AGCTTTTTGC CGAGGAAAAA ACCGAGGAGG CGACCCCCCA CCGCCTCCAG
GAAGTGCGCC AGAAGGGCCA GGTGGCCCGG AGCAATGACC TGAGTACGGC CATGGTGCTG
CTGGCGTGTG TGGTATTCCT TTACTGGCGG CGGGAAACCT TTTACCAGGC CATGGCGGAC
CTGATTACCA GTACCCTCCA GGATGGCTGG CATCAGCAGC TGGATGGCGG TTCGCTGATG
GCCCTCGGCA GCCAGCTGGC CTTAAAGGTA GGGCTGCTCC TGGCCCCCCT TCTGGCCCTG
GCAGCGGCCG TCGGCCTGGC GGCCAATTTC GCCCAGACGG GCTTTGTCTT CTCCCTGGAA
CCGTTACTCC CGCGCCTGGA GAACCTGGAC CCGGTGAAGG GCATGCAGCG CTTCTTTTCC
CGGCGGGCCT TGATGGAACT CCTCAAAAGC CTGGCCAAGG TGGTTGTCGT CAGCCTGGTG
GTCTGGCAGC TGGTTAAGGG GCAGTTTACC CAGCTTCTGC TGACCGTTGA TATGGGGTTG
CCGGCCACCC TGGACCTGGT GAGCCGGATG GTCTACCGGG TGGGTCTGGG TACAGTGGCC
GTATTTCTGG CCCTGGCGGC GGCCGATTAT GTCTTCCAGC GGCGGGAGTA CCAGAAAAAC
CTGCGTATGA CCAGGCAGGA AGTAAAAGAA GAAATGAAGC AGATGGAAGG CGACCCCCTG
GTGCGTTCCC GGTTGCGGGA GAAGCAGCGC CAGCTGGCCC GGCACCGGAT GATGCACGCC
GTGCCGGAAG CCACGGTGGT CATCACCAAC CCCACCCATG TGGCTGTAGC CCTGCGTTAC
CGGGAAGAGG AGAGGGCGCC GCGAGTGGTG GCCAAGGGTG CCGGGAGCAT CGCCGAAAGG
ATCAAGGCTG TGGCCCGTCG CCACAACGTA CCGGTAGTGG AAAACCCGCC GGTGGCCCGT
GCCCTCTACC GCCAGGTGGA GCTGGGCCAG GAAATCCCGG TGGCCCTCTA CCAGGCGGTA
GCCGAGATCC TGGCCCGGAT CTACAAGCTG CGGGGGAGAT TGTAA
 
Protein sequence
MRAHNLLAFY GDQDVTGRHL INLQLFAEEK TEEATPHRLQ EVRQKGQVAR SNDLSTAMVL 
LACVVFLYWR RETFYQAMAD LITSTLQDGW HQQLDGGSLM ALGSQLALKV GLLLAPLLAL
AAAVGLAANF AQTGFVFSLE PLLPRLENLD PVKGMQRFFS RRALMELLKS LAKVVVVSLV
VWQLVKGQFT QLLLTVDMGL PATLDLVSRM VYRVGLGTVA VFLALAAADY VFQRREYQKN
LRMTRQEVKE EMKQMEGDPL VRSRLREKQR QLARHRMMHA VPEATVVITN PTHVAVALRY
REEERAPRVV AKGAGSIAER IKAVARRHNV PVVENPPVAR ALYRQVELGQ EIPVALYQAV
AEILARIYKL RGRL