Gene Moth_0771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0771 
SymbolfliF 
ID3831484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp807332 
End bp808906 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content62% 
IMG OID637828702 
Productflagellar MS-ring protein 
Protein accessionYP_429632 
Protein GI83589623 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1766] Flagellar biosynthesis/type III secretory pathway lipoprotein 
TIGRFAM ID[TIGR00206] flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF) 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00107573 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGTTA AAGATATATT TGGCAAAGCA AAGAGTTTTT GGCAGGGATT GACTCCGGGG 
CGGCGTTCGG CCCTGGTAGC GGTAACCCTG GCCGCCTTGC TGGCGGCGGG TTTCCTGGTC
CAGTGGCTGG TGAGGCCCCA ATACGCGCCC CTTTTCACCA ACATGCAGCA GCAGGACGCC
GCGGCGGTTA CGGCAAAATT GAAGGAGATG AAGGTTCCCT ACCGCCTGAC GGGTGACGGT
ACCACCATCG AGGTACCCAA AGACCAGATC TACCAGTTAC GCCTGGACCT GGCCAGCGCC
GGTGTCCTGA ACAACGGCCA GGGTTTTGAG CTTTTCGACC AGAACAAGCT GGGTATGACC
GACTTTGAGC GTAACCTGGA TTATCAGCGT GCCCTGCAGG AGGAACTGCG GCGGACCATC
GTTACCCTGG ACGAGGTGGA AGACGCCCGG GTGCACCTGG TCATCCCCCA GCCCAGTGTT
TTCCTGCAAC AGCAGCAGCC GCCGTCGGCG GCCGTGGTTT TAAAGCTCAA GCCCCTGGCG
CGGCTGAAAC CGGAACAGGT CAAGGGCATC ATGGAGCTGA TTGCCGCCAG CGTCCCCGGG
CTGAAGCTGG AAAACATCCG CGTCATCGAT ATGTACGGCA ACGTCTTGAG TGACGGCGTT
GCCGACAGTG CGAACGCCCC CTTAAGCCAG AAGCAGCAGA CCCAGATGGA GATCAAGCGG
CAGTTCGAAA AGGACCTGGA GCAGCGCCTC CAGAGTATGC TGACCCAGAT TTTGGGGCCG
GGGAAGGCCG TGGCCATGGT GACGGCGGAC CTCAACTTCG ACCAGCAGGA AATCAACCAG
ACCACCTGGG GCAAGCAGGG GGCGCTTAGA AGCGAGGAGA TTAAAACCGA GCAGGGCACT
TCTAACGGCG GAGCCGGCGG TGTGGCGGGG ACGGCAGGTA ACAACGGGCC GGGTTATCCG
GCCGTAAACC CCGCCAACGG AAACTATAAC ACGAGCGATA CCGTCCACAA CTACGAGCTG
GATAAAACCG ATACCCACAC CATAGTGGCT CCCGGGCAGG TCCGGCGGCT TTCCACCGCG
GTAGCCGTTA ACGGCCCGGT GAACGCGGCC TTAAACAACC AGATCCAGCA GATCGTCAGC
GCCGCGGTGG GCTACCAGCC GGCCCGGGGC GATCAGATCA CCGTTACCAG CCTGGCCTTT
GACAACTCCC TGCAGCAGCA AATGGCGGCC GACATGGCCG CCCAGCAGCA GCGCCAGCAG
AGGCTGCGCC AGTACCTCCT GTGGGGAGGC GTGGGGCTGG TGTCCCTGGC CCTGCTGGTG
ACCCTGATCG TTCTTATCCT GCGGCGCCGT CGGCAGGCGG CCCTGGAAGA ACAGATGGCT
GCCGCGGCCC TGCCGGCCGG CGTGCCGGTA GAACCCCTGG TAGTGGAACC GGTAGAACCG
GTGGAACCGG CGGACCTGGA GAAGCAGCGC CAGGAAGCCC AGCGCAAGGC TAAACTGGAG
CAACTGCAGG AGATCATCCG CCAGCGGCCG GAGGATGCGG CTTTACTGCT GAAGGCCTGG
CTGGCAGAAG ATTAG
 
Protein sequence
MKVKDIFGKA KSFWQGLTPG RRSALVAVTL AALLAAGFLV QWLVRPQYAP LFTNMQQQDA 
AAVTAKLKEM KVPYRLTGDG TTIEVPKDQI YQLRLDLASA GVLNNGQGFE LFDQNKLGMT
DFERNLDYQR ALQEELRRTI VTLDEVEDAR VHLVIPQPSV FLQQQQPPSA AVVLKLKPLA
RLKPEQVKGI MELIAASVPG LKLENIRVID MYGNVLSDGV ADSANAPLSQ KQQTQMEIKR
QFEKDLEQRL QSMLTQILGP GKAVAMVTAD LNFDQQEINQ TTWGKQGALR SEEIKTEQGT
SNGGAGGVAG TAGNNGPGYP AVNPANGNYN TSDTVHNYEL DKTDTHTIVA PGQVRRLSTA
VAVNGPVNAA LNNQIQQIVS AAVGYQPARG DQITVTSLAF DNSLQQQMAA DMAAQQQRQQ
RLRQYLLWGG VGLVSLALLV TLIVLILRRR RQAALEEQMA AAALPAGVPV EPLVVEPVEP
VEPADLEKQR QEAQRKAKLE QLQEIIRQRP EDAALLLKAW LAED