Gene Moth_0786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0786 
SymbolfliP 
ID3831023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp819762 
End bp820715 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content63% 
IMG OID637828717 
Productflagellar biosynthesis protein FliP 
Protein accessionYP_429647 
Protein GI83589638 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1338] Flagellar biosynthesis pathway, component FliP 
TIGRFAM ID[TIGR01103] flagellar biosynthetic protein FliP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000685779 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCTA GGGATCTAGC GACGGTGGGG CGGCGACGGC GGCAGGTGCA GATGCCGGCG 
TACCGCAGCC GGCCGGATTG GCCGGGGGCG GGGGGATTCC ACCGGGCGGG GCCAGTGGAG
CAGGGCACCG GTTTCGCCCT GGAAGAAGCC ACTGCGGTGG AGAGCAATGG CCGGGGGACC
CCGCCGGCCG TAAGGCAGCC GTTATGGCCC CTGGTCCTGG GAGGCGCGGC CTTGCTGGCC
GGATTGGTCC TGGGGCTTAG ACCGGCCCTG GCCCAGCCGG TTCCCGTGCC CCAGGTGAAC
CTGAACCTGG CCCAGACCAC CGACCCGCGC CAGGTGGTGG ACACCGTCAG GCTGCTGATC
CTGCTGACAG TTCTGGCCCT GGCGCCGGCC CTGGTCCTGC TGATGACCTC CTTTACCAGG
ATAATCGTTG TCCTGTCCTT CGTTCGCAGT GCCCTGGCCA CCCAGCAGAC GCCGCCCAAC
CAGATCTTAA TCGGCCTGGC CCTGTTCCTG ACCTTTTTCA TCATGGCGCC GGTTTACAAC
CAGGTAAAGA CCCAGGCCAT CGACCCTTAC CTGGCGGGGC GAATAACCCA GGAGCAGGCC
CTGGCCGCCG GGGCCCGGCC GGTGAGAGAG TTTATGTACC GCCAGACCCG GGAAAAAGAC
CTGGCCCTGT TCGTCCACAT GTCCGGCATG GCCCAGCCCC GCACCCGGGA CGATGTGCCC
CTGCATGTCC TGATCCCGGC CTTTATCATC AGCGAGCTGA AAACGGCCTT CCAGATGGGC
TTTTTGATCT ACATCCCGTT CCTGATAATC GATCTGGTAA TTGCCAGTAC CCTCATGGCC
ATGGGCATGT TTATGGTACC GCCAGTGATG ATCTCTCTAC CTTTTAAACT CATGCTCTTT
GTCCTGGTTG ACGGCTGGTA CTTGGTTGTC AAGTCCTTGC TGGAGAGTTT TTAG
 
Protein sequence
MMARDLATVG RRRRQVQMPA YRSRPDWPGA GGFHRAGPVE QGTGFALEEA TAVESNGRGT 
PPAVRQPLWP LVLGGAALLA GLVLGLRPAL AQPVPVPQVN LNLAQTTDPR QVVDTVRLLI
LLTVLALAPA LVLLMTSFTR IIVVLSFVRS ALATQQTPPN QILIGLALFL TFFIMAPVYN
QVKTQAIDPY LAGRITQEQA LAAGARPVRE FMYRQTREKD LALFVHMSGM AQPRTRDDVP
LHVLIPAFII SELKTAFQMG FLIYIPFLII DLVIASTLMA MGMFMVPPVM ISLPFKLMLF
VLVDGWYLVV KSLLESF