Gene Mfla_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1701 
Symbol 
ID4001238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1819248 
End bp1822373 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content56% 
IMG OID637938615 
Producthypothetical protein 
Protein accessionYP_545810 
Protein GI91776054 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0554599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATAAGT CCAAACTAAA GGGGGTCGCA CTTGCGGTTT TGTTGGCCTT TGCGCCAATG 
GCGAGCGAGG CCGCAGGCTT GGGACGCTTG ACTGTGACAT CTGGACTTGG TGAGCCGCTT
TCTGCCGAAA TTGAGCTGTT ATCCACGACG CCGGACGAGC TGGCGACATT GACTGCAGGT
ATTGCGCCAG AGGAAGCCTA CAATGTCCAA GGTGTCGAAC GTACTGCAAT CCACAATGCC
ATCAAGGTCG ATGTCAGCAA GCGTGCGGAC GGGACGCCAG TGCTCAAGCT TACGACATCT
CAGGCGATCA GCGACCCTTT CCTGGATATG TTGATTCAGG TCGATTGGGC AACAGGGCGC
TTGCTGCGCG AATATACCGT ATTACTTGAC CCTCCGGGCT ACAACAATCA AGCGCAAGTG
TATAACGGTG TACAGCTTCC TCCGGATGCG GCTCCCACCA CACTACCGGC CCCTGCTGCG
GCACCACAAC AGAATCCTCC GCCTGCCAAT ACAGCCCCCC AGTCTATTCC AGCTCCAATT
CAGACCCAGG CAGCAGCTGT CCCAGAAGTT CCAGAACAGG CGACCCAACT TCAAGACGCT
TCGCCACAGA CCTATACCAC TTCGACCGGT GATACCTTGA GCGCAATCGC ACGTGAGCAT
CAGGTGCCGG GTGTAAGCCT GGACCAGATG CTGGTCGGGC TGTATCGCGC CAACGAGGAA
GCCTTTATCG GCAAGAACAT GAACCGTCTC AAGGTGGGGC AGATCATGCG GGTACCTAGC
CCCGCTGAGC TGCAACAGAT TGCCCCTGCT GAGGCAAGAC GGGAAGTCAG GGTGCATACG
TCGGACTGGA ATGCTTATCG TAACCGCTTG GCAGACTTGG CGGCCGAATC GACGCTGTCG
GTAGATGGTG GCCCACAACA GTCGGCATCC GGGCGCATTT CGACGCCTGC AACGGATCAG
GCTGCGCCTC CAGCGGATGC TTCACGGGAT GTCGTCAGGC TTTCCAGCGA TAATGCCAGC
CAGGCGCAAT TGCATGCAAT GCAGGAAGAG CTCACAGCCA AGGACAAGAG CCTGAAGGAG
GCGGCCGAGC GCGTTGCAAT GCTCGAGAAG CAAATTGCTG ATATGCAGAA GCTGCTGGCT
ATCAAAAACC AGGCTCTGGC CAACTTGCAG AATGCCGATG TGTCCTCAGC CAATCCAGAA
GAATCAACCC TGCTACCAGA AGGTGATGCC GAACCGCAGG TGCAGGCTGA GTCAGATGCT
GCACCAACAG CAGCGCAAGC ATCTGAAACA GATGAGCCCG CAACGGATAC CGCATCAACG
GGCGATGTTG CTGAACCAGT GAAACCAGCG TCTCAGCCAG CACCGGTTGA ACCAGTAGTG
GAGGAGCCCA GCTTGTTTGA AGAGCTGGTT GGAGAAGATC CGTTGCCGCT TGCCCTGGCT
GCTGGGGGTA TTGTCCTGTT GCTGGGTGGA TGGCTGTATA TGCGAAACAA GCGCAAGCGC
GAGCTGGATA GCTTCGAAAA AGGTATCCAG ACTGCAGGAG GGCTCAAGCC TAATACGGTG
TTTGGCAATA CCGCCGGCGC TACCATCAAT ACAGGAGATA CCTCTTTCCT CAATGACTTC
TCCCAGAATA CCGGGGGCAT GATTGACACA CATGACGTTG ACCCCATTGC TGAGGCAGAA
GTGTATATGG CATATGGGCG CGATGCCCAG GCAGAAGAAA TCCTCAAAGA TGCCATTGTC
AAGGAGCCGC AACGTCATGA ACTCCATCTC AAGTTGCTCG AGATTTATGT CGCACGCAAC
GATACCTCGG CATTCGAAAC AGTTGCCGGT GAGATCTATA CCACGTTGGG TGCATCTGAT
CCCATCTGGA ATAAAGTGGC TGAAATGGGC CGCAAGCTGG AGCCGGATAA TCCCCTGTAT
GCAGAAGGCA GCGTGGCCGA TAAGCCTGCA TCCTCGCCCG TGCAAGATGT AAAGGATGCA
GAGCCTGGCA AGGAAACGAC AGGGACGAGC CAGCAGGCGG CGTTCACCAG TGCTGCCCTT
GCAGGTGCCA CTATTGCTGT CACTGGCAGC CTGCCGGACG AGAAAGGCGC GCAAGACCCT
GCTCCTGATG AAGCATCTTT CGCTGAAGCG AATCATGACC AGGAGCAAGC ACACGATTTC
CCCTCGCTTG ATTTTGACCT GAGTGCCGAG GATAGGGAAA GCCTCCAGCA GCCAGATGTG
GATGCTCCGG ATTCATATGC AGGCAATAGC ATGGATTTCA CGCTCGATCT GCCGGAGCCG
CCGGCCAAGA ATAGTCTGCA GCTGGATATC GCGGAGCTTG ATGCAGCAAT TCCCGATTTC
GACTTGCCGG AATTGCCTGG CAGCGAAAAG AACGAGGCTG CGGATGCCGT GCCTGCGTAC
GCTATTCCAG GGGGTGATGA GGCGGCCGAG CAGTCATCAT CGGAATCCGC TCCATTCCAC
ACTCCGGATG TTGACCTTGG TTTGCCCGAG ATTGCATTGC CTGAAATCGA ACTCCCCAAT
CCTGCTGCTG AGCAACCTGC AGTAGAAGAA CCGGTTGCTC AGGGGGGCGA TATGCCCGAT
ATCGATTTCG ATCTGCCGGA AATCAGTTTC GATGAGGAGC CACACCCGGA AGAGGCTGCA
CAAAGTGCCG GTCTGGAGTC TTTGCAGCCC GTTGATGATG CGCTGGAGGC TCCCGAAACT
GCTTCCGCAT CAGCGATCGA AAAAACACAA CCTGATGCAG CACAGCCCCT CGATTTTGCC
GACATAGATC TGGAGACTGA TGATCTTGGC GACAATCGTG TAGACGCGAC TACGGACCAG
GATGCAGAGA TCACCTTCCC TGATTTTGAT ATTGAGCTTG ATGAAAGCGA GCCCGTCCCT
GCTGCCGATA TCGATCTTTC GGGAATCAGC CTAGATGTGG CAGGTGACAC GGCGGAAGCG
GAAAAACTGG TGTCTGGCGA AGGATCGGAA GAGCCAGCTG ATGTGGACAC CAAGCTGGAC
CTGGTGACGG CCTACATGGA CATGGGGGAT AACGAAGGCG CGCGTGAATT GCTCGAGGAA
ATCCTGAAGG AAGGGGGCCC ACGGCAGCAG CAGAGGGCCA AATCCATTCT GGATAGCCTC
TCCTGA
 
Protein sequence
MHKSKLKGVA LAVLLAFAPM ASEAAGLGRL TVTSGLGEPL SAEIELLSTT PDELATLTAG 
IAPEEAYNVQ GVERTAIHNA IKVDVSKRAD GTPVLKLTTS QAISDPFLDM LIQVDWATGR
LLREYTVLLD PPGYNNQAQV YNGVQLPPDA APTTLPAPAA APQQNPPPAN TAPQSIPAPI
QTQAAAVPEV PEQATQLQDA SPQTYTTSTG DTLSAIAREH QVPGVSLDQM LVGLYRANEE
AFIGKNMNRL KVGQIMRVPS PAELQQIAPA EARREVRVHT SDWNAYRNRL ADLAAESTLS
VDGGPQQSAS GRISTPATDQ AAPPADASRD VVRLSSDNAS QAQLHAMQEE LTAKDKSLKE
AAERVAMLEK QIADMQKLLA IKNQALANLQ NADVSSANPE ESTLLPEGDA EPQVQAESDA
APTAAQASET DEPATDTAST GDVAEPVKPA SQPAPVEPVV EEPSLFEELV GEDPLPLALA
AGGIVLLLGG WLYMRNKRKR ELDSFEKGIQ TAGGLKPNTV FGNTAGATIN TGDTSFLNDF
SQNTGGMIDT HDVDPIAEAE VYMAYGRDAQ AEEILKDAIV KEPQRHELHL KLLEIYVARN
DTSAFETVAG EIYTTLGASD PIWNKVAEMG RKLEPDNPLY AEGSVADKPA SSPVQDVKDA
EPGKETTGTS QQAAFTSAAL AGATIAVTGS LPDEKGAQDP APDEASFAEA NHDQEQAHDF
PSLDFDLSAE DRESLQQPDV DAPDSYAGNS MDFTLDLPEP PAKNSLQLDI AELDAAIPDF
DLPELPGSEK NEAADAVPAY AIPGGDEAAE QSSSESAPFH TPDVDLGLPE IALPEIELPN
PAAEQPAVEE PVAQGGDMPD IDFDLPEISF DEEPHPEEAA QSAGLESLQP VDDALEAPET
ASASAIEKTQ PDAAQPLDFA DIDLETDDLG DNRVDATTDQ DAEITFPDFD IELDESEPVP
AADIDLSGIS LDVAGDTAEA EKLVSGEGSE EPADVDTKLD LVTAYMDMGD NEGARELLEE
ILKEGGPRQQ QRAKSILDSL S