Gene Information |
Locus tag | Mfla_1701 |
Symbol | |
ID | 4001238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | - |
Start bp | 1819248 |
End bp | 1822373 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637938615 |
Product | hypothetical protein |
Protein accession | YP_545810 |
Protein GI | 91776054 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
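The start, end, gene length, and protein length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon. The sketch below checks that arithmetic under those two assumptions. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1819248, 1822373)
print(length, protein_length_aa(length))  # 3126 1041
```

Both results match the "Gene Length" and "Protein Length" values listed for Mfla_1701.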
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0554599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATAAGT CCAAACTAAA GGGGGTCGCA CTTGCGGTTT TGTTGGCCTT TGCGCCAATG GCGAGCGAGG CCGCAGGCTT GGGACGCTTG ACTGTGACAT CTGGACTTGG TGAGCCGCTT TCTGCCGAAA TTGAGCTGTT ATCCACGACG CCGGACGAGC TGGCGACATT GACTGCAGGT ATTGCGCCAG AGGAAGCCTA CAATGTCCAA GGTGTCGAAC GTACTGCAAT CCACAATGCC ATCAAGGTCG ATGTCAGCAA GCGTGCGGAC GGGACGCCAG TGCTCAAGCT TACGACATCT CAGGCGATCA GCGACCCTTT CCTGGATATG TTGATTCAGG TCGATTGGGC AACAGGGCGC TTGCTGCGCG AATATACCGT ATTACTTGAC CCTCCGGGCT ACAACAATCA AGCGCAAGTG TATAACGGTG TACAGCTTCC TCCGGATGCG GCTCCCACCA CACTACCGGC CCCTGCTGCG GCACCACAAC AGAATCCTCC GCCTGCCAAT ACAGCCCCCC AGTCTATTCC AGCTCCAATT CAGACCCAGG CAGCAGCTGT CCCAGAAGTT CCAGAACAGG CGACCCAACT TCAAGACGCT TCGCCACAGA CCTATACCAC TTCGACCGGT GATACCTTGA GCGCAATCGC ACGTGAGCAT CAGGTGCCGG GTGTAAGCCT GGACCAGATG CTGGTCGGGC TGTATCGCGC CAACGAGGAA GCCTTTATCG GCAAGAACAT GAACCGTCTC AAGGTGGGGC AGATCATGCG GGTACCTAGC CCCGCTGAGC TGCAACAGAT TGCCCCTGCT GAGGCAAGAC GGGAAGTCAG GGTGCATACG TCGGACTGGA ATGCTTATCG TAACCGCTTG GCAGACTTGG CGGCCGAATC GACGCTGTCG GTAGATGGTG GCCCACAACA GTCGGCATCC GGGCGCATTT CGACGCCTGC AACGGATCAG GCTGCGCCTC CAGCGGATGC TTCACGGGAT GTCGTCAGGC TTTCCAGCGA TAATGCCAGC CAGGCGCAAT TGCATGCAAT GCAGGAAGAG CTCACAGCCA AGGACAAGAG CCTGAAGGAG GCGGCCGAGC GCGTTGCAAT GCTCGAGAAG CAAATTGCTG ATATGCAGAA GCTGCTGGCT ATCAAAAACC AGGCTCTGGC CAACTTGCAG AATGCCGATG TGTCCTCAGC CAATCCAGAA GAATCAACCC TGCTACCAGA AGGTGATGCC GAACCGCAGG TGCAGGCTGA GTCAGATGCT GCACCAACAG CAGCGCAAGC ATCTGAAACA GATGAGCCCG CAACGGATAC CGCATCAACG GGCGATGTTG CTGAACCAGT GAAACCAGCG TCTCAGCCAG CACCGGTTGA ACCAGTAGTG GAGGAGCCCA GCTTGTTTGA AGAGCTGGTT GGAGAAGATC CGTTGCCGCT TGCCCTGGCT GCTGGGGGTA TTGTCCTGTT GCTGGGTGGA TGGCTGTATA TGCGAAACAA GCGCAAGCGC GAGCTGGATA GCTTCGAAAA AGGTATCCAG ACTGCAGGAG GGCTCAAGCC TAATACGGTG TTTGGCAATA CCGCCGGCGC TACCATCAAT ACAGGAGATA CCTCTTTCCT CAATGACTTC TCCCAGAATA CCGGGGGCAT GATTGACACA CATGACGTTG ACCCCATTGC TGAGGCAGAA GTGTATATGG CATATGGGCG CGATGCCCAG GCAGAAGAAA TCCTCAAAGA TGCCATTGTC AAGGAGCCGC AACGTCATGA ACTCCATCTC AAGTTGCTCG AGATTTATGT CGCACGCAAC 
GATACCTCGG CATTCGAAAC AGTTGCCGGT GAGATCTATA CCACGTTGGG TGCATCTGAT CCCATCTGGA ATAAAGTGGC TGAAATGGGC CGCAAGCTGG AGCCGGATAA TCCCCTGTAT GCAGAAGGCA GCGTGGCCGA TAAGCCTGCA TCCTCGCCCG TGCAAGATGT AAAGGATGCA GAGCCTGGCA AGGAAACGAC AGGGACGAGC CAGCAGGCGG CGTTCACCAG TGCTGCCCTT GCAGGTGCCA CTATTGCTGT CACTGGCAGC CTGCCGGACG AGAAAGGCGC GCAAGACCCT GCTCCTGATG AAGCATCTTT CGCTGAAGCG AATCATGACC AGGAGCAAGC ACACGATTTC CCCTCGCTTG ATTTTGACCT GAGTGCCGAG GATAGGGAAA GCCTCCAGCA GCCAGATGTG GATGCTCCGG ATTCATATGC AGGCAATAGC ATGGATTTCA CGCTCGATCT GCCGGAGCCG CCGGCCAAGA ATAGTCTGCA GCTGGATATC GCGGAGCTTG ATGCAGCAAT TCCCGATTTC GACTTGCCGG AATTGCCTGG CAGCGAAAAG AACGAGGCTG CGGATGCCGT GCCTGCGTAC GCTATTCCAG GGGGTGATGA GGCGGCCGAG CAGTCATCAT CGGAATCCGC TCCATTCCAC ACTCCGGATG TTGACCTTGG TTTGCCCGAG ATTGCATTGC CTGAAATCGA ACTCCCCAAT CCTGCTGCTG AGCAACCTGC AGTAGAAGAA CCGGTTGCTC AGGGGGGCGA TATGCCCGAT ATCGATTTCG ATCTGCCGGA AATCAGTTTC GATGAGGAGC CACACCCGGA AGAGGCTGCA CAAAGTGCCG GTCTGGAGTC TTTGCAGCCC GTTGATGATG CGCTGGAGGC TCCCGAAACT GCTTCCGCAT CAGCGATCGA AAAAACACAA CCTGATGCAG CACAGCCCCT CGATTTTGCC GACATAGATC TGGAGACTGA TGATCTTGGC GACAATCGTG TAGACGCGAC TACGGACCAG GATGCAGAGA TCACCTTCCC TGATTTTGAT ATTGAGCTTG ATGAAAGCGA GCCCGTCCCT GCTGCCGATA TCGATCTTTC GGGAATCAGC CTAGATGTGG CAGGTGACAC GGCGGAAGCG GAAAAACTGG TGTCTGGCGA AGGATCGGAA GAGCCAGCTG ATGTGGACAC CAAGCTGGAC CTGGTGACGG CCTACATGGA CATGGGGGAT AACGAAGGCG CGCGTGAATT GCTCGAGGAA ATCCTGAAGG AAGGGGGCCC ACGGCAGCAG CAGAGGGCCA AATCCATTCT GGATAGCCTC TCCTGA
|
Protein sequence | MHKSKLKGVA LAVLLAFAPM ASEAAGLGRL TVTSGLGEPL SAEIELLSTT PDELATLTAG IAPEEAYNVQ GVERTAIHNA IKVDVSKRAD GTPVLKLTTS QAISDPFLDM LIQVDWATGR LLREYTVLLD PPGYNNQAQV YNGVQLPPDA APTTLPAPAA APQQNPPPAN TAPQSIPAPI QTQAAAVPEV PEQATQLQDA SPQTYTTSTG DTLSAIAREH QVPGVSLDQM LVGLYRANEE AFIGKNMNRL KVGQIMRVPS PAELQQIAPA EARREVRVHT SDWNAYRNRL ADLAAESTLS VDGGPQQSAS GRISTPATDQ AAPPADASRD VVRLSSDNAS QAQLHAMQEE LTAKDKSLKE AAERVAMLEK QIADMQKLLA IKNQALANLQ NADVSSANPE ESTLLPEGDA EPQVQAESDA APTAAQASET DEPATDTAST GDVAEPVKPA SQPAPVEPVV EEPSLFEELV GEDPLPLALA AGGIVLLLGG WLYMRNKRKR ELDSFEKGIQ TAGGLKPNTV FGNTAGATIN TGDTSFLNDF SQNTGGMIDT HDVDPIAEAE VYMAYGRDAQ AEEILKDAIV KEPQRHELHL KLLEIYVARN DTSAFETVAG EIYTTLGASD PIWNKVAEMG RKLEPDNPLY AEGSVADKPA SSPVQDVKDA EPGKETTGTS QQAAFTSAAL AGATIAVTGS LPDEKGAQDP APDEASFAEA NHDQEQAHDF PSLDFDLSAE DRESLQQPDV DAPDSYAGNS MDFTLDLPEP PAKNSLQLDI AELDAAIPDF DLPELPGSEK NEAADAVPAY AIPGGDEAAE QSSSESAPFH TPDVDLGLPE IALPEIELPN PAAEQPAVEE PVAQGGDMPD IDFDLPEISF DEEPHPEEAA QSAGLESLQP VDDALEAPET ASASAIEKTQ PDAAQPLDFA DIDLETDDLG DNRVDATTDQ DAEITFPDFD IELDESEPVP AADIDLSGIS LDVAGDTAEA EKLVSGEGSE EPADVDTKLD LVTAYMDMGD NEGARELLEE ILKEGGPRQQ QRAKSILDSL S
|
| |
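The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code), which assigns the same amino acids as the standard code but accepts alternative start codons such as GTG. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA, even though the strand field is "-"), and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments of translation table 11, in TCAG codon order
# (identical to the standard code; only the start codons differ).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCATAAGT CCAAACTAAA GGGGGTCGCA"))  # MHKSKLKGVA
# Last 15 bases of the gene sequence listed above.
print(translate_cds("GATAGCCTC TCCTGA"))  # DSLS
```

The two outputs match the first ten and last four residues of the listed protein sequence, which is consistent with the GTG start codon being translated as methionine.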