Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mfla_2689 |
Symbol | |
ID | 4001787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | - |
Start bp | 2901962 |
End bp | 2905132 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637939614 |
Product | hypothetical protein |
Protein accession | YP_546793 |
Protein GI | 91777037 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAA CCCTACAGAT CAAGATAACG GCGGATGGCC GGATTGCCCT GGATGAGATC AAGGCGGTCG GTAAAGCCCA GGCAGATGCC AACAAGGCAC AAGCCGCTGC GGCTGCCAGC ATGGAGAGAT CCCGTAGTGC CCAAACTGCA GCTGCGGCTT CCACGCGTGC ATTAGGGGAT GAGCAGCGTC GCCTGGCGAG TGAGCTGGAT ACGTTGATGA AGCGTATCGA TAGCGCCTAC CGTGAGACGA AGCGCCTGGA GGAAGGCCAA TCTACACTGC GCCGGAGTTA TGCTGCTGGC CTGATCAATC TGAGCCAGTA CAAGCAAGGT TTGGCATCCC TCAATAATGA AACCTCCCAG GCAGTACAGT CAGGGGAGCG CCATGCATCC ATAATCGCCC GGGTCGGCCA TTATGCCGCC GCTGCTTTTG CAGTGAGCCA AGTTGTCGAT TTCACCAAAC AGATCGTCCA AGCCAATCTC AACTTCGAAA AATACAACAA CACACTGGTC TACGCCACAG GCAGCCAACA CCTGGCAGCA GTGGAAATGG ATTACATCAC ACGCACTGCG GATCAGCTGG GCCTAAATCT TGAAGGTGCG ATTGTCGGGT ATACGAAGTT TTCTGCCGCG ACTAGAGGTA CCTCCATCGA GGGCGAGAAA ACACGGGAGG TGTTCGAATC AATCGCCAAG GCGTCAGTGG TGATGAGGCT GTCTGCTGAT GAAACCAATG GTGCCTTGCT TGCCATCTCG CAGATGATCA GCAAAGGTAC GGTGCAGGCT GAGGAATTGC GTGGCCAATT GGGTGAACGT CTACCTGGCG CGTTCAATGT GGCTGCACGC GCGATGGGCG TGACGACACA AGAGCTGGAT AAGCTGCTTG AAGATGGGGC TGTGGTGTCT GATGAGTTTC TGCCCAGATT TGCAGATGAG TTGACCAGGA CACTAGGCGA TAACCCACAA AGCGCTGCCA GTAGCGCCCA AGCGCAGTTA AATCGTCTCT CCAATGCCTA TCTGGAGTTC AAGCGATCAT TAGGCGAGAG CCTGATCATG GAACTGGTAT TGCGCGTGAC GGAAGGTTCG ACCGATGCAC TCAAGTGGTT CAATCACTTT GTATTCGATA GGGGGGAACT GGCCGATTTG AATGCCCGCG CCAATGCGCC AGAGCGCCTG GCCAAGCTGG ACCGCGAGCG GCAGTTCCTG CTCGAGGGAG GCAACAACCT TGATCCATCC AATACTCTTG CTCGCCGACA GGCTCGATTG GCCGCCATTG ATGCCGAGGT CCAGGCAATC CGCAACCGGG CTGGCCTGGA TACTCCTGCC GAAGAAGTTG GCGAATCTGA ATCCGCAAAG AACGCTGCAG CGTTGCGTGA CAAGGCGCAG CAGAATGCCT TGGATAAATT TATCAATACT GAGAAGTGGC AGACAAGGTC GCAAAAGCTT GCGTCAGCGC TGGAAGAAGA ACGCAAGGCA TTTGAGAAAT TGGTCGAGGG CATGGAGAAG GACTCTCCAC GTTATAAAGC CGCCTATGAG GCCCACTTGT CGCACATCGA TGTCATCAAA GGCAAGGCCG AGGCATCGGC CAAAAAGAGC AATAGCGATC CGCTGGCCAA TGCTATGCGT CAGCTGGAAA ACGCACGTGC AGAGTCTGGC CGCAATACCG AGAAATTCAT CTATGACGCC GAGCTGAAGG CTCTGGATAC CGGCCTGCAG CAGGCCCTGA TCAGCTACCG TGAGTATTAC GTCCAGCGCG AAACGATCGA GGACGATTAT TACAATCGCC AGGTATCCCG GATTGAGCAA AAGATTGCCA ATGAGCAAGC GGCGGCGGCT GCAGCCAGGG CAAGAGGCGA TAACGTTGGC GCAGCTGGCA ATGAGACCAA TATTGTTAAA CTGCAGTCCG AGTTGAATGC CCTGGACCAG CAACGTGCCA ATAGCCGTGC TGCCAACATG GAAGCAGAGC GTGCGGCTAA TCTGGAAATG GCGCGGCAAG GTATGGCCAT GACAGCCCAG CTGCTGCAAT CGCAGGGGCA GCTGGAGGAT GCCGCCCGCC TGCAGGTGGC TCAGAAGTAC AGCGCCAGCC TGGCCAGGAT GAAGGCTGAA GGCAACGAGG CTGGTATAGA GCTGATAGAA AAGCTGATCA ATGTCGAATT GGCCAGCGGT CGATTGAATC AGATTGAAAC CGAGCTCAAT GCCTCACTCG CCCGCATCAC TGACGAAACC AGGCGTGTGG ATATCCAGAA GAATGCGGGC ATCCTGACCG AATTCGAAGC CAGGCGGCGA CTCATTGGTC TACAGCAGCA GGAAATTCCA TTGCGCCAAG CGCAATTGAC GGCATTGGAA CAAGCCTATC AGCAGAGCCC AACCCGGGAA CTGAGCGACA GGATACGACA GGCTCAGCTC GACATCGAGC AGCTTCAGGC GGTAGTGAGT GGAGCCAACC GCACTTTTGA ATATGGCGCG CGCACTGCGA TTGGAGAATA CCGCGACGCG ATCAGCAATG GCGCTACGCA AGCACGTGAC TTGTTCAACA GCAGTTTCCA AAGCATGGAG AACGTGCTGG CCACCTTCTT CCAGACTGGC AAGCTCAACA TGCAGGATTT CTTCAGCGCC CTGGAGCAAA GCCTGGCCAA GGTTGCCGCG CAGAAGATGA TGGAGGGAAT TTTTGGTGCC CTGGACATGG GTAGTTGGTT TGGTGGCAGC AATTCTACAG TGCCTTCATA TGGCAGCTCC GGCACGGTGC CAGGTGGAGT CGGTCGCTGG ATCGGTACCA ATCACACCGG CGGCATCCTG GGCAGCGAGA GTACGGCGCT TCGGTATGTC TCTTCAGATG TATTCGATGG TGCACCACGC TTCCATTCTG GTGGCATCTT GGGCGATGAA ATCCCTATTA TTGGCCGTAA GGGTGAAGGT GTTTTTACTG CAGAGCAGAT GAAGAACCTG GCCCCTGTTG GCACTGGTGG CCAGCAAGTG GTCAACGTGC AGATTTTCGA GGCAGCAGGC ACACAAGCCA CGGTCACTCA AAGCACTGGC GATAACGGCG AATTAAATCT GCAAGTGATG GTGGAAACCC TGACTGACAT CATGGGCCGC GATATAGAGC GTGGCCGTGG CCTAGGCCCA ACACTTGAAG CCAAATATGG CCTGAATCCT TCAGCAGGAG CCTTGGGATG A
|
Protein sequence | MEKTLQIKIT ADGRIALDEI KAVGKAQADA NKAQAAAAAS MERSRSAQTA AAASTRALGD EQRRLASELD TLMKRIDSAY RETKRLEEGQ STLRRSYAAG LINLSQYKQG LASLNNETSQ AVQSGERHAS IIARVGHYAA AAFAVSQVVD FTKQIVQANL NFEKYNNTLV YATGSQHLAA VEMDYITRTA DQLGLNLEGA IVGYTKFSAA TRGTSIEGEK TREVFESIAK ASVVMRLSAD ETNGALLAIS QMISKGTVQA EELRGQLGER LPGAFNVAAR AMGVTTQELD KLLEDGAVVS DEFLPRFADE LTRTLGDNPQ SAASSAQAQL NRLSNAYLEF KRSLGESLIM ELVLRVTEGS TDALKWFNHF VFDRGELADL NARANAPERL AKLDRERQFL LEGGNNLDPS NTLARRQARL AAIDAEVQAI RNRAGLDTPA EEVGESESAK NAAALRDKAQ QNALDKFINT EKWQTRSQKL ASALEEERKA FEKLVEGMEK DSPRYKAAYE AHLSHIDVIK GKAEASAKKS NSDPLANAMR QLENARAESG RNTEKFIYDA ELKALDTGLQ QALISYREYY VQRETIEDDY YNRQVSRIEQ KIANEQAAAA AARARGDNVG AAGNETNIVK LQSELNALDQ QRANSRAANM EAERAANLEM ARQGMAMTAQ LLQSQGQLED AARLQVAQKY SASLARMKAE GNEAGIELIE KLINVELASG RLNQIETELN ASLARITDET RRVDIQKNAG ILTEFEARRR LIGLQQQEIP LRQAQLTALE QAYQQSPTRE LSDRIRQAQL DIEQLQAVVS GANRTFEYGA RTAIGEYRDA ISNGATQARD LFNSSFQSME NVLATFFQTG KLNMQDFFSA LEQSLAKVAA QKMMEGIFGA LDMGSWFGGS NSTVPSYGSS GTVPGGVGRW IGTNHTGGIL GSESTALRYV SSDVFDGAPR FHSGGILGDE IPIIGRKGEG VFTAEQMKNL APVGTGGQQV VNVQIFEAAG TQATVTQSTG DNGELNLQVM VETLTDIMGR DIERGRGLGP TLEAKYGLNP SAGALG
|
| |