Gene Mfla_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_2689 
Symbol 
ID4001787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp2901962 
End bp2905132 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content55% 
IMG OID637939614 
Producthypothetical protein 
Protein accessionYP_546793 
Protein GI91777037 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family
[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAA CCCTACAGAT CAAGATAACG GCGGATGGCC GGATTGCCCT GGATGAGATC 
AAGGCGGTCG GTAAAGCCCA GGCAGATGCC AACAAGGCAC AAGCCGCTGC GGCTGCCAGC
ATGGAGAGAT CCCGTAGTGC CCAAACTGCA GCTGCGGCTT CCACGCGTGC ATTAGGGGAT
GAGCAGCGTC GCCTGGCGAG TGAGCTGGAT ACGTTGATGA AGCGTATCGA TAGCGCCTAC
CGTGAGACGA AGCGCCTGGA GGAAGGCCAA TCTACACTGC GCCGGAGTTA TGCTGCTGGC
CTGATCAATC TGAGCCAGTA CAAGCAAGGT TTGGCATCCC TCAATAATGA AACCTCCCAG
GCAGTACAGT CAGGGGAGCG CCATGCATCC ATAATCGCCC GGGTCGGCCA TTATGCCGCC
GCTGCTTTTG CAGTGAGCCA AGTTGTCGAT TTCACCAAAC AGATCGTCCA AGCCAATCTC
AACTTCGAAA AATACAACAA CACACTGGTC TACGCCACAG GCAGCCAACA CCTGGCAGCA
GTGGAAATGG ATTACATCAC ACGCACTGCG GATCAGCTGG GCCTAAATCT TGAAGGTGCG
ATTGTCGGGT ATACGAAGTT TTCTGCCGCG ACTAGAGGTA CCTCCATCGA GGGCGAGAAA
ACACGGGAGG TGTTCGAATC AATCGCCAAG GCGTCAGTGG TGATGAGGCT GTCTGCTGAT
GAAACCAATG GTGCCTTGCT TGCCATCTCG CAGATGATCA GCAAAGGTAC GGTGCAGGCT
GAGGAATTGC GTGGCCAATT GGGTGAACGT CTACCTGGCG CGTTCAATGT GGCTGCACGC
GCGATGGGCG TGACGACACA AGAGCTGGAT AAGCTGCTTG AAGATGGGGC TGTGGTGTCT
GATGAGTTTC TGCCCAGATT TGCAGATGAG TTGACCAGGA CACTAGGCGA TAACCCACAA
AGCGCTGCCA GTAGCGCCCA AGCGCAGTTA AATCGTCTCT CCAATGCCTA TCTGGAGTTC
AAGCGATCAT TAGGCGAGAG CCTGATCATG GAACTGGTAT TGCGCGTGAC GGAAGGTTCG
ACCGATGCAC TCAAGTGGTT CAATCACTTT GTATTCGATA GGGGGGAACT GGCCGATTTG
AATGCCCGCG CCAATGCGCC AGAGCGCCTG GCCAAGCTGG ACCGCGAGCG GCAGTTCCTG
CTCGAGGGAG GCAACAACCT TGATCCATCC AATACTCTTG CTCGCCGACA GGCTCGATTG
GCCGCCATTG ATGCCGAGGT CCAGGCAATC CGCAACCGGG CTGGCCTGGA TACTCCTGCC
GAAGAAGTTG GCGAATCTGA ATCCGCAAAG AACGCTGCAG CGTTGCGTGA CAAGGCGCAG
CAGAATGCCT TGGATAAATT TATCAATACT GAGAAGTGGC AGACAAGGTC GCAAAAGCTT
GCGTCAGCGC TGGAAGAAGA ACGCAAGGCA TTTGAGAAAT TGGTCGAGGG CATGGAGAAG
GACTCTCCAC GTTATAAAGC CGCCTATGAG GCCCACTTGT CGCACATCGA TGTCATCAAA
GGCAAGGCCG AGGCATCGGC CAAAAAGAGC AATAGCGATC CGCTGGCCAA TGCTATGCGT
CAGCTGGAAA ACGCACGTGC AGAGTCTGGC CGCAATACCG AGAAATTCAT CTATGACGCC
GAGCTGAAGG CTCTGGATAC CGGCCTGCAG CAGGCCCTGA TCAGCTACCG TGAGTATTAC
GTCCAGCGCG AAACGATCGA GGACGATTAT TACAATCGCC AGGTATCCCG GATTGAGCAA
AAGATTGCCA ATGAGCAAGC GGCGGCGGCT GCAGCCAGGG CAAGAGGCGA TAACGTTGGC
GCAGCTGGCA ATGAGACCAA TATTGTTAAA CTGCAGTCCG AGTTGAATGC CCTGGACCAG
CAACGTGCCA ATAGCCGTGC TGCCAACATG GAAGCAGAGC GTGCGGCTAA TCTGGAAATG
GCGCGGCAAG GTATGGCCAT GACAGCCCAG CTGCTGCAAT CGCAGGGGCA GCTGGAGGAT
GCCGCCCGCC TGCAGGTGGC TCAGAAGTAC AGCGCCAGCC TGGCCAGGAT GAAGGCTGAA
GGCAACGAGG CTGGTATAGA GCTGATAGAA AAGCTGATCA ATGTCGAATT GGCCAGCGGT
CGATTGAATC AGATTGAAAC CGAGCTCAAT GCCTCACTCG CCCGCATCAC TGACGAAACC
AGGCGTGTGG ATATCCAGAA GAATGCGGGC ATCCTGACCG AATTCGAAGC CAGGCGGCGA
CTCATTGGTC TACAGCAGCA GGAAATTCCA TTGCGCCAAG CGCAATTGAC GGCATTGGAA
CAAGCCTATC AGCAGAGCCC AACCCGGGAA CTGAGCGACA GGATACGACA GGCTCAGCTC
GACATCGAGC AGCTTCAGGC GGTAGTGAGT GGAGCCAACC GCACTTTTGA ATATGGCGCG
CGCACTGCGA TTGGAGAATA CCGCGACGCG ATCAGCAATG GCGCTACGCA AGCACGTGAC
TTGTTCAACA GCAGTTTCCA AAGCATGGAG AACGTGCTGG CCACCTTCTT CCAGACTGGC
AAGCTCAACA TGCAGGATTT CTTCAGCGCC CTGGAGCAAA GCCTGGCCAA GGTTGCCGCG
CAGAAGATGA TGGAGGGAAT TTTTGGTGCC CTGGACATGG GTAGTTGGTT TGGTGGCAGC
AATTCTACAG TGCCTTCATA TGGCAGCTCC GGCACGGTGC CAGGTGGAGT CGGTCGCTGG
ATCGGTACCA ATCACACCGG CGGCATCCTG GGCAGCGAGA GTACGGCGCT TCGGTATGTC
TCTTCAGATG TATTCGATGG TGCACCACGC TTCCATTCTG GTGGCATCTT GGGCGATGAA
ATCCCTATTA TTGGCCGTAA GGGTGAAGGT GTTTTTACTG CAGAGCAGAT GAAGAACCTG
GCCCCTGTTG GCACTGGTGG CCAGCAAGTG GTCAACGTGC AGATTTTCGA GGCAGCAGGC
ACACAAGCCA CGGTCACTCA AAGCACTGGC GATAACGGCG AATTAAATCT GCAAGTGATG
GTGGAAACCC TGACTGACAT CATGGGCCGC GATATAGAGC GTGGCCGTGG CCTAGGCCCA
ACACTTGAAG CCAAATATGG CCTGAATCCT TCAGCAGGAG CCTTGGGATG A
 
Protein sequence
MEKTLQIKIT ADGRIALDEI KAVGKAQADA NKAQAAAAAS MERSRSAQTA AAASTRALGD 
EQRRLASELD TLMKRIDSAY RETKRLEEGQ STLRRSYAAG LINLSQYKQG LASLNNETSQ
AVQSGERHAS IIARVGHYAA AAFAVSQVVD FTKQIVQANL NFEKYNNTLV YATGSQHLAA
VEMDYITRTA DQLGLNLEGA IVGYTKFSAA TRGTSIEGEK TREVFESIAK ASVVMRLSAD
ETNGALLAIS QMISKGTVQA EELRGQLGER LPGAFNVAAR AMGVTTQELD KLLEDGAVVS
DEFLPRFADE LTRTLGDNPQ SAASSAQAQL NRLSNAYLEF KRSLGESLIM ELVLRVTEGS
TDALKWFNHF VFDRGELADL NARANAPERL AKLDRERQFL LEGGNNLDPS NTLARRQARL
AAIDAEVQAI RNRAGLDTPA EEVGESESAK NAAALRDKAQ QNALDKFINT EKWQTRSQKL
ASALEEERKA FEKLVEGMEK DSPRYKAAYE AHLSHIDVIK GKAEASAKKS NSDPLANAMR
QLENARAESG RNTEKFIYDA ELKALDTGLQ QALISYREYY VQRETIEDDY YNRQVSRIEQ
KIANEQAAAA AARARGDNVG AAGNETNIVK LQSELNALDQ QRANSRAANM EAERAANLEM
ARQGMAMTAQ LLQSQGQLED AARLQVAQKY SASLARMKAE GNEAGIELIE KLINVELASG
RLNQIETELN ASLARITDET RRVDIQKNAG ILTEFEARRR LIGLQQQEIP LRQAQLTALE
QAYQQSPTRE LSDRIRQAQL DIEQLQAVVS GANRTFEYGA RTAIGEYRDA ISNGATQARD
LFNSSFQSME NVLATFFQTG KLNMQDFFSA LEQSLAKVAA QKMMEGIFGA LDMGSWFGGS
NSTVPSYGSS GTVPGGVGRW IGTNHTGGIL GSESTALRYV SSDVFDGAPR FHSGGILGDE
IPIIGRKGEG VFTAEQMKNL APVGTGGQQV VNVQIFEAAG TQATVTQSTG DNGELNLQVM
VETLTDIMGR DIERGRGLGP TLEAKYGLNP SAGALG