Gene Mfla_1839 details


Gene Information

Locus tag: Mfla_1839
Symbol:
ID: 4000966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1977879
End bp: 1979840
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 60%
IMG OID: 637938755
Product: transglutaminase-like protein
Protein accession: YP_545947
Protein GI: 91776191
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
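
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1977879, 1979840)
print(length)                     # 1962
print(protein_length_aa(length))  # 653
```

Both results agree with the Gene Length and Protein Length fields of this record.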


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.544604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCG CCAGCCAGCC TGGCAAGGCC GACCTCCATT GGCTGCTGGG AGCCATGTTG 
CTCGCCATGG CGCTACATTT CAGCCACTTC CCGCTATGGG CCAGTGCATT GATCCTGCTA
TTCATGGTCT GGCGCCTGTT ACTGCAGCTG AAGCAGTGGA CCATGCCCAA GTTATGGTTG
CTGTTGCCCA TCACCCTGCT GGGCGTCGCC GGGATATACC TGCAATACCG TACCCTGTTT
GGTCGCGATG CCAGCGTGAC GCTGCTTGCG CTGATGCTGT CGCTCAAACT CATGGAGTCC
GGCACCCGAC GCGACTATAT CATCCTGATC TTTGCTGGGT ATTTTCTCAC GATCACTGCC
TTTCTGTTCG ACCAGTCGCT ATGGGTCGGC GCTTACCTGC TGCTCCCGGT ATTCGGCCTG
ACAGCATGTC TAATCGGCAT CAGTTATCCG CACGGCATCC TGCCCCACCG CGTGCGCACT
CGCTTGAGTA TGATGCTGCT CTTGCAGGCA GTACCGCTGA TGCTGGTGCT TTTTATGCTG
TTCCCGCGCA TTCCCGGCCC GCTCTGGGGC GTGCCGCAGG ATGCCTACCG TGCCATGAGC
GGTCTGAATG ACCATATGCA ACCTGGCGAC ATCAGTGAGC TCAGCCTGTC CGGTTCAGTG
GCCTTCCGAG CGCAGTTCCA GGGCAAAATC CCAGCGCAGC AGCACCTTTA TTGGCGCGGA
CCAGTACTCT GGCAATTCGA TGGTCGCAAC TGGCATATGG GAGGGCATGA ACGCTACCAG
GAAACGCTGG AGGGATTGGC GGAACCAGTC GACTATGCCG TCACGCTGGA ACCGCATAAC
CGCCGCTGGC TGCTGATGCT GGATGTGCCG CACGCCCAAC CGGCGGGCAC CTCTTTTACC
AGCGACCGGC AGGCGCTTGC GCAGATGCCC GTTCGGACAA GAATGCGCTA TACCGGTAGC
TCCCACGTAC AATACCACCT AGCCGCCGAC ATCTCCGACA AGACACTTGC CCAGGCCTTG
CAGCTGCCAT CGGGCGGCAA TCCGCGCAGC CGTGCACTGG CGCAAGCCTG GCACCGGGAG
CTGCAAGCGC CGGAGCGCAT TATCGATGCC GCGTTGTCGA TGTTCCGCAG CCAGGCATTT
TTCTACACCC TGACGCCCCC GCGCCTGGGC GACCAAGCTA TTGATGACTT TCTCTTCAAC
AGCAGGCGCG GCTTCTGCGA GCACTATGCC AGCAGCTTTG CCTACCTCAT GCGGGCTGCC
GGCATTCCTG CACGCATCGT CACAGGCTAT CAGGGCGGCG AGACCAATCC CGTGGGCCAA
TACCTGATCG TACGCCAATC CGATGCCCAT GCTTGGGTGG AAGTCTGGCT CGAAGGCCGG
GGCTGGGTCA GGGTGGACCC CACGGCGGCA GTCGCCCCAT CCCGGGTGGA GAGCGGGATC
AGCACAGCAT TGTCGGACAA TAGCCCATTG CCTATCCTGG CGCGGCAGGA TCGCACCTGG
CTGCGCGATC TATACCTGAA CTGGGATGCC ATTAACAACG GTTGGAACCA GTGGGTGCTG
GGCTATGACC AGCAACGCCA GCTTGCGCTG CTGTCACGCA TGACGGGTAC ATCCGCCAAC
TGGCAGCAGC TCATGCTATG GCTCACTGGC AGTCTGCTGT TGATGTCGGT ACTGCTGTGC
GCCTGGTTAC TCCGCCGCAG GCATATACCA ACCGACCCCA TAAAAGCGCT TTACCTGCGC
TACCTGGCGA AACTGTCGCG CAGGGGGATT CACCATAGTC CCGGAGAAGG GCCGCTCAGC
GTCCAGCGAC GTGCCAGCAC AGCGTTACCT CAGCATGCGA CAGACATTGC CCGTATCACC
CGGCTTTATC TCGCGCTTCG TTACGGACCA CTTGAAGAAC TGCCTAATGA GCACAAAACC
ATGGCAGAAT TACGGTTACT AATAGAAGGA CTAGATGGCT AA
 
Protein sequence
MSIASQPGKA DLHWLLGAML LAMALHFSHF PLWASALILL FMVWRLLLQL KQWTMPKLWL 
LLPITLLGVA GIYLQYRTLF GRDASVTLLA LMLSLKLMES GTRRDYIILI FAGYFLTITA
FLFDQSLWVG AYLLLPVFGL TACLIGISYP HGILPHRVRT RLSMMLLLQA VPLMLVLFML
FPRIPGPLWG VPQDAYRAMS GLNDHMQPGD ISELSLSGSV AFRAQFQGKI PAQQHLYWRG
PVLWQFDGRN WHMGGHERYQ ETLEGLAEPV DYAVTLEPHN RRWLLMLDVP HAQPAGTSFT
SDRQALAQMP VRTRMRYTGS SHVQYHLAAD ISDKTLAQAL QLPSGGNPRS RALAQAWHRE
LQAPERIIDA ALSMFRSQAF FYTLTPPRLG DQAIDDFLFN SRRGFCEHYA SSFAYLMRAA
GIPARIVTGY QGGETNPVGQ YLIVRQSDAH AWVEVWLEGR GWVRVDPTAA VAPSRVESGI
STALSDNSPL PILARQDRTW LRDLYLNWDA INNGWNQWVL GYDQQRQLAL LSRMTGTSAN
WQQLMLWLTG SLLLMSVLLC AWLLRRRHIP TDPIKALYLR YLAKLSRRGI HHSPGEGPLS
VQRRASTALP QHATDIARIT RLYLALRYGP LEELPNEHKT MAELRLLIEG LDG
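
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one hedged way to do that in Python. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, as printed above.
first_line = "GTGAGCATCG CCAGCCAGCC TGGCAAGGCC GACCTCCATT GGCTGCTGGG AGCCATGTTG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence through `join_blocks` and comparing `len(...)` with the Gene Length field, or `gc_fraction(...)` with the GC content field, gives a quick consistency check on a copied record.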