Gene Mfla_1495 details


Gene Information

Locus tag: Mfla_1495
Symbol:
ID: 4000956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1600295
End bp: 1603261
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table: 11
GC content: 65%
IMG OID: 637938406
Product: transposase Tn3
Protein accession: YP_545604
Protein GI: 91775848
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
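
The length fields above are consistent with the listed coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1600295, 1603261)
print(length, protein_length(length))  # 2967 988
```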


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.487839
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCCCGAC GCTCCATTCT GACCGCCGCC GAGCGGGCAA GCCTGCTAGC GTTACCCGAT 
ACCGAAGATG AATTGATCCG GCACTACACG TTCAGCGAGG CCGACCTGTC ACTGATTCGC
CAGCGACGTG GGGATGCGAA CCGACTGGGT GTCGCGGTGC AGTTGTGCTT GCTGCGCTTC
CCCGGTCAGG GCCTGCTTCC CGACGCCGCG GTGCCGATGT CCCTGCTGCA ATGGATCGGA
CGGCAACTGC GGCTCGACCC GGTGTGTTGG CCGCAGTATG CTGAGCGGGA GGAAACCCGG
CGCGAGCATC TGCTCGAACT GCGGGCGTAC CTGGGCATGG AGCCGTTCGG CCTGGCGCAC
TATCGGCAGG CCGTCCATGC CACGACCGAG CTGGCCTTGC AGACCGACAA GGGCATCGTG
CTGGCCGCCA GCGTCCTCGA TGCGCTGCGC CACCGGCACA TCATCATTCC GACGCTGGAT
GTCATCGAGC GCGTCTGTGC CGAAGCAATC ACCCGCGCCA ACCGGCGCAT CTACGACGCC
TTGACCGAGC CGCTGTCGGA CGGGCATCGC CGCCGCCTCG ACGATCTGCT CAAGCGCCGG
GACAACGGCC AAACGACCTG GCTTGCCTGG CTGCGCCAGT CGCCCGCTAA GCCGAACTCG
CGGCACATGC TCGAACACAT CAAACGCCTC AAGGCGTGGC AGGCACTCGA CCTGCCTTCC
GGCATCGAGC GGCTGGTTCA CCAGAACCGG CTGCTCAAGA TCGCCCGCGA GGGCGGCCAG
ATGACGCCTG CCGACCTGGC CAAGTTCGAG GCGCAGCGCC GCTACGCGAC CCTGGTGGCG
CTCGCCATTG AGGGCATGGC CACCGTTACC GACGAAATCA TCGACCTGCA TGACCGCATC
CTGGGCAAGC TGTTCAACGC CGCCAAGAAC AAGCATCAGC AGCAGTTCCA GGCATCCGGC
AAGGCGATCA ACGCCAAGGT GCGGCTGTTC GGCCGCATCG GTCAGGCACT GATCGAGGCC
AAGCAATCGG GCCGCGATCC GTTCGCCGCC ATCGAGGCCG TCATGTCCTG GGACGCCTTC
GCCGAGAGCG TCACCGAGGC GCAGCGGCTC GCGCAGCCTG AGGACTTCGA TTTCCTGCAC
CGCATCGGCG AGAGCTACGC CACGCTGCGC CGCTACGCGC CGGAATTCCT CGACGTGCTC
AAGCTGCGGG CCGCGCCCGC CGCCAAGGAC GTGCTGGAGG CCATCGACGT GCTGCGCGGC
ATGAACAGCG ACAACGCCCG CAAGGTGCCT GCCGACGCGC CGACAGACTT CATCAAGCCG
CGCTGGCAGA AGCTGGTGAT GACCGACGCC GGCATCGACC GGCGCTACTA CGAACTGTGC
GCGCTGTCGG AGCTGAAGAA CGCGCTGCGC TCCGGCGACA TATGGGTGCA GGGTTCGCGC
CAGTTCAAGG ACTTCGAGGA CTACCTGGTG CCGCCCGCGA AATTCGCCAG CCTAAAGCGG
GCCAGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGACC AGTACCTGCA CGACCGGCTG
ACGCTGCTGG AAATGCAGCT CGCCACCGTC AACCGCATGG CACTGGCCAA TGACCTGCCA
GACGCCATCA TCACCGAATC GGGCCTGAAA ATCACGCCGC TTGATGCGGC GGTACCCGAC
ACTGCGCAGG CCCTGATTGA CCAGACGGCG ATGATCCTGC CGCACGTCAA GATCACCGAA
CTGCTGCTGG AGGTGGACGA ATGGACGGGC TTCACCCGGC ACTTCACGCA TCTGAAGTCG
GGCGACCTGG CCAAGGACAA GAACCTGCTG CTGACCACGA TCCTGGCAGA CGCGATCAAC
CTGGGCCTGA CCAAGATGGC CGAGTCCTGC CCCGGCACGA CCTACGCCAA GCTGGCCTGG
CTGCAAGCCT GGCACATCCG CGACGAAACC TATGGGGCGG CGCTGGCCGA GCTGGTCAAT
GCACAGTTCC GCCACCCGTT CGCCGAGCAT TGGGGCGACG GCACCACGTC ATCGTCGGAC
GGCCAGAACT TCCGCACCGG CAGCAAGGCC GAGAGCACCG GCCACATCAA CCCGAAATAC
GGCAGCAGCC CAGGGCGGAC GTTCTATACC CACATTTCCG ACCAGTACGC GCCATTCCAC
ACCAAGGTCG TGAACGTCGG CGTGCGCGAC TCGACCTACG TGCTCGACGG CCTGCTGTAC
CACGAATCCG ACCTGCGCAT CGAGGAACAC TACACCGACA CGGCGGGCTT CACCGATCAC
GTCTTCGCCC TGATGCACCT CTTGGGCTTC CGCTTCGCGC CGCGCATCCG CGACCTGGGC
GACACCAAGC TCTACATCCC GAAGGGCGAC GCCGCCTATG ACGCGCTCAA GCCGATGATC
GGCGGCACGC TCAACATCAA GCGCGTCCGC GCCCATTGGG ATGAAATCCT GCGGCTGGCC
ACCTCGATCA AGCAGGGCAC GGTGACGGCT TCACTGATGC TGCGCAAACT CGGCAGCTAC
CCGCGCCAGA ACGGCTTGGC CGTCGCCCTG CGCGAGTTGG GGCGCATCGA GCGCACGCTG
TTCATCCTGG ACTGGCTGCA AAGCGTCGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC
AAGGGCGAGG CGCGCAACGC ACTGGCCCGT GCCGTGTTCT TCAACCGCCT GGGGGAAATT
CGTGACCGCA GTTTCGAGCA GCAGCGCTAC CGCGCTTCCG GCCTCAATCT GGTGACGGCG
GCCATCGTCC TTTGGAATAC GGTCTATCTG GAACGGGCCG CGAACGCCTT GCGTGGCCAC
GGCCAGACCG TCGATGACGG TCTGTTGCAG TACCTGTCGC CGCTCGGCTG GGAACACATC
AACCTGACTG GCGATTACCT CTGGCGTAGC AGCGCCAAGA TCGGCGCAGG CAAGTTCAGG
CCGCTACGGC CGCTGCAACC GGCTTAG
 
Protein sequence
MPRRSILTAA ERASLLALPD TEDELIRHYT FSEADLSLIR QRRGDANRLG VAVQLCLLRF 
PGQGLLPDAA VPMSLLQWIG RQLRLDPVCW PQYAEREETR REHLLELRAY LGMEPFGLAH
YRQAVHATTE LALQTDKGIV LAASVLDALR HRHIIIPTLD VIERVCAEAI TRANRRIYDA
LTEPLSDGHR RRLDDLLKRR DNGQTTWLAW LRQSPAKPNS RHMLEHIKRL KAWQALDLPS
GIERLVHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQSGRDPFAA IEAVMSWDAF
AESVTEAQRL AQPEDFDFLH RIGESYATLR RYAPEFLDVL KLRAAPAAKD VLEAIDVLRG
MNSDNARKVP ADAPTDFIKP RWQKLVMTDA GIDRRYYELC ALSELKNALR SGDIWVQGSR
QFKDFEDYLV PPAKFASLKR ASELPLAVAT DCDQYLHDRL TLLEMQLATV NRMALANDLP
DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFTHLKS
GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YGAALAELVN
AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYAPFH
TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG
DTKLYIPKGD AAYDALKPMI GGTLNIKRVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY
PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI
RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRGH GQTVDDGLLQ YLSPLGWEHI
NLTGDYLWRS SAKIGAGKFR PLRPLQPA
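
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal sketch in plain Python, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to the 64 codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons and strip a trailing stop codon, if any."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


first_line = "ATGCCCCGAC GCTCCATTCT GACCGCCGCC GAGCGGGCAA GCCTGCTAGC GTTACCCGAT"
print(translate(first_line))  # MPRRSILTAAERASLLALPD
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check, and `gc_content` on the same input can be compared with the GC content field.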