Gene Mext_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1664 
Symbol 
ID5831694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1870610 
End bp1872835 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content73% 
IMG OID641367463 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001639134 
Protein GI163851091 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.273862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCG AGATCAAGGG CACGAAGCCC GGGCGGGACG CGCCCGAGGG TGCCGAGCGC 
TACGTCTCGC GCAGCGCCGA CCGGCGATCC GGGGTCCCCT ACTTCGCCGC GGTGGCGGTC
GCGTCCGTGG CTGCCTACCT CAAATCCCTG CTTCTGCCCC GCGCAACCCT GGCAGAGGAA
GTGGACCCGG AGATCGTGGC GGCACGCGCC GACGCCCTGC GCCTCGTCCA TTCCGAGGAG
CTCGCCGCGC CCACCCCTGC AAAGGCCGCC GAGGCACCGA AGCCACTGGC CGAGCCGCCA
TCCCCGGTTC TGCCATCCGG CATGTCCCTG TTCGACGACG GGCACGAACT GCTCGCGCGC
GGGCTGCGCT TCGCAAGGCC CGACGCGCAT CCGGACCTCG GCGACTTCAA GGCCTCTCCG
GTCATCCCCC AGCCGATCAA CGACAACGGC GGCTCCCTCG CAGCCGGATC GGGCGGGCGC
GGAGGCGGGC AGCCGAGCGG CGGCGGCAGC GGCCGGCCAC GGGACGCAAC CCACCCGGAA
CCGGGATCGC ACGAGCCCGG CGACGAGGCA CAGCCGGCCC CCCGACCGCT GGGACAGGAG
AAGGCCGACA GGCCGGATCA AGGCTCGACT GGGCAAGGCT CGACCGGGCA AGGCTCGACC
GGGCAAGGCT CGACAGGGCA AGGCGGATCG AACCCACAGG ATCCTGGGCC GTCGGACGGC
GGCGTCCGGA TCCCTCCCGG CTCCACGCCG GGCGGCGATC CGCATCCCGT CGATCCGGCG
AGCCCGCCGC GCGGCCGGGA CGACGGACCG GCGGGGCAAC GCAACCGCGC GCCCGTGGTG
AACGGCCCGG TCCAGCTCGG GGATGTCGCG GGCTGCGCAC TCCTCACCAT CGCCCTGAGC
GATCTCCTGC GCGGCGCCAG CGATCCGGAC GGAGACGCAC TCAGCGTGCG CGACGTCCGG
GTCTCCTCCG GCACCGTGAC GGCGGACGGC TCGGGCTGGG TGTTCGACGC CGACGCGCCC
GGTCCGGTCA CGATTACCTA CGCGGTGACG GACGGCGAGT TCTCGGTCGC CCGCACGGCT
CATCTGACCG TCCTCGAACG CGCCTTGATC GGCGGCACGG ACGGGGACGA CCTCATCCTC
GGCACACCCT GCGCGGACGA CATCGCGGCC GGCGCGGGAG ACGACAATGT CGACGCCCGC
GGAGGCGACG ACCTCGTCGA TGCCGGCAGC GGCAGCGACC ACGTGATCGC GGGCGACGGC
GCTGACACGG TCCTGGCCGG GCCGGGCGAC GACGTCGTGT TCGCCGGGGC GGGCGCGGAT
CGGGTCTCGG GCGGGGCGGG TCACGACCGC CTGTTCGGGG AGGCGGGCGA CGACCTTCTG
TTCGGGGAGG CGGGCGACGA CCTGCTCGAT GGCGGCGAGG GGCGCGACAT CCTGGACGGG
GGCGACGGCG ACGATCTCCT CCTTGGAGGC GAGGGCAACG ATAGCCTCTA CGGCGCGGCC
GGCGCCGACG ACCTGTCCGG TGGCACCGGT GCCGACGTGC TCGTCGGGGA TGCGGGTGAT
GATCGGCTGC AGGGCGGCGA GGGCGCGGAC ATCCTCTCGG ACGGGGCTGG GCGTGACCTC
GTGTCGGGCG AGGCGGGCGA CGACGTGATC ATCCTCGCCC TCGACAGCGC GGAGGACAGG
GTCGACGGCG GCGCGGGGCG GGACACGCTC GACCTGTCGG CGGCCACGGT CGATCTGGTG
GTGGACCTGA GGAACGAGAC CGTCTCCGCT CAGGAGCTCG GCCTCGACCG GATCACCTCG
GTCGAGGCGA TCATCGCGGG ATCGGGCGAC GACCGCTTCG TGGTGGGCGG CCGGGATCTC
GTGCTCACCG GCGGCGGCGG CGGCGACGTG TACGCGTTCG CCGCTCCGAC CGAGCCGCGC
GATGGCACCC GCACCGTGCA GATCACGGAC TTTTCCGTCG GCGATTACAT CGATCTGGTC
CGCTACGCCC TGTTCAAGGA GGAGACCGCG GCCGGGCGCC CGCTCGCGGA GGCGCTCCGG
GGCGAAAGCG ATGCACCGAC CGGGATCCAG TGCCGGTTCG ATCGTTCCGA AGGCCGGGAT
CGCACGGTCG TCTCGGCCGA CTTCGACCAT GACGAGGCCT ACGAGACCAC CGTCGTCCTC
GACGGCGAGC ACTTGCTGCG CTTCACGATC GGGCCGCTGC CCGAACCGCC GACATTCCAC
ACTTGA
 
Protein sequence
MTIEIKGTKP GRDAPEGAER YVSRSADRRS GVPYFAAVAV ASVAAYLKSL LLPRATLAEE 
VDPEIVAARA DALRLVHSEE LAAPTPAKAA EAPKPLAEPP SPVLPSGMSL FDDGHELLAR
GLRFARPDAH PDLGDFKASP VIPQPINDNG GSLAAGSGGR GGGQPSGGGS GRPRDATHPE
PGSHEPGDEA QPAPRPLGQE KADRPDQGST GQGSTGQGST GQGSTGQGGS NPQDPGPSDG
GVRIPPGSTP GGDPHPVDPA SPPRGRDDGP AGQRNRAPVV NGPVQLGDVA GCALLTIALS
DLLRGASDPD GDALSVRDVR VSSGTVTADG SGWVFDADAP GPVTITYAVT DGEFSVARTA
HLTVLERALI GGTDGDDLIL GTPCADDIAA GAGDDNVDAR GGDDLVDAGS GSDHVIAGDG
ADTVLAGPGD DVVFAGAGAD RVSGGAGHDR LFGEAGDDLL FGEAGDDLLD GGEGRDILDG
GDGDDLLLGG EGNDSLYGAA GADDLSGGTG ADVLVGDAGD DRLQGGEGAD ILSDGAGRDL
VSGEAGDDVI ILALDSAEDR VDGGAGRDTL DLSAATVDLV VDLRNETVSA QELGLDRITS
VEAIIAGSGD DRFVVGGRDL VLTGGGGGDV YAFAAPTEPR DGTRTVQITD FSVGDYIDLV
RYALFKEETA AGRPLAEALR GESDAPTGIQ CRFDRSEGRD RTVVSADFDH DEAYETTVVL
DGEHLLRFTI GPLPEPPTFH T