Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1664 |
Symbol | |
ID | 5831694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1870610 |
End bp | 1872835 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641367463 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_001639134 |
Protein GI | 163851091 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.273862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCG AGATCAAGGG CACGAAGCCC GGGCGGGACG CGCCCGAGGG TGCCGAGCGC TACGTCTCGC GCAGCGCCGA CCGGCGATCC GGGGTCCCCT ACTTCGCCGC GGTGGCGGTC GCGTCCGTGG CTGCCTACCT CAAATCCCTG CTTCTGCCCC GCGCAACCCT GGCAGAGGAA GTGGACCCGG AGATCGTGGC GGCACGCGCC GACGCCCTGC GCCTCGTCCA TTCCGAGGAG CTCGCCGCGC CCACCCCTGC AAAGGCCGCC GAGGCACCGA AGCCACTGGC CGAGCCGCCA TCCCCGGTTC TGCCATCCGG CATGTCCCTG TTCGACGACG GGCACGAACT GCTCGCGCGC GGGCTGCGCT TCGCAAGGCC CGACGCGCAT CCGGACCTCG GCGACTTCAA GGCCTCTCCG GTCATCCCCC AGCCGATCAA CGACAACGGC GGCTCCCTCG CAGCCGGATC GGGCGGGCGC GGAGGCGGGC AGCCGAGCGG CGGCGGCAGC GGCCGGCCAC GGGACGCAAC CCACCCGGAA CCGGGATCGC ACGAGCCCGG CGACGAGGCA CAGCCGGCCC CCCGACCGCT GGGACAGGAG AAGGCCGACA GGCCGGATCA AGGCTCGACT GGGCAAGGCT CGACCGGGCA AGGCTCGACC GGGCAAGGCT CGACAGGGCA AGGCGGATCG AACCCACAGG ATCCTGGGCC GTCGGACGGC GGCGTCCGGA TCCCTCCCGG CTCCACGCCG GGCGGCGATC CGCATCCCGT CGATCCGGCG AGCCCGCCGC GCGGCCGGGA CGACGGACCG GCGGGGCAAC GCAACCGCGC GCCCGTGGTG AACGGCCCGG TCCAGCTCGG GGATGTCGCG GGCTGCGCAC TCCTCACCAT CGCCCTGAGC GATCTCCTGC GCGGCGCCAG CGATCCGGAC GGAGACGCAC TCAGCGTGCG CGACGTCCGG GTCTCCTCCG GCACCGTGAC GGCGGACGGC TCGGGCTGGG TGTTCGACGC CGACGCGCCC GGTCCGGTCA CGATTACCTA CGCGGTGACG GACGGCGAGT TCTCGGTCGC CCGCACGGCT CATCTGACCG TCCTCGAACG CGCCTTGATC GGCGGCACGG ACGGGGACGA CCTCATCCTC GGCACACCCT GCGCGGACGA CATCGCGGCC GGCGCGGGAG ACGACAATGT CGACGCCCGC GGAGGCGACG ACCTCGTCGA TGCCGGCAGC GGCAGCGACC ACGTGATCGC GGGCGACGGC GCTGACACGG TCCTGGCCGG GCCGGGCGAC GACGTCGTGT TCGCCGGGGC GGGCGCGGAT CGGGTCTCGG GCGGGGCGGG TCACGACCGC CTGTTCGGGG AGGCGGGCGA CGACCTTCTG TTCGGGGAGG CGGGCGACGA CCTGCTCGAT GGCGGCGAGG GGCGCGACAT CCTGGACGGG GGCGACGGCG ACGATCTCCT CCTTGGAGGC GAGGGCAACG ATAGCCTCTA CGGCGCGGCC GGCGCCGACG ACCTGTCCGG TGGCACCGGT GCCGACGTGC TCGTCGGGGA TGCGGGTGAT GATCGGCTGC AGGGCGGCGA GGGCGCGGAC ATCCTCTCGG ACGGGGCTGG GCGTGACCTC GTGTCGGGCG AGGCGGGCGA CGACGTGATC ATCCTCGCCC TCGACAGCGC GGAGGACAGG GTCGACGGCG GCGCGGGGCG GGACACGCTC GACCTGTCGG CGGCCACGGT CGATCTGGTG GTGGACCTGA GGAACGAGAC CGTCTCCGCT CAGGAGCTCG GCCTCGACCG GATCACCTCG GTCGAGGCGA TCATCGCGGG ATCGGGCGAC GACCGCTTCG TGGTGGGCGG CCGGGATCTC GTGCTCACCG GCGGCGGCGG CGGCGACGTG TACGCGTTCG CCGCTCCGAC CGAGCCGCGC GATGGCACCC GCACCGTGCA GATCACGGAC TTTTCCGTCG GCGATTACAT CGATCTGGTC CGCTACGCCC TGTTCAAGGA GGAGACCGCG GCCGGGCGCC CGCTCGCGGA GGCGCTCCGG GGCGAAAGCG ATGCACCGAC CGGGATCCAG TGCCGGTTCG ATCGTTCCGA AGGCCGGGAT CGCACGGTCG TCTCGGCCGA CTTCGACCAT GACGAGGCCT ACGAGACCAC CGTCGTCCTC GACGGCGAGC ACTTGCTGCG CTTCACGATC GGGCCGCTGC CCGAACCGCC GACATTCCAC ACTTGA
|
Protein sequence | MTIEIKGTKP GRDAPEGAER YVSRSADRRS GVPYFAAVAV ASVAAYLKSL LLPRATLAEE VDPEIVAARA DALRLVHSEE LAAPTPAKAA EAPKPLAEPP SPVLPSGMSL FDDGHELLAR GLRFARPDAH PDLGDFKASP VIPQPINDNG GSLAAGSGGR GGGQPSGGGS GRPRDATHPE PGSHEPGDEA QPAPRPLGQE KADRPDQGST GQGSTGQGST GQGSTGQGGS NPQDPGPSDG GVRIPPGSTP GGDPHPVDPA SPPRGRDDGP AGQRNRAPVV NGPVQLGDVA GCALLTIALS DLLRGASDPD GDALSVRDVR VSSGTVTADG SGWVFDADAP GPVTITYAVT DGEFSVARTA HLTVLERALI GGTDGDDLIL GTPCADDIAA GAGDDNVDAR GGDDLVDAGS GSDHVIAGDG ADTVLAGPGD DVVFAGAGAD RVSGGAGHDR LFGEAGDDLL FGEAGDDLLD GGEGRDILDG GDGDDLLLGG EGNDSLYGAA GADDLSGGTG ADVLVGDAGD DRLQGGEGAD ILSDGAGRDL VSGEAGDDVI ILALDSAEDR VDGGAGRDTL DLSAATVDLV VDLRNETVSA QELGLDRITS VEAIIAGSGD DRFVVGGRDL VLTGGGGGDV YAFAAPTEPR DGTRTVQITD FSVGDYIDLV RYALFKEETA AGRPLAEALR GESDAPTGIQ CRFDRSEGRD RTVVSADFDH DEAYETTVVL DGEHLLRFTI GPLPEPPTFH T
|
| |