Gene Mext_4558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4558 
Symbol 
ID5835233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5088504 
End bp5090849 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content66% 
IMG OID641370352 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001641997 
Protein GI163853954 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0372374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA TCTCGGTCAC CTGGGCCAAT GAATACTCGA TGGGGTTGGC CGGGTGGCAG 
GCTGGTCTAA TCCTCGCGAC GGCACAGGCG GCTGCCGACG CGTGGGGCAA GTACATCCAA
GTCGAGAGCA CGTTCGATAT CGCGATCGGG ATCGGCAATG TCGGCGGCGC CCTCGCCAAC
GGCGGCCCGC AATGGACCTG GAACGGGAGA TGGTGGGAAA ACACCCCCAT CCTGAAGGCC
CGCGAGGGGT ATGACCCGAA CGGCGGCGCG CAGGACGCCA ACCTCACGCT CGGTGAGCAT
ACGTTCCGCG ATTGGTTCTA CGATCCGACA GGTGTTGCGG CCGCGCCCGG CAACAGGTTC
GACGCCTTCA CCATTTTCCA GCACGAGATC GGCCACGGGC TGGGCTTCCT CGAATTCCCG
GCCACGGTCG GAGCCAACGG GCTGCTCCTC TTCAATGGTG AGAACACGCG GACGGTTCTC
GGCGGCCCGG TCGTGCTCGA TGGGGCCCGG GCGCATGTGT GGGGATCCGA GGATCTCATG
GATCCCTATC TCGGCTGGGG CCAGCGGTCC TACATCTCGG ACCTCGACCT GGCGATGCTT
CAGGACAAGG GGATGCCGAT CTCGACCGAG CGGGCCGACA AGATGTGGCT CGGCAACCGG
GCCGACACCT TCTTCGCCTA TGGCGGTGAC GACTGGATTG ACGGCGGCTG GGGCGACGAC
AAACTCTTCG GCGGCGTGGG CAACGACACG CTCATCGGCG GGGACGGGAA CGACGCGCTG
GACGGGGGCG CGGGCGACGA CACGCTCATC GGCGGGAACG GGGGCGACAC GCTGGAAGGG
GGCGAGGGCA TCGACAGGGC CGTGTTGACG GGCCGGAGCC AGGACTACGT CGCGCTCTAC
GGCGCCGCCG GTTCACTCGC CATCAAGCAC CTCGGCTCCG GCAACATCGA CACGCTGACC
TCGATCGAGA CGCTGAACTT CGACGACACC GTGCTCGGCA TGTCGGGCCT CCTCGCCCAT
CTGCAGCAGC GCTTCGGGCC CGTACAGGCC GGCAAGGCGC CCTACGAGAT GGCCCTGAAC
GCCGCCGCGG ACGAGGCTTA TCGCCCCGAC ATCCCCGAGA TCGACGGCGC GTTCAGCGTC
ACCGCGCGGG TGCGGTTCGA CGACCTCGCC GGCGGCCATT TCCAGCGCGT TTTCGATACC
GGCAACGGTC CGGACAGCGA CAACATCTGG CTCGGTCAGG TCGGCAACGG ACGGGACATG
GCCTTCGAGA TCCTCGACGG CGCGGTCAAG CATAGCATCA CCGCCAGGGA CGGCATCACC
CAGGGCGTCG AGGCGCGCTG GACCGCCGGT GTCGACGAGC GCGGATGGAT GTCGCTGTAC
AAGGACGGTG TGCTCGTCGC CGAGGGTCAG GGGGTGGTGC CGCGCGACCT GACGCGCGCG
AAGGATTTCG TGGGCCAATC CAACTGGGCG CAGGACGCTG CGCTGAAGGG CGGCATCTAC
GACCTGACCT TCAAGGACAA CCTGCCCGAC ATCCACGGCG CCTTCACGGC CAGTGCGACG
GTCCGGTTCG ATGATCTCGA TGCCGGCGCG TGGCAGCGGG TCTACGACAT CGGCAACGGT
GCCGGCGCCG AGAACGTCTT CCTCGGCCAG ATCGGCACCT CCAACGACAT GCAGTTCATG
ATCCTGAACG GCACCAGCTC CGCCAACATC GTGGCCAAGG GCGCGATCGT CGAAGGCCAG
GAAGCGACCT GGACGACCAG CGTCAACGAG ACCGGTTGGA TGCGCCTGTT CAAGGACGGC
GTGCTCCTGG CCGAGGGACA GGGCATCGTC CCGAAGGACG TCGTGCGGAC CAACGAGTTC
GTCGGCAAGT CCAACTGGAC GTGGGACAAA CCGCTCGTCG GCGAGGTGAG CGACCTGACC
ATCACGCCGT TCAAGGGCAT CCCGGAGATC GACGGCGCCT TCAAGATGTT CGCCGAAGTC
CGCTTCGACG ATCTCGCCCA CGGAAACTAT CAGCGCGTGT TCGATACCGG TAACGGGCGG
GACAGCAACA ACATCTGGCT CGGTCAGGTC GGCAACGGTG ACGACATGGC CTTCGAGATC
CTCACGGGCG CCACGAAACA CCGGATCACG GCGGCGGACA CCATCGTCGA GGGTGAGATG
GCGAAGTGGC AGGCCAGCGT GGACGAGGCC GGCTACATGC GCCTGATCAA GAACGACAAG
CTCGTGGCCG AGGGCCAGGG CGCGGTTCCG CTGGATGTGC TGCGCACCAG CGATCTGGTC
GGCCAGTCCA ACTGGTCGTG GGATACCGCG CTGGCCGGAC AGGTGAAGGA TCTGATCTTC
GCCTGA
 
Protein sequence
MAKISVTWAN EYSMGLAGWQ AGLILATAQA AADAWGKYIQ VESTFDIAIG IGNVGGALAN 
GGPQWTWNGR WWENTPILKA REGYDPNGGA QDANLTLGEH TFRDWFYDPT GVAAAPGNRF
DAFTIFQHEI GHGLGFLEFP ATVGANGLLL FNGENTRTVL GGPVVLDGAR AHVWGSEDLM
DPYLGWGQRS YISDLDLAML QDKGMPISTE RADKMWLGNR ADTFFAYGGD DWIDGGWGDD
KLFGGVGNDT LIGGDGNDAL DGGAGDDTLI GGNGGDTLEG GEGIDRAVLT GRSQDYVALY
GAAGSLAIKH LGSGNIDTLT SIETLNFDDT VLGMSGLLAH LQQRFGPVQA GKAPYEMALN
AAADEAYRPD IPEIDGAFSV TARVRFDDLA GGHFQRVFDT GNGPDSDNIW LGQVGNGRDM
AFEILDGAVK HSITARDGIT QGVEARWTAG VDERGWMSLY KDGVLVAEGQ GVVPRDLTRA
KDFVGQSNWA QDAALKGGIY DLTFKDNLPD IHGAFTASAT VRFDDLDAGA WQRVYDIGNG
AGAENVFLGQ IGTSNDMQFM ILNGTSSANI VAKGAIVEGQ EATWTTSVNE TGWMRLFKDG
VLLAEGQGIV PKDVVRTNEF VGKSNWTWDK PLVGEVSDLT ITPFKGIPEI DGAFKMFAEV
RFDDLAHGNY QRVFDTGNGR DSNNIWLGQV GNGDDMAFEI LTGATKHRIT AADTIVEGEM
AKWQASVDEA GYMRLIKNDK LVAEGQGAVP LDVLRTSDLV GQSNWSWDTA LAGQVKDLIF
A