Gene Mext_3534 details


Gene Information

Locus tag: Mext_3534
Symbol:
ID: 5834859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3914419
End bp: 3918735
Gene Length: 4317 bp
Protein Length: 1438 aa
Translation table: 11
GC content: 67%
IMG OID: 641369331
Product: hypothetical protein
Protein accession: YP_001640988
Protein GI: 163852945
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
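
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    counted in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3914419, 3918735)
print(length)                     # 4317
print(protein_length_aa(length))  # 1438
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.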


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACATCG CCGTCCGGCA TAACCTCCAG ATCTTCGACC CGGCGGACCC TTCGCTTTGC 
GAGAGCAGCG CTATCGTGCT GCCGATCGCC GAGGCGCAGG CTGCCATGGG CGAGACCGTC
GTCGCCTACC TCGTGCGCGT CGCGTGGCGG TTCGACCTGC CGACTGTCTG CCGGATCAAC
GGTGAGTTCT ACGCTCGGGC CGATTGGGAG ACACGCGCGC TCGGCGTCAA CGAAAACGTC
GAGTTCGTCA GCCGGCCGCT TGGTGGCGGA TCGAGCGGCG GCTCCACCGG TAAGAGCATC
CTGTCCGTCG TCGCCCTTGT CGCGCTCACC GCGGTGGCGC CCTACGCCGT CGGCGCCATC
GCCGGCGCAC TGGGCACCGC GGCGCTCGGC ACGGCCTCCG CCCTCACCTT TGCCGGCAAG
CTCGCAGCGG CGGTCATCGT CGGCGCGGGC GCGCTCGCCG TCTCGCACTT CCTGAGCCCG
AAGTCCGGCG GCAAGACCAA CAGCAGCGAC GCGCTCTATT CGTTCGGCCT TCAGGGCAAC
GCGGCCCGGC CGATGCAGCC GATCCCGGTG CTGAACGGTC GCCGCAAGTT CGCCCCCGAT
TACGCCGCGC CGACCTACAG CGAGTATGCC GGCGACTCGA TGACCGACTA CGCGCTGTAT
GCGCTGACGT GCGGCCGGAT GCAGGTGGAG CAGGTGCTGA TCGGCGACAC CCCGATCTGG
CACTATGAGA GCGGCCCGAG CCCCGATTAT CCCGGCATCG AGCTTCAGAT CGTCGAGCCG
GGCGAGCAGG TCACGCTGTA TCCGGTCAAC GTGGTCACGG CCGACGAGTC GAGCGGCGCC
GAGCTGCAGC AGACCTACAC GCCCGGCTAC ATCGTCAACG CCGCCGGCAC GCGGGCCAAG
GAGCTGATCT TCGACCTCGT CTGGCCGGGC GGCGCCTACG TCACCTTCAA GGACCGGACG
CTCGCCGCAA CCACGCACGT CGAGATCCAG GCGCGGCAGG TCGATGATGC CGGCGCGCCG
ATCACCGGGT GGACGACGAT CGTCAGCGAT CCCTACGTCC AGGCGAAGCA GAGCCAGATC
CGGATGACCG TCCGCCGGAT CGTAGAACTG GCCCGGTACG AGGTCCGCGG GCGCCGGGTG
AACCCGAGCA TCAACGACAG CGGTATCGAG AAGATCGGCG GCACCGACGA TGTGACGTGG
ACGGCCGCTC GCGCGCACAT CGAGGGGCCG CAGGCCTTCC CGCGCGTCAC GACGCTCGCG
GTGAAGGGCG TTGCCTCGAA GCAACTCTCC GGCGTCTCGG GCGGGCAGCT GCGCGTCGTC
GGCACCCGCA TCCTGCCGGT CTGGCGTGAC GGACAGTTCG TGGAGGAGCC GACGCGCTCC
ATCGCCTGGG CCGCCCTCGA TTGGTGGCGC AACAGCGACT ACGCCGCGGG CCTCAGCATC
TCCGACATCG ACTTTCAGAG CTTCGTGCGC CACGCCACGC TCTGGGATGC GCTGGGCCAC
ACCTTCGACC ATCGCTTCAC CGAGGTGCAG AACCTCGACG ACGTGCTCGA GACGGTGCTG
AAGGCCGGCC GCGCCTTCCC GGCCCCGGTA GGCGACAAGC TCACCATCAC GCGGGACGAG
CCGCGCGTGC TGCCGCGCAT GCTCTTCACC GACAACGACA TCCTGCGCGA CACGCTGGAG
ATCGACTACG CGCTCTCGGA CGAGGCCTGG GCCGACGGCA TGGTCGGCGA GTACGTCGAC
GAGACCACCT GGCGCCTCGC CGAGGTCTCG TCCGCGCCCG ACGGCGTGAC CCTGCTGAAG
CCAGCCCGCG TCCAGCTGGA AGGCGTGGTC AACCGCAAAC AGGCCGCCGG CATGGTCCGG
ATGATGGCCG CCGAGAGCCA GTACCGCCGC ATCACCGTGT CTTGGACCGC CCGCATGGAA
GGGCGGCTGC TGAAGCGTGG CGACCTCGTG CGGATCACCA CCGAGGAGCC CGAGACCTGG
GGGCAGTCCT GCGAGGTCGT TGGCTTCACT CCGGCCACGC GACGCCTGAC GCTCGACCCA
GCGCCGAGAT GGGAGGCGAG TGGCAACCAC TACGTCGAGA TCCGCCGCCG CGACGGCAGC
CCGTGGGGGC CGGTGCAGGT CACCCGCGGT GCCAACGACG CGCAGGCGAT TGTCAGCGTG
TCCGAGGTGA GCGGCGTGAC GCTGGCGGAT GCGGTGGGCC GCTCCAGCAC GCAGGAGCCG
GCATGGCTCG CCTTCTCGCC CGGTCAGCCC CGCAGCTTCC CGGTGCTCAT CACCGACGGC
GACCCGGATC AGGACGGCGA GCACATCCAC CTGTCGGGTG TCATGGACGC CGCCGAGGTC
TACGCGACCA CCGAGGACGG CGTCCCACCG CTCCTGCAGA TCCCCGACCT GTTCTCGTCG
GCGCTGCCGG TCATCGCCGC GCTGATGGCG AACATCGCCC AGCGACAGGC CTCGCTGATT
CTGACGGCCG GCTGGCAGCC CGCGAAGAAC GCGGTGAGCT ACGAGGCTCA GGTCTCCTAC
GACGGTGGCG GGTCATGGAT CGGCGCCTAC GAGGGCAACC GCACCACGTT CGAAGCCGTG
GTCGGCGGGT CCGACCAGAT GCGCGTCCGA GTCCGCGGCG TCACGCCGGC CGGACCCCGT
GGTGCTTGGT CGGTGGTGGA GGTTGAGGCG CCAACCCTGG CAATCAGCGG GGATCTGTTC
GCGGACCTCT CCATCACCAT CGACAAGCTG TCGGCCGAGA CGAAGAAGGC GATCGAGGAC
CTCAACGCAC TCGGAGCGGG CGCGCTGGCT ATGGGCAGCG ACGCCACTGA ACTCGCCCTC
AGCATGGCGC GCGACACCAT CCGCGGGGCG CAGCGGCAGC TTGCCGCCTA CATTGAGCAG
CTGGCCGAGG CGCAGGCGAC ACAGGAGTTC TATTCCTACG AGCAGCGCCA GCTGATCAAG
GTCGGCAACA GCGCGAACTA TGCCGCGATC CTGCGCACCG AGCGCGCCAT GGTCGAGGCG
GACGAAGCGC TGGCTGAGAT TACAACGGCC TTGGACGTTC GCCTCGTCGA TGCACAGACC
GCTCTCGTCG GTCAGGCTAC CGCCCTGGAT CAACTGAACG CGCGGGTAAC CGTCAACGAG
GAGGGCATCA CGGCGACCGC TCAGCGCGCG ACCGCGCTGG AGAGCACCGT CAACAATCCC
ACGAACGGCG TCTCAGCTAC CGCTACGGCG GTGGACACCC TGAAGACCAC CGTCACCACA
CAGGGCCAGA ACATCACGGC TACCGCCAAC CGGACAACGG CGCTGGAAAG CACGGTCAAC
AATCCGACGA CAGGCGTGAC CGCCACGGCG ACCACGCTCG ACACCCTGAA GACCACGGTG
ACCCAGCAGG GTGGCACGCT TACCACCACG GCCGAGCGGA CAACGGCGCT TGAGAGCACG
GTCAACAACG GCACAACGGG GGTGGCCGCC ACGGCTACTG CGCTGAACAG CCTGACGACG
ACGGTTACGC AGCAAGGCGG CACGATCAGC AGCACGTCGC AGCAGCTCAC CAGCCTTGCT
AACGAGGTGC GCAACCCGAC GACGGGTCTC AGCGCTACAG CGCAGGCCGT CGACGTGCTG
GAAACACAAG TCAACGACGG CACGAATGGT CTGAGCGCTG TTGCGCAGCG GGTCTCGTCG
CTGAATACGA CCGTGGGGAA CAACAGCGCC AGCATTACGA GCCTGTTCGG CTCTTACGAC
GGCGTGAAGG TCCAGTTCGG CGTCACGGGC ACCATCGACG GCCAGACCGG CGGCTTTGTC
CTGAAAGGCA TCAGGAAACT CGATGGCTCG GTCAGCTACT CTATGCTGAT CGACGCGGAT
GTGTTCGCTC GATCGATCAC TTCCCCGCTC TTGCAGACGA CAAAGCTGAT CGCGACCTCA
GCGCAGATCG GAAATCTCAT CGTCGACAAC ATCCACGTCA AGGACGGCGG CATTTCATCG
ATGGTGTCCA ACTTCGCGGA CGCCAAAAAG GTCAGCGCAA GCATCACAGT CCGCACGGGC
GGAAAAATCA GGATCTCGAT GAGCCGATCC GGCAATGCCG GTGTCCGCTA TGCGCCTGCG
CTTGGGTACA CGTCGGGCAA CTTCCAACTC CGCCGCAATG GGTTCCTCGT CTATGAGGCC
CCCGCGGCAG TGTCGTTCCA GTGGGATCCG CAGGCCGGCG GTAATTATCT GGTGGTGATC
GCCAGCACAG CGATGGTTGA CGACACGCCG GGCCCAGGCA CGCACACCTA CGAGCTAACC
GATACCAACA ACACGCCAGT GGCTGGGGTC TATATCTCGG TTCAGGAGAG CAAGTGA
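
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row is pasted here; reproducing the gene-wide figure would require the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGCACATCG CCGTCCGGCA TAACCTCCAG ATCTTCGACC CGGCGGACCC TTCGCTTTGC"
seq = clean_sequence(first_row)
print(len(seq))          # 60 (six blocks of ten bases)
print(gc_fraction(seq))  # GC fraction of this row only, not the whole gene
```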
 
Protein sequence
MHIAVRHNLQ IFDPADPSLC ESSAIVLPIA EAQAAMGETV VAYLVRVAWR FDLPTVCRIN 
GEFYARADWE TRALGVNENV EFVSRPLGGG SSGGSTGKSI LSVVALVALT AVAPYAVGAI
AGALGTAALG TASALTFAGK LAAAVIVGAG ALAVSHFLSP KSGGKTNSSD ALYSFGLQGN
AARPMQPIPV LNGRRKFAPD YAAPTYSEYA GDSMTDYALY ALTCGRMQVE QVLIGDTPIW
HYESGPSPDY PGIELQIVEP GEQVTLYPVN VVTADESSGA ELQQTYTPGY IVNAAGTRAK
ELIFDLVWPG GAYVTFKDRT LAATTHVEIQ ARQVDDAGAP ITGWTTIVSD PYVQAKQSQI
RMTVRRIVEL ARYEVRGRRV NPSINDSGIE KIGGTDDVTW TAARAHIEGP QAFPRVTTLA
VKGVASKQLS GVSGGQLRVV GTRILPVWRD GQFVEEPTRS IAWAALDWWR NSDYAAGLSI
SDIDFQSFVR HATLWDALGH TFDHRFTEVQ NLDDVLETVL KAGRAFPAPV GDKLTITRDE
PRVLPRMLFT DNDILRDTLE IDYALSDEAW ADGMVGEYVD ETTWRLAEVS SAPDGVTLLK
PARVQLEGVV NRKQAAGMVR MMAAESQYRR ITVSWTARME GRLLKRGDLV RITTEEPETW
GQSCEVVGFT PATRRLTLDP APRWEASGNH YVEIRRRDGS PWGPVQVTRG ANDAQAIVSV
SEVSGVTLAD AVGRSSTQEP AWLAFSPGQP RSFPVLITDG DPDQDGEHIH LSGVMDAAEV
YATTEDGVPP LLQIPDLFSS ALPVIAALMA NIAQRQASLI LTAGWQPAKN AVSYEAQVSY
DGGGSWIGAY EGNRTTFEAV VGGSDQMRVR VRGVTPAGPR GAWSVVEVEA PTLAISGDLF
ADLSITIDKL SAETKKAIED LNALGAGALA MGSDATELAL SMARDTIRGA QRQLAAYIEQ
LAEAQATQEF YSYEQRQLIK VGNSANYAAI LRTERAMVEA DEALAEITTA LDVRLVDAQT
ALVGQATALD QLNARVTVNE EGITATAQRA TALESTVNNP TNGVSATATA VDTLKTTVTT
QGQNITATAN RTTALESTVN NPTTGVTATA TTLDTLKTTV TQQGGTLTTT AERTTALEST
VNNGTTGVAA TATALNSLTT TVTQQGGTIS STSQQLTSLA NEVRNPTTGL SATAQAVDVL
ETQVNDGTNG LSAVAQRVSS LNTTVGNNSA SITSLFGSYD GVKVQFGVTG TIDGQTGGFV
LKGIRKLDGS VSYSMLIDAD VFARSITSPL LQTTKLIATS AQIGNLIVDN IHVKDGGISS
MVSNFADAKK VSASITVRTG GKIRISMSRS GNAGVRYAPA LGYTSGNFQL RRNGFLVYEA
PAAVSFQWDP QAGGNYLVVI ASTAMVDDTP GPGTHTYELT DTNNTPVAGV YISVQESK
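
The record lists translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in permitted start codons). As a spot check, the sketch below translates the first row of the gene sequence and compares it with the start of the protein sequence. The `translate` helper is illustrative, not the database's own method, and it does not handle alternative start codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0; whitespace is ignored, '*' marks a stop codon,
    'X' marks a codon with an unrecognized base, trailing bases are dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First row of the gene sequence, copied from the listing above.
first_row = "ATGCACATCG CCGTCCGGCA TAACCTCCAG ATCTTCGACC CGGCGGACCC TTCGCTTTGC"
print(translate(first_row))  # MHIAVRHNLQIFDPADPSLC
```

The output matches the first 20 residues of the protein sequence listed above.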