Gene Mext_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3246 
Symbol 
ID5835519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3600696 
End bp3604133 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content73% 
IMG OID641369046 
ProductSel1 domain-containing protein 
Protein accessionYP_001640704 
Protein GI163852661 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00516483 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAAAAGG TCGTTTCAGT CCGATCCGCA GTCCGCCGTT TCACGCCTTG CGCGAAACGA 
TGGTCCGCCG GGCACCGGAA GCCGCCGACT TCGGATCTCC GCGAGATGAA ACAGACTGCC
CCGATCAGCC TCGACAGCTT CGACCCGGAG GTACTCGCGG CTGCCCGCGA GGTCGCCCGG
CGAGCGGGGG TGCCGCTGGA GAGCTGGATC GCCTCGGTGG CGACACCCGA CCCGTCGAAG
CCGGGTCCGC GGCGGCGGCG CGCGGATGCG ACGTCCCCAG CCAGGGAGGC CGGCGCGGAG
CCGGCCCGGC ACGTCGGAGG CCGGGCGCCG GAGAAGGCGC CCACGCCCGC CTCACGGAAG
GACGGGCCAC AGCACAAGCG CCGGGAAGCC ACTCAGGGCG CTTCGGCCGC CGAGGCGTCC
GCAACGACAT CGCTGGAAGC CTCGCTCGGC GCGATGATGC GGCGGCTCGA CGCCCTCGAC
CGCTCGATCA GCGAGGAGCG CGAGGCCTCC AAGGCCGATG CCGCCCGGAT GATCGACGAG
ATCGAGGCGC GGTTGACCAC CGCCCGCCAG CCCGTCGCAC CGGAAATGAT CGCCGGACGC
ATCGCCGACA TCGAACGCAA GATGGGCGAG ATCGCCGGCC AGCTCGACAC GCCCCGCCCG
CTGGGTCGGC GCGGGCGCCC GCTCGCCACC GAGGTCCGCG ATGCCGTCGC CGAGGTGCGC
CGCCGCCAGC GCGAGCTGGA AGAGGGCATC GCCGAGTGGA ACGCCGCCAG CGCGGGAGAA
TTGAAGGACC GGCCGCAGGA TGGCCCGGCC GCACTTGCGG TGACGCCTGC CGAGGAAGGC
TCCGGCCAGG AGCCGTCCCC GGCCATCGCC GAACTGCAAC GCGAGACGAA CCGGCTGCGC
GACGCGCTCG GCAGCCTCGC CACCGGCCGC GACGTCAGCG AGCTGGAGCG GACCATGCAG
GCCGTCGCCA GCGACCTTCA GCGGGCGCGT GCCCCGCAGG AGCTCGCCGC CATCGCCGCC
CCGGTCGAGC TGATGCGCCT CCAAGTGGAG CGGATCGCCG AAGACGTGGC GGACAACGTC
CATGCCCGCA TCGCGGGTGA GGTCGAGCGC TTGGCCGGGA AGGTGGATGC CGTTCTCTCC
GGCGTCTTGT CCGGCCCCGC CGACCAGAGC GCCCTCGACG GCGTGTTCCG CGAACTCGAC
GAGATCCGCC GCCTCGTGGC GTCCCTGGCA GGGCCGGAGC GCATCCAGAG CCTCGCCCAG
GGCGTGCAGG CAATCAGCGC CCAGATCACC CAGCTTCAGC GGGACGAGGA TGCGGGTATC
GCCACCCTCA AGCCGCTGCT GGAGGAGATC CGCGGCGAAC TGAAGGCGCC CGATTCATCT
CGCGAGCTTC CCGGCGCGCT TCTCGGACGT TTCGAAGCGC TCGCGCAGCG GCTCGATGGC
GCCGAGTCCG GCTCCGTCGG CGAGCTGATC GAGCGGCTCG AAGGCGTGGC TGAAAAAGTC
GACCGCGTCA GCGCGGGCGG CAGCGGGCTC GATGCCCTGG AGCGCCACGT CCTCGCCCTG
GCGAGCCGGC TCGAAGCGCC GCGCGACACC GATCCGGCCG TGGCGCGCCT CGAGCGCTCC
ATGGGCGACC TGCTCGCCCA GGTCACGGCC CTGCGCAACG GAACCGATCT GGAGGCCACC
GTCGCGCAGG CGGTCCGTGA GGCCGTGGCG GGCTCGACCG CCCCGCTCGC GGCCGGCGGC
GGTTTCGAGC TGCTGCGGGC CGATCTCGCC GAGATGCGGG CCAACCAGAA GGGCGCGGAC
CAGCGCCTCC AATCGACGAT GGAAGGCGTC CAGTCGGTGC TGATGCGGCT GAGCGAGCAG
CTCGACCGCA CCATGACCTC GTCCGCGGCC CTCACCGCCG CCGCCCCGCA GGAGCGCGCG
CCCGTCGTGC TGTCCTCGGC CGAGCGCGTC TCGCACGAGC GCCCCGCCGC CCCGAAGACG
GCTGCCAAGC CGTCATCCGA GCCGTCGTCC CAGAACCTTG CCCGCCCGAA CCGCACCGCA
TCCTCCGACG AGGCCGGCGG GACGGAGGCG AGCCGCCTGT CCGACGAGTT GCTGGAGCCC
GGTGCCGGCC GGCCCGGATC CGGCCGCCCG GCCGCGCCGG AGGCGAGCCC CGCCGCGACC
GGCGGCGCCG ACATCAAGAC CAGCTTCATC GCCGCCGCGC GGCGCGCGGC GCAGGCCGCG
CAGGCCGAAT CGGCCACCGA GGCGCCACTG ACGGCGCGGC TGCGCGACAA GGTCGCTCCG
GCCCGCATGC CGGGCGCGGA GACGACGCCC CTCTCGCGGA TCCGCGGTGC CCTCGACAGC
CGCCGCCGCA CGCTCCTGCT CGGCCTTGCC GCTGTGGTGC TGGCGCTCGG CGCCTACCAA
GCCTTCGTCG CGGGCAAGGG CACTCCGACC GGAACTCCGA CCGGCGACCC GGCCGCGCCG
GAGGCTCGTC CGGTGGCGAG CACCGCCCCG GCGGCCTCCG CCGACGTCGC CGCGAGCCGC
ACCGAGACCA CGGCCGAGCC CGCTCAGGCG GCGTCGCAGA CCCAGGCTCC GTCCGAGACT
TCCCCCCAGA CCGGGACATC CGCCCAGACG ACGCCCGACC CGGCGACGAC CCAATCCATC
GCCGAGCCGA AATCCGCGCC GGCCAAGCGC GGGCTGCCCC AGGTCGCGGG CATGAGCACG
CTCGGCCCCG ACCTTGCCGG CCTGCCGCCG GCCCTGGCCA AGCTCAAGCA GGATGCGCTC
GACGGCGACG GCGCCGCGGT CTGGGAGATC GCCTCCCGCG AGGCCGAAGG CCGGGGCGTG
ACGCGCGACC TCGTGGTCGC CGCCAAGCTC TACGAGCGGC TCGCGAATGC CGGCTACGCG
CCGGCTCAGT TCAAGGTCGG CAACGCCTAC GAAAAGGGCT CGGGCGTGGT CCGGGACATC
GAGAAGGCGA AGGCGTGGTA CGGCCGCGCC GCGGATCAGG GCAACATCCG CGCGATGCAC
AACCTCGCCG TGCTGCATGC CGAGAACCCG GCGGCCAACG GCAAAGCGGA TTTCGTGACC
GCCGCGAACG CCTTCCGCCG GGCGGCGGAA CACGGGGTGC GCGACAGCCA GTACAACCTG
GCCGTACTCT ACGCCCGCGG TCTCGGCGTC GGACAGGATC TCGTCCAGTC CTATCTCTGG
TTCTCGGCCG CTGCCACGCA GGGGGACCAG GAAGCGGGCC GCAAGCGGGA CGAGGTCGCC
GCCAAGCTCT CGCCGAAGGA TCTCACCGAA GCCAGGAGCC TCGCAGCGGG CTTCAAGGCG
AAGGCCGTCG ATCCGGCCGC GAACGAGGCC CCGTCCCAGA AGGCTACCGC CGCAGCGGGG
ATGTCTCTGA TGGGCGCGCC GTCGCCGGGC ATGCCGACCG CCGCTTCCCC ATCGGCGCAG
AAGCGGTTCG GGGTCTGA
 
Protein sequence
MEKVVSVRSA VRRFTPCAKR WSAGHRKPPT SDLREMKQTA PISLDSFDPE VLAAAREVAR 
RAGVPLESWI ASVATPDPSK PGPRRRRADA TSPAREAGAE PARHVGGRAP EKAPTPASRK
DGPQHKRREA TQGASAAEAS ATTSLEASLG AMMRRLDALD RSISEEREAS KADAARMIDE
IEARLTTARQ PVAPEMIAGR IADIERKMGE IAGQLDTPRP LGRRGRPLAT EVRDAVAEVR
RRQRELEEGI AEWNAASAGE LKDRPQDGPA ALAVTPAEEG SGQEPSPAIA ELQRETNRLR
DALGSLATGR DVSELERTMQ AVASDLQRAR APQELAAIAA PVELMRLQVE RIAEDVADNV
HARIAGEVER LAGKVDAVLS GVLSGPADQS ALDGVFRELD EIRRLVASLA GPERIQSLAQ
GVQAISAQIT QLQRDEDAGI ATLKPLLEEI RGELKAPDSS RELPGALLGR FEALAQRLDG
AESGSVGELI ERLEGVAEKV DRVSAGGSGL DALERHVLAL ASRLEAPRDT DPAVARLERS
MGDLLAQVTA LRNGTDLEAT VAQAVREAVA GSTAPLAAGG GFELLRADLA EMRANQKGAD
QRLQSTMEGV QSVLMRLSEQ LDRTMTSSAA LTAAAPQERA PVVLSSAERV SHERPAAPKT
AAKPSSEPSS QNLARPNRTA SSDEAGGTEA SRLSDELLEP GAGRPGSGRP AAPEASPAAT
GGADIKTSFI AAARRAAQAA QAESATEAPL TARLRDKVAP ARMPGAETTP LSRIRGALDS
RRRTLLLGLA AVVLALGAYQ AFVAGKGTPT GTPTGDPAAP EARPVASTAP AASADVAASR
TETTAEPAQA ASQTQAPSET SPQTGTSAQT TPDPATTQSI AEPKSAPAKR GLPQVAGMST
LGPDLAGLPP ALAKLKQDAL DGDGAAVWEI ASREAEGRGV TRDLVVAAKL YERLANAGYA
PAQFKVGNAY EKGSGVVRDI EKAKAWYGRA ADQGNIRAMH NLAVLHAENP AANGKADFVT
AANAFRRAAE HGVRDSQYNL AVLYARGLGV GQDLVQSYLW FSAAATQGDQ EAGRKRDEVA
AKLSPKDLTE ARSLAAGFKA KAVDPAANEA PSQKATAAAG MSLMGAPSPG MPTAASPSAQ
KRFGV