Gene Mpop_3442 details


Gene Information

Locus tag: Mpop_3442
Symbol:
ID: 6311276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 3670751
End bp: 3674017
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 76%
IMG OID: 642652166
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_001926128
Protein GI: 188582683
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
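
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3670751, 3674017)
print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of the record.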


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGA CTGCCCCGAT CAGCCTCGAC AGCTTCGACC CGGAGGTGCT CGCGGCTGCC 
CGCGAGGTCG CCCGGCGGGC GGGGGTGCCG CTGGAGACCT GGATCGCCTC GGTGGCTGCG
CCCACCCCGT CGAAGCCGGC ACCGCGCCGC CGACGCTCGG ACGCCGCGCC ACCATCCGCC
AAGACAGCGA ACAAGCCGGC GGGCACGGAG CCGGTGCGGC AGGTCGGAGG ACGGGCGGCG
GAACCGGCCT CCGCGTCCGA CCCCGGCGCG GCGGCCCCGC AGCACAAGCG CCGCGAATCC
TCCGCCGCTC CGGCGACCGA GGCCTCCGCG ATGGCCTCGC TCGAAGCCTC GCTCGGCGCG
ATGATGCGGC GGCTCGACGC CCTCGACCAC TCGATCAGCG CGGAACGCGA AGCGTCCAAG
GCCGATGCCG CGCGGATGGC CGGCGACATC GAGCGCCGAA TGGGCGAACT GGCCGGACAA
CTGAACACCC CGCGACCGCT GGGCCGGCGC GGGCGCCCGC TCGCCAGCGA GGTCCGCGAC
GCCGTCGCCG AGGTGCGACG CCGCCAGCGC GAGCTGGAGG AGGGGATCGC CGAGTGGAAC
GCCGACACCG CCGTTGACGC CGCCGACGCC GCCGCCGGGC GGCAGGACGA CGGACCGGTG
CGGACGGAGA TGCAGGCCGA TACCGTCTCC GGCGCGATAC CGTCGGCGGC GATCGCCGAG
CTGCAGCGCG AGACAAGCCG GCTGCGCGAC ACGCTCGGCG GCCTCGCCAC CGGCCGCGAC
GTGGGCGAGC TGGAGCAGAC CATGCACGCC GTCGCCAGCG ACCTCCGGCG AGCACGCGAC
CCGCGCGAAG TCGCCGCCAT CGCCGCACCC ATCGAGCTGA TGCGCGTCCA GGTGGAGCGG
CTGGCCGAGG ACGTGGCCGA CAACGTCCAT GCCCGCGTCG CGGGCGAGGT GGAGCGCCTG
GCCTTCAAGG TCGACGAGGT TCTTTCCGGC GCCCTGTCGG GTCCGGCCGA CCAGAGCGCC
CTCGAGAGCG TCTTCCGCGA ACTCGACGAG ATCCGCCGCC TCGTGGCCGC CCTGGCGGGA
CCGGAGCGGA TCCAGAGCCT TGCCCAGGGT GTGCAGGCGA TCAGTGCGCA GATCGCGCAG
CTCCAGCGGG ACGAGGATGC GGGGCTCGCC GCCATCAAGC CGCTGCTGGA GGAGATCCGC
GGCGAGCTGA AGGCGCCCGC CGCCGCCCAG GAGCTTCCCG GCGCGCTGCT CGGCCGGTTC
GAGGCGCTGG CCGACCGGCT CGACCGGGCC GAGGCCGGCT CCGTCGGCGA CCTGATCGAG
CGGCTCGAAG CGGTGGCCGA GAAGGTCGAT CGCGTCAGCG CGGGCGGCAG CGGCCTCGAT
GCCCTGGAGC GCCACGTCCT CGCCCTCGCG AGCCGGCTCG ACGCCCCGCG CACCGCCGAT
CCGGCCGTGG CGCGCCTCGA ACGCTCCATG GGCGACCTGC TCGCCCAGGT CGCGGCCCTG
CGCGACGGCG CCGGTCTGGA CGCCGCCGTC GAGCGCGCCG TCCGCGAGGC CATGTCGGGC
TCGGCTGGCC CGCTCGCGGC GGCCGGCGGC GAGGTTGAGC TGCTGCGGGC CGATCTCGCC
GAGATCCGCG CCCACCAGAA GGGCTCGGAC CAGCGCCTCC AAACGACGAT GGAGGGCGTG
CAATCGGTGC TGATGCGGCT GAGCGAGCAG CTCGATCGGA CCATGACCTC CGCCGCCGCG
CTCACCGCGG GCGCTCCGCA GGAGCACGCC CCCGCCGCGC CGCCGCGGAG CGAGCGCATC
GCCCAGGAGC GCCCGGCCGC CCCGAAGACG GCCGGCCAGC CGCTGGGCCG GCCGCACCAG
ACCCCGAAGC CCGACGCCGC GGCCCCTTCG CAGGCGAGCC GCATGTCCGA GGAGCTGCTG
GAGCCCGGCG CCGGCCGGCC CGCGGCCGGG CGCCCGGCCG CGCCGGAGGC GGGATCCTCC
GCCGGCAGCG CCGACATCAA GACGAGCTTC ATCGCCGCCG CGCGGCGCGC AGCCCAGGCG
GCCCAGGCCG AAACCGCCGC CGAGACCCCG CTGACGGCGC GCCTGCGCGA CAAGGTCGCC
CCGGCCGCCC GCATGCCGGG CGCTGCCGGC GCGGAGGCGA CGCCGCTCTC GCGGCTCCGC
GGCGCCCTCG ACGGCCGCCG CCGCACGCTG CTGCTCGGTC TCGCCGCCGT GGTGCTGGCG
CTCGGCGCCT ACCAAGCCTT CCTCGCCGGC AAGAGCATGC CGGCCGGCGA AGCCGCCGCG
CCGGAGAGCC CCACGGTGGC AAGCACCGCC CCCGCCGCCT CCGAGGCCGG CACGGCCCGC
AGCGCGGAGA CCAAGACCGA GCCCGGTTCG GAGCCGGTGT CGGCCGCGCC TGCGGAGGCT
TCGGCCGAGG TGCCGGCACG GGCCAATCCC CCGGCGGTTC CGGCCGATCC GGCGACGACG
CAATCCATCG CCGACGCGAA ATCCCCGTTG GTCAAGCGCG GGCTGCCCCA GGTCACCGGC
ATGGCCGCCC TGGGGTCGGA CCTCGCCCCC CTGCCGCCGG GCCTGGCCAA GCTCAAGCAG
GATGCCCTCG ACGGCGACGG CGCCGCGGTC TGGGAGATCG CGTCCCGCGA GGCCGAGGGC
CGCGGCGTGA CGCGCGACCT CGGCCTCGCC GCCAAGCTCT ACGAGAAGCT CGCGACGGCC
GGCTACGGGC CGGCCCAGTT CAAGACCGGC AATGCCTACG AGAAGGGTTC CGGCGTGGTC
CGGGACATCG AGAAGGCGAA GGTCTGGTAC GGCCGCGCGG CGGATCAGGG CAACATCCGC
GCGATGCACA ACCTCGCGGT CCTGCACGCG GAGAATCCGG CGGCCAACGG CCGGCCGGAC
TTCGCGAGCG CCGCGAACGC CTTCCGCCGG GCCGCGGAGC ACGGCGTGCG CGACAGCCAG
TACAATCTGG CCGTTCTCTA CGCCCGCGGA CTCGGCGTCG AGCAGAACCT CGTCCAATCC
TATCTCTGGT TCTCCGCCGC CGCGGCGCAG GGCGATCAGG AAGCCGGCCG CAAGCGCGAC
GAGGTCGCCA CCAAGCTCTC GCCGAAGGAT CTCACCGAGG CCAAAACCCT CGCCTCCGGC
TTCCGGGCGC GGGCCGCCGA TCCGGCCGCG AACGAGGCGC CGTCGCCGAA AGCCACCGCT
GCGGCGCCGA TGTCCCTGAT GGGCGCGCCG TCTCCGGGCC TGCCGAACGC CCCGTCCCAC
GCGGCGCAGA AGCGGGTCGG GGTGTAA
 
Protein sequence
MKQTAPISLD SFDPEVLAAA REVARRAGVP LETWIASVAA PTPSKPAPRR RRSDAAPPSA 
KTANKPAGTE PVRQVGGRAA EPASASDPGA AAPQHKRRES SAAPATEASA MASLEASLGA
MMRRLDALDH SISAEREASK ADAARMAGDI ERRMGELAGQ LNTPRPLGRR GRPLASEVRD
AVAEVRRRQR ELEEGIAEWN ADTAVDAADA AAGRQDDGPV RTEMQADTVS GAIPSAAIAE
LQRETSRLRD TLGGLATGRD VGELEQTMHA VASDLRRARD PREVAAIAAP IELMRVQVER
LAEDVADNVH ARVAGEVERL AFKVDEVLSG ALSGPADQSA LESVFRELDE IRRLVAALAG
PERIQSLAQG VQAISAQIAQ LQRDEDAGLA AIKPLLEEIR GELKAPAAAQ ELPGALLGRF
EALADRLDRA EAGSVGDLIE RLEAVAEKVD RVSAGGSGLD ALERHVLALA SRLDAPRTAD
PAVARLERSM GDLLAQVAAL RDGAGLDAAV ERAVREAMSG SAGPLAAAGG EVELLRADLA
EIRAHQKGSD QRLQTTMEGV QSVLMRLSEQ LDRTMTSAAA LTAGAPQEHA PAAPPRSERI
AQERPAAPKT AGQPLGRPHQ TPKPDAAAPS QASRMSEELL EPGAGRPAAG RPAAPEAGSS
AGSADIKTSF IAAARRAAQA AQAETAAETP LTARLRDKVA PAARMPGAAG AEATPLSRLR
GALDGRRRTL LLGLAAVVLA LGAYQAFLAG KSMPAGEAAA PESPTVASTA PAASEAGTAR
SAETKTEPGS EPVSAAPAEA SAEVPARANP PAVPADPATT QSIADAKSPL VKRGLPQVTG
MAALGSDLAP LPPGLAKLKQ DALDGDGAAV WEIASREAEG RGVTRDLGLA AKLYEKLATA
GYGPAQFKTG NAYEKGSGVV RDIEKAKVWY GRAADQGNIR AMHNLAVLHA ENPAANGRPD
FASAANAFRR AAEHGVRDSQ YNLAVLYARG LGVEQNLVQS YLWFSAAAAQ GDQEAGRKRD
EVATKLSPKD LTEAKTLASG FRARAADPAA NEAPSPKATA AAPMSLMGAP SPGLPNAPSH
AAQKRVGV
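
The gene and protein sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below is illustrative only: it shows one way to strip the formatting, compute the GC percentage, and translate the gene for comparison with the listed protein. The helper names are hypothetical. The codon table is built by hand rather than taken from a library; it relies on translation table 11 using the same amino acid assignments as the standard code (the two differ in which start codons are permitted).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line fragment of the gene sequence, as printed in the record.
print(translate("ATGAAACAGA CTGCCCCGAT CAGCCTCGAC"))
```

Pasting the full gene sequence into `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.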