Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A1511 |
Symbol | |
ID | 3626912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 1868521 |
End bp | 1871181 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637700397 |
Product | peptidase family protein U32 |
Protein accession | YP_305044 |
Protein GI | 73669029 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCTG AAAAACCGGA ACTGCTTGCC CCGGCAGGAG GCCTGGAAGC TTTTATTGCA GCCGTAGAAA ATGGGGCAGA TGCCGTGTAC CTCGGAGCTC GGGCTTTCAG TGCTCGGGGG TATGCCTCAA ACTTTTCAGA GAAAGAGCTT GAAGAAGCCG TCGATTATGC TCACCTGAGA GGCGTGAAAG TATATGTAAC CGTAAATACG CTACTCAAGG ACGAAGAAGT AGAAAATGCC CTCAAGTTGC TTTCCTGGCT GAGAGAAATT GGAACTGACG CAATCATAAT TCAGGATCTA GGGCTTATCT CTCTTGCAAG AAAATATCTG CCTGACCTTT CACTGCATGC GAGCACGCAA ATGACTTTGC ATAATAGCGA AGGGGTTGAA TTCGCAAAAA AAATGGGAAT AGAGAGGGCT GTGCTCTCAA GGGAATGCTC GCTTGAAGAG ATAAAACGAA TTAAAGAGAA AACAGAGACT GAAATCGAGG TTTTCATACA TGGGGCTCTC TGTATATCCT ATTCTGGCCA GTGCCTTTTG AGCAGTCTTA TAGGGGGAAG AAGCGGAAAC CGTGGTTTCT GTGCCCAGCC GTGCCGTAAA AAATACAGGC TGTACTGCGA AGGGAAACAA ATCAAAACAA CAGGCAGTTA CCTCTTAAGC CCGAAAGACC TTAACACGAC TGCAGGCCTT GGAGCCCTTA TAAAAACCGG GATTGAATCT TTCAAAATTG AAGGCCGGAT GAAAAGGCCG GAATATGTTG CAGGTGTTGT CAGGATTTAC CGCCATCTGA TTGACAGGTA TATTGAAAAC CCTGCAGAAT ATTTCGTTTC CGAAGAAGAG CAGGAAATAC TTACCCAGCT TTTTAACAGG GGTTTTACTC AGGGTTACTT TTTTGAAAAC CCGCGCTGGG AGCTCATGAA CAGGGAAAAC CCCCACAACC GTGGAGTTCT GGCAGGTACG GTTACCGGGT ATGACAGGCG TTTAAACCGT ATCCGCGTGA AACTTTCCCG GCCTCTTCGT CTCGGAGACG GGATAATGGT TGAAAATGCA GAAACCAGGT CAGAAGATAA AGGAAAAATC GTATCCTCAA TGTATACCGG AAAAGGTCCA GTATACAGTG CAAGAGTAGG AGATACGGTA GAAATTCCTT TCGATTCAAG GGCACCCTCA GGAAGCACAG TATACAGGAC GCATGACAAA AAGCTTATGG ATTCCCTTAA AAAAGAAAGT GAGTCAGGAA GCCTGAGGCC TAAAATCCCT GTGTCTATTA CGGCAATCAT TGAGTCTGGA AGACCTGTCA GGCTTGAGGT AAAAGACAGG GATTCAAATA CAGTAATAGT TGAGTCCGGG TATCTTGTCG AGAAAGCCGA AAAGCAGCCA ACTTCAAAGG CCCGTATCGA AAAACAACTT TCAAAACTCG GAAACACGAT TTTTGAGGCT GCTGAATTTT ATGTCAAAAT GGAAGAAGAT GTTTTTATTC CTGTAGGGCA GTTGAATGAG CTGAGGACAA AAGCAATAGA GCAGCTTGAA AATCTGCGGA TTTCCAGGTG GAAGCGGAAA CCTCTTGATA TTCTACAGTT CTACGACTCT GGAGAAAAGG GATCCAAAGA GAAGGAATCC GTAGAAAAAG AATTCGTAGA AAAGGAATCT GTAGGAAAGG AATCTGTAGG AAAGAAATTC GTAGAAAAAG AATCCATAGG AAAAGAATCC GTAGAAAAGG TGGGACAGGA AGCCAAAAAA ACTATTCAAG AGCGCCTTTC TACATATCCA TTGCTCTCAG TTTCCGTGTA TTCCCTTGAA GGGCTCGAAG GAGCACTCGC AGGTGGAGCC GATAGGATTT ATTTCGGCGA AGGGCTTTTT AGAAAGCCGA AAAAAACAGA GCAAGAAAAT GGATCGGCAA AGGGACCTGA CGCAGTTTTT GAGAAAGCAG TATTGGAGAC TCAAAAGGCA GGCAAGAATA TCTATTTCAT TACTCCAAAG CTTGTTAAAG ACTCCAGAAT GGAGTCCGTA GAAAAAATTA TCTCTCGTGT AAGGAAACTG GGAGCTGACG GGGTCCTTGT TTCAAATCTG GGAACCCTTG GCCTCGCAAA AACTGAGAAA ATCCCTTTTA TTGCAGACAG TCCTCTTAAC ATCTTCAACA GCCGCACTTT TGACTTTATA TTGAAGGAAG GGGCTCAGAT GGCTGTAATT TCTCCGGAGC TGACCCTTGA GGAATTAAAA AGCATTGCGT CTCATGGGCC TGCAGAGTGT ATTATTTACG GCAGGCTTGA ACTGATGGAG TCCGAACACT GCCTTATAGG CGGGCTGCTC GGGAACAATA AAGGTCAATG TAGTGCCCCA TGCAGGTCAG GAAAGTTTAC ACTGGTAGAT GAGAAAAACT ACGAATTTCC CCTGCTCATG GACTATGAAT GCAGGATGCA CCTCCTTAAC TCAAGATCGC TCTGCATGCT TGAATACCTC CCTGAAATCC TCGAAAGCGG GGTTTCAAGC CTCAGGGTCG AAACTCTGGG AATGGATAAT CCAGAAGAGA TCCGAAAAGT GACCCGAGCC TACAGGAGAG CAATTGATAC TTACCTTGAA ACCGGGAAAA AGGGGCAGGA AAACTGCGAA AAGCTTGGAA AAGGCTTTAC TACAGGACAT TATTTCAGAG GTGTGCAGTA A
|
Protein sequence | MKPEKPELLA PAGGLEAFIA AVENGADAVY LGARAFSARG YASNFSEKEL EEAVDYAHLR GVKVYVTVNT LLKDEEVENA LKLLSWLREI GTDAIIIQDL GLISLARKYL PDLSLHASTQ MTLHNSEGVE FAKKMGIERA VLSRECSLEE IKRIKEKTET EIEVFIHGAL CISYSGQCLL SSLIGGRSGN RGFCAQPCRK KYRLYCEGKQ IKTTGSYLLS PKDLNTTAGL GALIKTGIES FKIEGRMKRP EYVAGVVRIY RHLIDRYIEN PAEYFVSEEE QEILTQLFNR GFTQGYFFEN PRWELMNREN PHNRGVLAGT VTGYDRRLNR IRVKLSRPLR LGDGIMVENA ETRSEDKGKI VSSMYTGKGP VYSARVGDTV EIPFDSRAPS GSTVYRTHDK KLMDSLKKES ESGSLRPKIP VSITAIIESG RPVRLEVKDR DSNTVIVESG YLVEKAEKQP TSKARIEKQL SKLGNTIFEA AEFYVKMEED VFIPVGQLNE LRTKAIEQLE NLRISRWKRK PLDILQFYDS GEKGSKEKES VEKEFVEKES VGKESVGKKF VEKESIGKES VEKVGQEAKK TIQERLSTYP LLSVSVYSLE GLEGALAGGA DRIYFGEGLF RKPKKTEQEN GSAKGPDAVF EKAVLETQKA GKNIYFITPK LVKDSRMESV EKIISRVRKL GADGVLVSNL GTLGLAKTEK IPFIADSPLN IFNSRTFDFI LKEGAQMAVI SPELTLEELK SIASHGPAEC IIYGRLELME SEHCLIGGLL GNNKGQCSAP CRSGKFTLVD EKNYEFPLLM DYECRMHLLN SRSLCMLEYL PEILESGVSS LRVETLGMDN PEEIRKVTRA YRRAIDTYLE TGKKGQENCE KLGKGFTTGH YFRGVQ
|
| |