Gene Information |
Locus tag | Mboo_1489 |
Symbol | |
ID | 5410409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 1540813 |
End bp | 1543038 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640868724 |
Product | Legumain |
Protein accession | YP_001404650 |
Protein GI | 154151032 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
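The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mboo_1489
length = gene_length_bp(1540813, 1543038)
print(length)                     # 2226
print(protein_length_aa(length))  # 741
```

Both results match the Gene Length (2226 bp) and Protein Length (741 aa) rows.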
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.26033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGATACGAA TCGAACAAAA AAAACGCTGG CCAAAGTTCC TGCTTATTGC TGTTGTCATC CTCATTATCG CAGCCGGAGC TGTGGCCGGG TATACGATCC TGCATGCACG TACTCCCGTC CGGATCGGTG TCCTCCTCCC CATCACCGGT GGGGTAGATA TCAAAGAACC GCTCGAATGG GCAAAAGATA CCATTAACCA GCAGGGTGGC ATCGGAGGGA GGCAGGTCGA GCTTGTATAC ATGGACACCG GCACGGGTAA CACCACGCAG ATGGCCGAAG AACTCCTTAA TGACGATTCT GTACAGATTG TCATCGGGCC CCGGAACAGC GACGATGTCT TCACGCTTGC CCCGGAATTT ATCAAAAAGA AAAAACTCCT TATCAGTCCG ATGGCCACCG CCGGCAGCAT CACTCTTGCA TTTGGAAATC AGGGATACTT CTGGCGGACT ACGCAGAGCG ATGCGGCACA GGTACAAGTG ATCGTGAACA TCCTGAATAA ACAAGGGGCT CACCGCGTTG CCCTTCTTGC TGAAAACTCC GCGTACGGGG AGACTTTCTA TAACTGGATG GGGTTCTTTA CCATAGAGAA TGGCCTCGAT CTTGTCTCGA TACAGCAGTT TGACCAGAAC AGCAGCTCGC TTGACGCGGA TGTTGCAGAC GCCCTGAAGT CAGATCCCGA TTATATTATT GCAGCCTGCG ACAATCCGTC CGACGCTGCA ACGATAAAAC GGGCGATCGA CAAGTCAGGT AAACCGGCAA AGCTCTTTCT TACCGACGGG GCCGTCTCCC CGGCGCTCAT CAGTTCGCTT GGGTCTGAGG CTGAAGGGAT TGAAGGGACC GCTCCCACCG CGGATCCATC CACCAATTTC CCGGCTGCGT ATGAAGAGAA ATTCGGCCAC GCTCCCACGG GTTTTGCAGC GCCAGCGTAC GATGCCCTCC TGCTTGCCGC GTATACCTCG GCCCGGCAGG AGGCAAACCC CACTGAGTCC CTTGCCGCTT CGATACACGA CGTGGTGTCC GGCAACGGTA CTTCACCGGG CTGGGATGCA CAGGCCATCC ATGAGAATCT TCTTCTGATC GAAAGCGGGC AGACACCCGG GATCAGCGGA GCAAGCGGAC CGCTTATCTT TGACAGGGAT GTTGGCGTGG ATCCGCTGGT GACCTATTAT TCCCACTGGG TCATCAGGGA CGGGGATTTC CAGACCGTTG GCGTCTTTTC GGCAAACGCG AGCGCAAACG GTGTGTCGAT TGCCCGGAGC CGGCCAACAA TACCGCCCCC GAACCCTCCC TCCAACAGCG GGATCTCATC AGTACCCCCG GCGGCAGGGG AATATCTTCC TCTCCTCTCA AACGCGAGCA CAAACGGTGT GTCAATTTCC CAAATCTCAC CTTCGGACAA CGGGATGTCG CCAGTATCTC AGGTTACAGG GAAAAATATC CCTTTCCTCA CAAAGACCGA TTTTGAGGCA GTCATTATTG CGCCCACAAA CGGGTGGATT AATTACCGGC ACCAGGCAGA CGGGCTGACC CTGTACACCC TGCTCCGTGA TAACGGGGTC CCTGATGACC ACATCATCCT CATGTTGTAT GATGATATCC CCGCCCTCCC GGAAAATCCT ATCCCGGGAA ATGTCCACCA TGTTCCCGAG GGGTCCAATA TCCGGCTCGG TGCCAATGTG GCGTATACCG GTTCGCAGGT GACTGCTGCC ACGCTCAATA ATGTCCTGAC CGGTACAAAA ACCGATTTAA CCCCGGTGGT ACTGGACAGC AACGCGAGTA CCGATGTGTT TATCTATATT 
GTCGGCCACG GCGATCCGGG GACTATCGAC TTCTGGAACG GCAATCTCTT TACTACGGAT AATATTACCC GTATAACGGA TACGATGAGC CGGGAACAGA AATACCGGCA GCTCGTTTTC ATGGATGATA CCTGTTTTGG CGAGAGTATC GCCGCGAACC TAACGGCGCC GGGTATCATC TACCTTACGG GAGCTTCCAG TACCGAGCCC TCGTTTGCCG CGACCTACGA CATTGATATT AAGCAATGGA TCTCGGATGA ATTTACCTTA GAAGCGGTGG ACCTTATCCA GGAAAACCCG GACATTACCT TCCAGGAACT CTATACGGAA GCATACACGA ACGTGACCGG CTCGCATGTC CAGCTGATAA CGACAGGAAA TGTGAGCACG CTTCATGAAC CGGTCCTGGA ATTCTTAAAA CCATAA
|
Protein sequence | MIRIEQKKRW PKFLLIAVVI LIIAAGAVAG YTILHARTPV RIGVLLPITG GVDIKEPLEW AKDTINQQGG IGGRQVELVY MDTGTGNTTQ MAEELLNDDS VQIVIGPRNS DDVFTLAPEF IKKKKLLISP MATAGSITLA FGNQGYFWRT TQSDAAQVQV IVNILNKQGA HRVALLAENS AYGETFYNWM GFFTIENGLD LVSIQQFDQN SSSLDADVAD ALKSDPDYII AACDNPSDAA TIKRAIDKSG KPAKLFLTDG AVSPALISSL GSEAEGIEGT APTADPSTNF PAAYEEKFGH APTGFAAPAY DALLLAAYTS ARQEANPTES LAASIHDVVS GNGTSPGWDA QAIHENLLLI ESGQTPGISG ASGPLIFDRD VGVDPLVTYY SHWVIRDGDF QTVGVFSANA SANGVSIARS RPTIPPPNPP SNSGISSVPP AAGEYLPLLS NASTNGVSIS QISPSDNGMS PVSQVTGKNI PFLTKTDFEA VIIAPTNGWI NYRHQADGLT LYTLLRDNGV PDDHIILMLY DDIPALPENP IPGNVHHVPE GSNIRLGANV AYTGSQVTAA TLNNVLTGTK TDLTPVVLDS NASTDVFIYI VGHGDPGTID FWNGNLFTTD NITRITDTMS REQKYRQLVF MDDTCFGESI AANLTAPGII YLTGASSTEP SFAATYDIDI KQWISDEFTL EAVDLIQENP DITFQELYTE AYTNVTGSHV QLITTGNVST LHEPVLEFLK P
|
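Although the gene lies on the minus strand of the replicon, the gene sequence listed here appears to be given in coding orientation: its first ten codons translate directly to the first ten residues of the protein sequence. The sketch below checks this on the first 30 bases only. It assumes the amino acid assignments of NCBI translation table 11 (identical to the standard code) and that an alternative start codon such as GTG is read as methionine at position 1. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G); table 11 shares these assignments
# with the standard code and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons accepted by translation table 11
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in coding orientation, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record
prefix = clean_sequence("GTGATACGAA TCGAACAAAA AAAACGCTGG")
print(translate_cds(prefix))  # MIRIEQKKRW
```

The output matches the first block of the protein sequence, MIRIEQKKRW. The same functions can be applied to the full gene sequence to compare against all 741 residues.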