Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2302 |
Symbol | |
ID | 7270563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2449329 |
End bp | 2451239 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643570907 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002467310 |
Protein GI | 219852878 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0725] ABC-type molybdate transport system, periplasmic component [COG2998] ABC-type tungstate transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.319071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.250502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAATAT CTATGTCTGA ACGAACCATC ACATGTTGCA TCGCTCTGGT TGTGCTGCTG CTGATCAGTC TCTGCATTTC AGGATGCATG ACAACCCAGC AACCGTCTGC CAATGGTACG GCAGCCAATG TGACCAGTGC ATCCAATGCC TCAACTCCGG TGAAGGAACT GCTGGTCGCG ACGACGACAA GCCTGTACGA TACCGGGCTC CTCGATTATC TTGCACCGAT CTACGAAAAA CAGTACAATA CCCACCTGAA GATCACATCA CAGGGGACTG GAAAGGCAAT AGATCTAGCC CAGCGTGGGG ATGCTGATGT GCTGCTGGTC CATTCACCTT CAGATGAGGT GACCTTCATG GAGAACGGGA ACGGTGTCAA CCGGCGCTCG TTTGCGTACA ACTACTTCGA GATCGTTGGT CCTGCCAGCG ACCCGGCACA GATCAAGGGG CTCTCGCCTG AGGACGCATT TAAGAAGCTC AAAGCGGACG GCGCAAACAA CACCGCCAAT GTCGCCTTCG TCTCCCGCGG TGACGGGTCC GGAACCCAGT CTGCAGAGAA GAAGATCTGG AAGAACGCCG GCTTCGACTA TGCCACCCAG ATCGAGAAAT CCGGGGACTG GTATGTCGAG GCAGGCAAAG GAATGGGTGA GACCCTCCAG ATGGCCGACC AGAAGCAGGC CTACACCCTG ACCGATGAGG GGACGTTCCT CGCATACAAG GGGAACCTGA CTCTCATACC GGTGATCATC AAAGGGGATA GCCTACTCAA CATCTACAGT GTGATGTCAG TGGTGCCGAA GAACAATGCG ACGGCTAGCA TTACTGCCGC CAACACCCTC GCGACGTTCC TGACCGACAA CACCACCCAG CAATTGATCG CCGACTATGG GAAGGACAAG TATGGCAAGG GGCTCTTCTC CCCGATGAAT GCAACGATGG CAAAGACGTT CAAGGTCGAC AACTCGGCCC CGCTGAATGC GACCACCCCA GCGATAGTCT TCCAAGCTGG CTCTCTGGCG ACCCCGTTCA AGGCCGTCAA GTCGATCTTT GAGAAGGACA CCACTGGGGC TGAGGTAGAA CTCTTCAGTG GCTCCTCGAT CACGATGATC GAGAAGGTGA CCAAGATGGG CGAGAAGGCT GACATCGTCG CCAGTGCTGA CGCCGACCAG ATCTCCCGTT TGATGGTCCC TGACAACGCC TCGTTCACCC TGAACTTCGC GCGGAACGCA ATGGTGCTCG TCTACACCAA CAACAGCACC AGTGCGTCCA CCATCACAAA GGACAACTGG TACTCGGTGC TGGGTCAGGA TAATGTCAAA CTCGTCACCA GTGACCCGAC CTCTGACCCA GGGGGTTACC GTGCCTACAT GACACTGGTC CTGGCCGAGT CATATTACAA GGTCCCCGAC CTCTTCAAGA AGGTCGTCAG TGATCACAGT GCGATCACAG TGAACAAGAC CGGAGCCAAC ACCACCATCG ACCTGACGAA CCCGAAGCAG GACAACAAGG GTCTGTTGAT TCCGAACGCC ACCGGCCCGA CCTACCTGGA CCTCCTTACG AGCGGTAAGG CCGACTATGC CCTGGTCTAC CGGTCGAGCG CTGTCGACGC CGGGCTCCCG TATATCGAGC TGCCAGATGA GATCTCGCTG GCAAACCTGA GCATGAGCAA AGATTACGCT TTGGTGCAGG CGAAGACCCC ATCAGGACTG ATCCCTGCCA CCCCGATCGT CTATGGACTG ACGATCCCGA CCCGGGCTGC GCATCCAGAC CTGGGTGTCG CGTTCATCAA GGCGCTCGCT TCTGACCAGG GCGCAGCAGC TCTGAACAAG AGTGGGTTGG CCCCCATGAC CCCGATGACG GCGACCGGCA CGGTGCCGGA CTCACTGAAG TCGCTGGTCA GCACCACCTG A
|
Protein sequence | MVISMSERTI TCCIALVVLL LISLCISGCM TTQQPSANGT AANVTSASNA STPVKELLVA TTTSLYDTGL LDYLAPIYEK QYNTHLKITS QGTGKAIDLA QRGDADVLLV HSPSDEVTFM ENGNGVNRRS FAYNYFEIVG PASDPAQIKG LSPEDAFKKL KADGANNTAN VAFVSRGDGS GTQSAEKKIW KNAGFDYATQ IEKSGDWYVE AGKGMGETLQ MADQKQAYTL TDEGTFLAYK GNLTLIPVII KGDSLLNIYS VMSVVPKNNA TASITAANTL ATFLTDNTTQ QLIADYGKDK YGKGLFSPMN ATMAKTFKVD NSAPLNATTP AIVFQAGSLA TPFKAVKSIF EKDTTGAEVE LFSGSSITMI EKVTKMGEKA DIVASADADQ ISRLMVPDNA SFTLNFARNA MVLVYTNNST SASTITKDNW YSVLGQDNVK LVTSDPTSDP GGYRAYMTLV LAESYYKVPD LFKKVVSDHS AITVNKTGAN TTIDLTNPKQ DNKGLLIPNA TGPTYLDLLT SGKADYALVY RSSAVDAGLP YIELPDEISL ANLSMSKDYA LVQAKTPSGL IPATPIVYGL TIPTRAAHPD LGVAFIKALA SDQGAAALNK SGLAPMTPMT ATGTVPDSLK SLVSTT
|
| |