Gene Information |
Locus tag | Mboo_1472 |
Symbol | |
ID | 5409854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1511757 |
End bp | 1513493 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640868707 |
Product | amidohydrolase 3 |
Protein accession | YP_001404633 |
Protein GI | 154151015 |
COG category | [C] Energy production and conversion |
COG ID | [COG1229] Formylmethanofuran dehydrogenase subunit A |
TIGRFAM ID | [TIGR03121] formylmethanofuran dehydrogenase subunit A |
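The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes a terminal stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1511757
END_BP = 1513493


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1737
print(protein_length(length_bp))  # 578
```

Under these assumptions the computed values agree with the Gene Length (1737 bp) and Protein Length (578 aa) fields.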
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.883375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGGA TTTCCGGTGA AATTATAATC AGGGGAGGAT TTGTTGTCGA TCCGTCCCAG AAGATCGACG GGGATGTTGC CGATATCGCA ATAAAGGACG GCAAGATCGT TGATAAGGTC AGCAGTGCAG CAAAGGTCAT CGACGCCAAA GGCAAGGTTG TCATGGCCGG CGGGGTGGAT GTTCACTCTC ACGTGGCAGG TCCCAAGGTT AACGTAGGCC GCCTGATGCG CCCCGAGGAC AAGCTCCTCT CTGGCGTATC CCGAAGCGCA ATGGCGCAGG CAAACGGTTT CCGTATGGAG TCGGGTTTCT CCATCCCCAG CGTCTTAAAG ACCGGGTACG ACTACGCCCG CATGGGCTAC GGCTTTGTCA TGGAAGCGGC AATGCCCCCG ATTCACGCCC CTCACGTACA CGAGGAGATC CACGACACTC CCATTATTGA TGAAGCGGCA CTGCCGGTCT TTGGGAACAA CTGGTTTGTC ATGGAATACC TCAAGAAAGG CGAGATCGAG AACACGGCAG CGTATATTGC CTGGCTGATC CGTGCAACAA AGGGATTCGG CATCAAGGTT GTCAACCCCG GTGGCACGGC GGCATGGGCA TGGGGGCTCA ACTGTCTCTC GCTCAGCGAC AATGTTCCCT ACTTTGACAT TACGCCCCAC GAGATCATCA CCGGCCTCAT ACAGGCAAAC GAGTACCTTG GCCTGCCCCA TTCGGTCCAC CTTCACCAGA ACGATCTGGG TAACCCCGGG AACTACAAGG TCACCCTTGA CTCCCTGCGT CTTGCCGAGG GCGTTAAGGC AAAGAACAAG TTCGGCCGCG AACAGGTCAT TCACTCGACC CACATCCAGT TCCACTCGTA CGGCGGGGAT TCCTGGGCAA ACTTCGAGAC CAGGGCAAAG GATGTCATGG ACTACGTCAA CAGGCAAAAG AACATCACCG TAGATCTTGG CTGCGTAACG CTGGATGAAA CCACGACCAT GACCGCGGAC GGTCCGTTCG AGCACCACTT AACCGGCCTC AACCACCTCA AGTGGGCAAA CACCGATGTC GAGATGGAGA CCGCAGCAGG TGTTGTTCCC TATGTGTATG ACCCTAACAT CAAGGTCTGT GATATCCAGT GGGCGATCGG CCTTGAACTG GGACTCTATG CAAAGGATCC AATGCGCTGC TTTGTCACCA CCGACCACCC GAACGCCGGG CCATTTACCC GCTACCCCCG CATCATCAAG TGGCTCATGA GCAAAAAGGC ACGGGAGGCC ACGCTTGATT CCTTTAAGCA CAAGGACAAG GTTATCGAGG CAACCGATCT GCACTCCCTT GACCGTGAGC TCACCCTGTA CGAGATCGCT GCAATGACCC GGGCTGGCCC GGCAAAGTGC CTGGGCCTCT CCAGCATCTA CGGGGGTCTT GCCCCGGGCA TGAACGCCGA TGTTGCAGTC TTTGACCTCA ATTACAAGAG CATGCCAAGC GATCCCGAAA AGATCGAGTC CGCATTCCTG CGGGCGGCCT GCTTTGTCAA GTCAGGCGAG ATCGTAGTAA AAGACGGCGA GGTGCTCAGC CACGGCCACA AGAAGACGGT CTGGGTCAAC CCGAAGATGA AAGAAAACCC CCAGGTTAAG CGCGATATCG CAGAGAGCTT CAACAAGGGA TATTACACGG TCGGGCTCAC CAATTACCCG GTCCGCGAAT ACCTTGCACC ACACCCGTTC GTGATCGATG TCGATGTGGA GGCCTAG
|
Protein sequence | MSGISGEIII RGGFVVDPSQ KIDGDVADIA IKDGKIVDKV SSAAKVIDAK GKVVMAGGVD VHSHVAGPKV NVGRLMRPED KLLSGVSRSA MAQANGFRME SGFSIPSVLK TGYDYARMGY GFVMEAAMPP IHAPHVHEEI HDTPIIDEAA LPVFGNNWFV MEYLKKGEIE NTAAYIAWLI RATKGFGIKV VNPGGTAAWA WGLNCLSLSD NVPYFDITPH EIITGLIQAN EYLGLPHSVH LHQNDLGNPG NYKVTLDSLR LAEGVKAKNK FGREQVIHST HIQFHSYGGD SWANFETRAK DVMDYVNRQK NITVDLGCVT LDETTTMTAD GPFEHHLTGL NHLKWANTDV EMETAAGVVP YVYDPNIKVC DIQWAIGLEL GLYAKDPMRC FVTTDHPNAG PFTRYPRIIK WLMSKKAREA TLDSFKHKDK VIEATDLHSL DRELTLYEIA AMTRAGPAKC LGLSSIYGGL APGMNADVAV FDLNYKSMPS DPEKIESAFL RAACFVKSGE IVVKDGEVLS HGHKKTVWVN PKMKENPQVK RDIAESFNKG YYTVGLTNYP VREYLAPHPF VIDVDVEA
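The two sequence fields can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the space-separated 10-character blocks are purely a display convention and that translation table 11 assigns the same amino acids to sense codons as the standard code (alternative start codons are not handled here). The helper names `clean`, `gc_content`, and `translate` are illustrative and not part of any database API.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGAGCGGGA TTTCCGGTGA AATTATAATC"))  # MSGISGEIII
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its display spaces are removed, and `gc_content` can be compared with the GC content field in the same way.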