Gene Information |
Locus tag | Amuc_1214 |
Symbol | |
ID | 6273754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1451048 |
End bp | 1453939 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613265 |
Product | molybdopterin oxidoreductase, iron-sulfur binding subunit |
Protein accession | YP_001877820 |
Protein GI | 187735708 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
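The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1451048, 1453939))  # (2892, 963)
```

The result agrees with the Gene Length (2892 bp) and Protein Length (963 aa) fields.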
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.639454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGGC GTTCCTTCAT GAAATGGATG GGTGCAGGGG CAGCCCTGGC AGGGATAGGG CTGCCGGCCT GCCGCCGTGT GGAAAAATAC CTGGTTCCCT ATAACGAAGG GCCTGAATGG TCTGTTCCCG GCGTGGAAAC GGCTTACGCT ACCTGCCTGA CCATGGGAGG AAACGCGCAA CCTGTACTGG CCGCCTGTTA TGAAGGACGC CCCGTCAAAT TGATTCCCTC CCTGCAATAC CCGGAAGGTC CGGGGTTGCC GGCTGCGGCT CAGGCCTCCA TTCTGGACCT TTACGACCCC GGCCGCAGCA AGCATATCCT GTTCAACGGC AATCCGGCCT CCGAAGACGA ATTCCGGGGA GCCTTTTCTT CCTGGTCCCG CAATCTTCGG GATGGTGGCG GCATCGGCCT TCTCCTTCCC CCTTCGGCTT CCCCTCTGCT CCATTCCATG CTGGAGGAAA TCACCCGGAA AAATTCACTC GTGCGCATCT ACCGGTATTC CCCCGTGCCT GAACCGGGTT CCGGCATGCA GGCGGGGCTG CCGGAAGAAG TACGGTTCCG CGTGCGTTTT GCGCGCGCCA GGCGCATCCT CTCCCTGGAT TGCGATTTTC TTCATCAAAA CCCCTATGGA AACACGCGCG ACTTCATTGC GTCCCGCTCC CCGGAAGGGC TTTATTACAA AGAAGAAAAC AGGAACCGAA CACGCCTTTA CGCCGTGGAG GGACGCGTTT CCCTGACCGG GGCCCATGCC GACCACAGGC TACCCGTTCC TCCCGCACGC CTGGCGTTTT TCCTGGAGGA ATTGTTCCGC TATCTTTCCA GCAAAAAAAA CTCCGGACAA ACGCTTCCCC CTCCCCGGCG AACGGACCAT CCCCTGACGG AAAGGGAACT CAACTGGCTT CGCCACTGTG CGGATGACTT ATTCAGCCAC TCCGGAGAAA GCGTGATCCT CCTGGGGGAC AATCATCCGG AATTATCCGG CATCGTCTGG AAGCTCAACT GTCTTCTCGG CTCCATGGGG ACATGCATCC AGTTGCTAAA GGCACCGCGC CCAACTCCCT ATGGCACGCT GGACGATCTT GTTCGGGACA TCCGGGGGAA AAAGGTGGAA ATCGTTTTTT TACTGGATGC GGGAAATCCC GTGCTGGATT CCGGGCACAG TTCCGGACTG TCGGAAGCCC TGAACAGGGT GGAAAGCATC CACCTTGGAA TGTATGAAGA TGAAACCTCC CGCGTCTGCC GCTGGCACCT GCCGGCAGCC CACTTTCTGG AATCATGGGG GGTGGAGCGG GATTACCGGG GCAGATTTTG CTATCGCCAG CCCGTCATTC TTCCCCTGTA CGGGGGCATT TCTCCGGAAG AAGTGCTGTC CGGCCTGCTT TCCTCCAAAG GCCATCTCTC CACGGCGGAC AATTCCCCCA CCCACCTCTC TCCCGTTTAC CACAGAGCGC GCAAATGCTT TGAACGCGCC GTGAATCCGG AAAATAAAAC TGCGGCCTGG GCTCAGGCGC TTCAGCGCGG CTATTCGGAA GAAACGGCCT ATGCTCCCCT GTCCCCGCAG GAAGAAACGG CTCTTGGAAC AGCCATGATG CAAACGCCCG CAGCGCCTTT TGGCGGCCAC GGGAAAAAAT TGGGAACCGG GATGCTGGAA TTGCAATTCC GCGCGGACTA TTCCATTGGG GACGGACGCT GGAAACGCAA TGCATGGATG CAGGAATGCC CGGACCCAAT AACAGGCGTC AGCTGGGCGG CGTCCGCACA GGTTTCCCCC GCCACCTTTC TGCGGCTGGG AGGTTCGGAT 
TCCGGGCCCA TGCACTGCAC GCTTACCGCC CCCGCCACGC AAATGGAAGT CATCCTGTGC CCCATTCCGG GAGTGGCGGA TAATCTCATT ATTCTTCCTC TGGGTTATGG CGGGATCAAC CACGTAGCGG AAAGGCAGGA AAATTCCGGC GGATATGAAC TCCGCAGACA GGTGGAAAAA ACGGAAGCAT ACGGCATTGC CCCGGAACAG ATTGCCTTGG CTGCCCTCCG GGAGCGGACG GAAGCCATTC AAAGCCCCGT TGTGCAGCCT GCCCCGTTCT CCCTGCATCC CGGTCCGTCC CGCGTCCCGC CTCCCCCTCC CGGCGCGGAT GCCGTTTACC AGTGGAAAAT GGCCATTGAC ACGTCCCGTT GCATCGGCTG TAACGCCTGC ATGATTGCTT GCCGCGCGGA AAATAACATT CCCGTAGTGG GGCGTGACCA GATGGCCAGG GGCCGCGCCC TGGACTGGAT ACGCATCGAC AGGTACTTCA CGGAGGAAGG GACGCTCACC TCCATCCCCG TGGCCTGCCA GCAGTGCGGC AAGGCCCCAT GCGAATCCGT ATGCCCCGTC AATGCCACCG TGCATACTGC GGAAGGCCTG AACGCCATGG TGTACGCCCG GTGCTGGGGT ACCCGGTACT GCGCCACCAA CTGCCCGTAC AAGGCTCGGC GCTTCAATTT TTTTGATTAC GCCAAAGCAT CGGAACAGGC CACGCGCCTT CAGCGCAATC CCAATGTAAC CGTTCGCTCC CGCGGCGTCA TGGAAAAATG CACCTACTGC GTCCAAATGG TGGAACGTGC CAAAATCCGG CACAAATCCC GGTTGATGAA GGAGCATCCG GGCCAGCCAT CCACCTCCAT CCATGTGACG GCCCAGGACA TGCTGCTTCC GGACGGGGCG GCTCAAACGG CCTGCCAGCT CGCATGCCCG ATGGGAGCCA TCACCTTCGG CAATGTGCTG GACCCCGCCG CAGCCGTTTT CCGCGCCAAG TCCCTGCCGC GCCACCAGGA TCTCCTCTCC TGCCTGGGCA CGTCTCCCGG AACGGGCTAC CTGGTTCCGG CAAGAAACCC CAATCCGGCC ATGGAGAAAT AA
|
Protein sequence | MTRRSFMKWM GAGAALAGIG LPACRRVEKY LVPYNEGPEW SVPGVETAYA TCLTMGGNAQ PVLAACYEGR PVKLIPSLQY PEGPGLPAAA QASILDLYDP GRSKHILFNG NPASEDEFRG AFSSWSRNLR DGGGIGLLLP PSASPLLHSM LEEITRKNSL VRIYRYSPVP EPGSGMQAGL PEEVRFRVRF ARARRILSLD CDFLHQNPYG NTRDFIASRS PEGLYYKEEN RNRTRLYAVE GRVSLTGAHA DHRLPVPPAR LAFFLEELFR YLSSKKNSGQ TLPPPRRTDH PLTERELNWL RHCADDLFSH SGESVILLGD NHPELSGIVW KLNCLLGSMG TCIQLLKAPR PTPYGTLDDL VRDIRGKKVE IVFLLDAGNP VLDSGHSSGL SEALNRVESI HLGMYEDETS RVCRWHLPAA HFLESWGVER DYRGRFCYRQ PVILPLYGGI SPEEVLSGLL SSKGHLSTAD NSPTHLSPVY HRARKCFERA VNPENKTAAW AQALQRGYSE ETAYAPLSPQ EETALGTAMM QTPAAPFGGH GKKLGTGMLE LQFRADYSIG DGRWKRNAWM QECPDPITGV SWAASAQVSP ATFLRLGGSD SGPMHCTLTA PATQMEVILC PIPGVADNLI ILPLGYGGIN HVAERQENSG GYELRRQVEK TEAYGIAPEQ IALAALRERT EAIQSPVVQP APFSLHPGPS RVPPPPPGAD AVYQWKMAID TSRCIGCNAC MIACRAENNI PVVGRDQMAR GRALDWIRID RYFTEEGTLT SIPVACQQCG KAPCESVCPV NATVHTAEGL NAMVYARCWG TRYCATNCPY KARRFNFFDY AKASEQATRL QRNPNVTVRS RGVMEKCTYC VQMVERAKIR HKSRLMKEHP GQPSTSIHVT AQDMLLPDGA AQTACQLACP MGAITFGNVL DPAAAVFRAK SLPRHQDLLS CLGTSPGTGY LVPARNPNPA MEK
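The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the pipeline that produced this record: the helper names `translate` and `gc_fraction` are hypothetical, the gene sequence is taken to be shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same
# elongation assignments as NCBI translation table 11). '*' is a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def _clean(dna):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a coding sequence up to (not including) the first stop codon.

    Raises KeyError for codons with ambiguous bases such as N.
    """
    dna = _clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = _clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACGCGGC GTTCCTTCAT GAAATGGATG"))  # MTRRSFMKWM
print(translate("ATGGAGAAAT AA"))                     # MEK
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.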