Gene Information |
Locus tag | Apar_0238 |
Symbol | |
ID | 8413086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 277832 |
End bp | 280603 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645021806 |
Product | selenium-dependent molybdenum hydroxylase 1 |
Protein accession | YP_003179261 |
Protein GI | 257784044 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR03311] selenium-dependent molybdenum hydroxylase 1 |
|
|
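The coordinate and length fields above agree with one another. A minimal sketch of the arithmetic follows, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (neither assumption is stated in the record itself):

```python
# Values copied from the Gene Information table.
start_bp = 277832
end_bp = 280603

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 2772 bp
print(protein_length, "aa")  # 923 aa
```

Under these assumptions the result matches the reported Gene Length (2772 bp) and Protein Length (923 aa).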
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.171864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.466551 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAG AAGTAGAAGA GTATACCTTT ACGGTAAACG GAGAAACAAT TACTACTACA AAAAATAAAT CGTTGCTTCG TTTTCTTCGA GATGATTTAC ATCTTTTATC GGTAAAAGAT GGTTGTTCAC AAGGCGCTTG TGGAACTTGT ACGGTTGTTA TTGATGGTGT TGCCACGCGT GGCTGCATTA TGAACACTAA GCGTGCACAG GGAAAAGTAA TTGAAACTGT AGAAGGCCTT TCTCATGAGG AGCAAGAGGC TTTTGTGTAC GCTTTTGGTG CAGTTGGAGC TGTTCAGTGT GGTTTTTGTA TCCCTGGCAT GGTAATGAGT GGCGCAGCTT TAATCCGTCG AAACCCTAAT CCAACTGAAG CAGAAGTTAA AGAAGCAATT AAAAATAATA TTTGTCGATG CACTGGATAT AAGAAAATTA TTGAAGGTAT CTTAAAAGCC GCACGTATTC TTCGTGGAGA AGAGCAAATT GATCCAGACC TAGAGCGTGG CGATAATTAC GGTGTTGGTT CAAAAGCATT CCGTGTTGAT GTTCGTGCAA AGGTACTTGG ATATGGTAAA TATCCTGACG ATGTTACCAA TGTTGATTTC CCAGATATGG CGTATGGCTC GTGTGTACGT TCCAAGTATC CTCGTGCACG TGTGGTAAAA ATTGATACAA CAGAGGCAGA GGCTCTACCA GGTGTTGTTG GAGTCTTAAA AGCTGAAGAC GTACCTGTAA ATAAAGTTGG TCACATTCAG CAGGATTGGG ATGTATTTAT CCCTGAGGGT TCAAATACAC GTTATTTGGG CGATGCACTT TGCATGGTAG TCGCAGAGGA TGAAGAGACT CTTGAGGCTG CAAAAAAACT GGTAAAAGTT GAGTATGAAG AGCTTCCTAT TGTGCGTAAT ATTGAAGAGG CTGCTGCTGA AGGTGCTCCT CTTGTACATA CAGAAGCAGA AAGTAATCTT TGCCAGAGCC GTCATATTAC ACGCGGTAAT GTTCAGGATG CACTTAAAAA TTCTGCTCAC ATTATCACTA AGCATTTCTC GACACCTTTT ACTGAGCATG CATTCCTTGA GCCTGAGTGT GCTGTTGCAT TCCCCTATAA AGACGGTGTA AAGATTCTTT CTACCGATCA AGGCGCCTAC GACACTCGTA AAGAAGTTGC CCACATGTTT GGATGGGATG AAACTCCTGA TAAGGTAGTT GTAGAAACCA TGCTGGTAGG CGGCGGATTT GGTGGAAAAG AGGATGTAAC TGTCCAGCAT ATTTCTGCAT TGGCTGCTTA CATTTTTAAG CGTACGGTTA AGTGTAAATT TACTCGTAAC GAGTCGCTTA TCTTCCATCC AAAACGTCAT GCTATGGAGG CAGATTTTAC ACTTGGCTGC GATGCTGAAG GTCACCTTAC TGCTCTTGAT TGCGATATTT ATTTTGATAC AGGAGCATAT GCGTCACTTT GCGGACCAGT TCTTGAACGT GCTTGTACAC ACGCTGTTGG TCCGTATAAG TATCAAAATA CTGATATCCG TGGTTATGGA TACTATACTG ATAATCCACC TGCTGGTGCA TTCCGTGGCT TTGGAGTTTG CCAAACTGAG TTTGCTCTTG AGGAGCTCAT GGATCTTCTC GCCGAGAAGG TCGGTATAAG TCCTTGGGAG ATGCGCTGGA GAAATGCAGT TGCTCCTGGA GATGTACTTC CTAATGGTCA GATCTGTGAT CAGTCAACGG CACTTAAAGA AACACTTCTT GCAGTTAAAG ATGTATATGA AGCCCATAAG GGACGTGCTG GTCTTGCCTG CGCTATGAAA 
AACTCTGGTG TTGGCGTTGG TCTGCCTGAC GCAGGTCGTG CAAACATTCG TATTGAAGAC GGCAAGGTTG TTGTCTACTC TGCTACTTCA GATATTGGTC AGGGTTGCAA TACAGTCTTT TTGCAGGATG TTGCTGAAGC AATTGGCCTT CCAAAGTCAG TTATCGTTAA CGGTGAGTGC TCTACAGAAA ATGCACCTGA TTCAGGTACT ACTTCTGGCT CTCGTCAAAC GGTTGTTACC GGAGAGGCTG TTCGTGGCGT GGCGTTTTTG CTGCGTGATG CGCTTTTGGA TATTGAAGCC GGCAAGGAAG TATCGTCTGA GCCTGTTGAG GCACACGGTG ATGGAAAGAC TATTGTGTAC TCTGATGGTC GTCCTTACGA GGGTCTTGGA TACGCAGATG GTAAAGCATT GGTTGCAGGT GCAGGTATTC ATCCAAAGGA TCCAGTTGCA GGCTTGAAGA AGCTTGAAGG TCATGAGTTC CGTTATGTTT ACTTTGAGCC AACTGACAAG CTTGGCGCCG ATAAGCCTAA TCCAAAGAGC CACATTTGTT ATGCCTTTGC CACAACGTGC GTGGTTTTAG ACGATGAGGG CAAGGTTACT GATGTATATG CCGCGCATGA TTCCGGCAAG GTCATTAACC CTATTGCAAT CCAAGGTCAA ATTGAGGGAG GCGTGTTGAT GTCGCTTGGC TATGCCACAA CAGAAAACTA CAAGCTTCAG GATTGTGTGC CAAAGTCAAA GTTTGCTACT CTAGGCCTCT TCCATGCTCC TGATATTCCT CATATTGAGG CAATTTATGT CGAGAAGGAG CATTTACTCC CTGTTGCTTA CGGAGGAAAA GGCATTGGTG AGATTTCAAC AATCCCAACT GCTCCTGCAG TAGCAAATGC ATACTATGCC TATGATCATG TGATGCGTAC TAAGCTTCCT ATGGAAGATA CGTATTACAC TAAATCTAAG GCCAAGAAGT AA
|
Protein sequence | MAEEVEEYTF TVNGETITTT KNKSLLRFLR DDLHLLSVKD GCSQGACGTC TVVIDGVATR GCIMNTKRAQ GKVIETVEGL SHEEQEAFVY AFGAVGAVQC GFCIPGMVMS GAALIRRNPN PTEAEVKEAI KNNICRCTGY KKIIEGILKA ARILRGEEQI DPDLERGDNY GVGSKAFRVD VRAKVLGYGK YPDDVTNVDF PDMAYGSCVR SKYPRARVVK IDTTEAEALP GVVGVLKAED VPVNKVGHIQ QDWDVFIPEG SNTRYLGDAL CMVVAEDEET LEAAKKLVKV EYEELPIVRN IEEAAAEGAP LVHTEAESNL CQSRHITRGN VQDALKNSAH IITKHFSTPF TEHAFLEPEC AVAFPYKDGV KILSTDQGAY DTRKEVAHMF GWDETPDKVV VETMLVGGGF GGKEDVTVQH ISALAAYIFK RTVKCKFTRN ESLIFHPKRH AMEADFTLGC DAEGHLTALD CDIYFDTGAY ASLCGPVLER ACTHAVGPYK YQNTDIRGYG YYTDNPPAGA FRGFGVCQTE FALEELMDLL AEKVGISPWE MRWRNAVAPG DVLPNGQICD QSTALKETLL AVKDVYEAHK GRAGLACAMK NSGVGVGLPD AGRANIRIED GKVVVYSATS DIGQGCNTVF LQDVAEAIGL PKSVIVNGEC STENAPDSGT TSGSRQTVVT GEAVRGVAFL LRDALLDIEA GKEVSSEPVE AHGDGKTIVY SDGRPYEGLG YADGKALVAG AGIHPKDPVA GLKKLEGHEF RYVYFEPTDK LGADKPNPKS HICYAFATTC VVLDDEGKVT DVYAAHDSGK VINPIAIQGQ IEGGVLMSLG YATTENYKLQ DCVPKSKFAT LGLFHAPDIP HIEAIYVEKE HLLPVAYGGK GIGEISTIPT APAVANAYYA YDHVMRTKLP MEDTYYTKSK AKK
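The sequences above are displayed in space-separated blocks of 10 characters, so the blocks need to be joined before any computation. As an illustration, the sketch below computes GC content for a nucleotide string; `gc_content` is a hypothetical helper, not part of the source database, and the record does not say how its own "GC content" value was calculated.

```python
def gc_content(sequence: str) -> float:
    """Return the share of G and C characters as a percentage of sequence length."""
    # Remove the spaces and line breaks used only for display.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# The first two 10-base blocks of the gene sequence, used as a small example.
example = "ATGGCCGAAG AAGTAGAAGA"
print(f"{gc_content(example):.1f}%")
```

Passing the full Gene sequence field to the same function would give a figure to compare with the 44% reported in the Gene Information table.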