Gene Apar_0238 details


Gene Information

Locus tag: Apar_0238
Symbol:
ID: 8413086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 277832
End bp: 280603
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 44%
IMG OID: 645021806
Product: selenium-dependent molybdenum hydroxylase 1
Protein accession: YP_003179261
Protein GI: 257784044
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR03311] selenium-dependent molybdenum hydroxylase 1
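
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 277832
end_bp = 280603
gene_length_bp = 2772
protein_length_aa = 923

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # 2772 923
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.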


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.171864
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.466551
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAAG AAGTAGAAGA GTATACCTTT ACGGTAAACG GAGAAACAAT TACTACTACA 
AAAAATAAAT CGTTGCTTCG TTTTCTTCGA GATGATTTAC ATCTTTTATC GGTAAAAGAT
GGTTGTTCAC AAGGCGCTTG TGGAACTTGT ACGGTTGTTA TTGATGGTGT TGCCACGCGT
GGCTGCATTA TGAACACTAA GCGTGCACAG GGAAAAGTAA TTGAAACTGT AGAAGGCCTT
TCTCATGAGG AGCAAGAGGC TTTTGTGTAC GCTTTTGGTG CAGTTGGAGC TGTTCAGTGT
GGTTTTTGTA TCCCTGGCAT GGTAATGAGT GGCGCAGCTT TAATCCGTCG AAACCCTAAT
CCAACTGAAG CAGAAGTTAA AGAAGCAATT AAAAATAATA TTTGTCGATG CACTGGATAT
AAGAAAATTA TTGAAGGTAT CTTAAAAGCC GCACGTATTC TTCGTGGAGA AGAGCAAATT
GATCCAGACC TAGAGCGTGG CGATAATTAC GGTGTTGGTT CAAAAGCATT CCGTGTTGAT
GTTCGTGCAA AGGTACTTGG ATATGGTAAA TATCCTGACG ATGTTACCAA TGTTGATTTC
CCAGATATGG CGTATGGCTC GTGTGTACGT TCCAAGTATC CTCGTGCACG TGTGGTAAAA
ATTGATACAA CAGAGGCAGA GGCTCTACCA GGTGTTGTTG GAGTCTTAAA AGCTGAAGAC
GTACCTGTAA ATAAAGTTGG TCACATTCAG CAGGATTGGG ATGTATTTAT CCCTGAGGGT
TCAAATACAC GTTATTTGGG CGATGCACTT TGCATGGTAG TCGCAGAGGA TGAAGAGACT
CTTGAGGCTG CAAAAAAACT GGTAAAAGTT GAGTATGAAG AGCTTCCTAT TGTGCGTAAT
ATTGAAGAGG CTGCTGCTGA AGGTGCTCCT CTTGTACATA CAGAAGCAGA AAGTAATCTT
TGCCAGAGCC GTCATATTAC ACGCGGTAAT GTTCAGGATG CACTTAAAAA TTCTGCTCAC
ATTATCACTA AGCATTTCTC GACACCTTTT ACTGAGCATG CATTCCTTGA GCCTGAGTGT
GCTGTTGCAT TCCCCTATAA AGACGGTGTA AAGATTCTTT CTACCGATCA AGGCGCCTAC
GACACTCGTA AAGAAGTTGC CCACATGTTT GGATGGGATG AAACTCCTGA TAAGGTAGTT
GTAGAAACCA TGCTGGTAGG CGGCGGATTT GGTGGAAAAG AGGATGTAAC TGTCCAGCAT
ATTTCTGCAT TGGCTGCTTA CATTTTTAAG CGTACGGTTA AGTGTAAATT TACTCGTAAC
GAGTCGCTTA TCTTCCATCC AAAACGTCAT GCTATGGAGG CAGATTTTAC ACTTGGCTGC
GATGCTGAAG GTCACCTTAC TGCTCTTGAT TGCGATATTT ATTTTGATAC AGGAGCATAT
GCGTCACTTT GCGGACCAGT TCTTGAACGT GCTTGTACAC ACGCTGTTGG TCCGTATAAG
TATCAAAATA CTGATATCCG TGGTTATGGA TACTATACTG ATAATCCACC TGCTGGTGCA
TTCCGTGGCT TTGGAGTTTG CCAAACTGAG TTTGCTCTTG AGGAGCTCAT GGATCTTCTC
GCCGAGAAGG TCGGTATAAG TCCTTGGGAG ATGCGCTGGA GAAATGCAGT TGCTCCTGGA
GATGTACTTC CTAATGGTCA GATCTGTGAT CAGTCAACGG CACTTAAAGA AACACTTCTT
GCAGTTAAAG ATGTATATGA AGCCCATAAG GGACGTGCTG GTCTTGCCTG CGCTATGAAA
AACTCTGGTG TTGGCGTTGG TCTGCCTGAC GCAGGTCGTG CAAACATTCG TATTGAAGAC
GGCAAGGTTG TTGTCTACTC TGCTACTTCA GATATTGGTC AGGGTTGCAA TACAGTCTTT
TTGCAGGATG TTGCTGAAGC AATTGGCCTT CCAAAGTCAG TTATCGTTAA CGGTGAGTGC
TCTACAGAAA ATGCACCTGA TTCAGGTACT ACTTCTGGCT CTCGTCAAAC GGTTGTTACC
GGAGAGGCTG TTCGTGGCGT GGCGTTTTTG CTGCGTGATG CGCTTTTGGA TATTGAAGCC
GGCAAGGAAG TATCGTCTGA GCCTGTTGAG GCACACGGTG ATGGAAAGAC TATTGTGTAC
TCTGATGGTC GTCCTTACGA GGGTCTTGGA TACGCAGATG GTAAAGCATT GGTTGCAGGT
GCAGGTATTC ATCCAAAGGA TCCAGTTGCA GGCTTGAAGA AGCTTGAAGG TCATGAGTTC
CGTTATGTTT ACTTTGAGCC AACTGACAAG CTTGGCGCCG ATAAGCCTAA TCCAAAGAGC
CACATTTGTT ATGCCTTTGC CACAACGTGC GTGGTTTTAG ACGATGAGGG CAAGGTTACT
GATGTATATG CCGCGCATGA TTCCGGCAAG GTCATTAACC CTATTGCAAT CCAAGGTCAA
ATTGAGGGAG GCGTGTTGAT GTCGCTTGGC TATGCCACAA CAGAAAACTA CAAGCTTCAG
GATTGTGTGC CAAAGTCAAA GTTTGCTACT CTAGGCCTCT TCCATGCTCC TGATATTCCT
CATATTGAGG CAATTTATGT CGAGAAGGAG CATTTACTCC CTGTTGCTTA CGGAGGAAAA
GGCATTGGTG AGATTTCAAC AATCCCAACT GCTCCTGCAG TAGCAAATGC ATACTATGCC
TATGATCATG TGATGCGTAC TAAGCTTCCT ATGGAAGATA CGTATTACAC TAAATCTAAG
GCCAAGAAGT AA
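
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute a GC percentage; applying `gc_percent` to the full sequence would be one way to check the GC content field, assuming that field is the plain fraction of G and C bases in the gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in a cleaned DNA sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sequence.count("G") + sequence.count("C")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGGCCGAAG AAGTAGAAGA GTATACCTTT ACGGTAAACG GAGAAACAAT TACTACTACA"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Only the first line is used here to keep the example short; the same two calls work on the whole sequence pasted as a multi-line string.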
 
Protein sequence
MAEEVEEYTF TVNGETITTT KNKSLLRFLR DDLHLLSVKD GCSQGACGTC TVVIDGVATR 
GCIMNTKRAQ GKVIETVEGL SHEEQEAFVY AFGAVGAVQC GFCIPGMVMS GAALIRRNPN
PTEAEVKEAI KNNICRCTGY KKIIEGILKA ARILRGEEQI DPDLERGDNY GVGSKAFRVD
VRAKVLGYGK YPDDVTNVDF PDMAYGSCVR SKYPRARVVK IDTTEAEALP GVVGVLKAED
VPVNKVGHIQ QDWDVFIPEG SNTRYLGDAL CMVVAEDEET LEAAKKLVKV EYEELPIVRN
IEEAAAEGAP LVHTEAESNL CQSRHITRGN VQDALKNSAH IITKHFSTPF TEHAFLEPEC
AVAFPYKDGV KILSTDQGAY DTRKEVAHMF GWDETPDKVV VETMLVGGGF GGKEDVTVQH
ISALAAYIFK RTVKCKFTRN ESLIFHPKRH AMEADFTLGC DAEGHLTALD CDIYFDTGAY
ASLCGPVLER ACTHAVGPYK YQNTDIRGYG YYTDNPPAGA FRGFGVCQTE FALEELMDLL
AEKVGISPWE MRWRNAVAPG DVLPNGQICD QSTALKETLL AVKDVYEAHK GRAGLACAMK
NSGVGVGLPD AGRANIRIED GKVVVYSATS DIGQGCNTVF LQDVAEAIGL PKSVIVNGEC
STENAPDSGT TSGSRQTVVT GEAVRGVAFL LRDALLDIEA GKEVSSEPVE AHGDGKTIVY
SDGRPYEGLG YADGKALVAG AGIHPKDPVA GLKKLEGHEF RYVYFEPTDK LGADKPNPKS
HICYAFATTC VVLDDEGKVT DVYAAHDSGK VINPIAIQGQ IEGGVLMSLG YATTENYKLQ
DCVPKSKFAT LGLFHAPDIP HIEAIYVEKE HLLPVAYGGK GIGEISTIPT APAVANAYYA
YDHVMRTKLP MEDTYYTKSK AKK
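
The protein sequence can be related back to the gene sequence by codon translation. The following is a sketch, not the database's own pipeline: it builds the codon table from the compact NCBI amino acid string, which for translation table 11 has the same codon assignments as the standard code, and it ignores the alternative start codons of table 11, which do not matter here because this gene starts with ATG. The `translate` function name is an assumption for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G;
# then second base; then third base). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCCGAAG AAGTAGAAGA GTATACCTTT"))  # MAEEVEEYTF
print(translate("GCCAAGAAGT AA"))                     # AKK
```

The two outputs match the first ten and last three residues of the protein sequence listed above.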