Gene Apar_0237 details


Gene Information

Locus tag: Apar_0237
Symbol:
ID: 8413085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 274659
End bp: 277652
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 45%
IMG OID: 645021805
Product: putative selenate reductase subunit YgfK
Protein accession: YP_003179260
Protein GI: 257784043
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR03315] putative selenate reductase, YgfK subunit
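
The start, end, gene length and protein length values listed above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both are assumptions that happen to match the listed numbers, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Apar_0237.
length = gene_length(274659, 277652)
print(length, protein_length(length))  # prints: 2994 997
```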


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA TTATGCGTCC CATGAAATTC GACCATCTGA TGAATTGGAT TTTAGATGAG 
TACGAAAATC AGAAGACTAT CTTTGGAATT CATGCATTTG CCAAGACAGC TGGAGCTGCT
CGTCCTATTT TTAACGAGAA AATTGAGACC CCATTTGGTC CTGCAGCGGG CCCCAATACA
CAGCTTGCAC AGAATATCGT AGCATCATAT GTTACTGGTG CTCGTTTCTT TGAGCTTAAA
ACAGTTCAGA AAATGGATGG AGAAGAGCTT TCAGCTTGCG TTAATAAGCC TTGTATTTTG
GCATCCGATG AGGGATACAA CTGCGAGTGG TCTACGGAGT TAACCGTACC TCAGGCATTT
GATGAGTATG TCAAGGCATG GGTTATCTGC CATATTCTTT CTCGAGAGCT TGGCCTGGGA
GATGCTGACG GTTTTGTCTT TAATATGTCC GTTGGATATG ACCTTGAAGG AATTAAAACT
CCTAAGGTAG ATAAATATAT AAATGACATG AAGGATGCCT CTGAGACTCC TGTATTTAAG
GAGGCAATTG CATGGGCAAA AGCTAACATT AATCGCTTCC ACAATGTTGA CGAGGCATTT
ATTGATTCCA TTCCTTCTCG TATTTCAGAT TCTATTACAG AGTCAACTCT GCATGGTTGT
CCTCCAGATG AGATTGAGCG AATTGCTTCC TATCTCATTA CCGAGAAGCA CCTCAATACC
TTTATTAAGT GCAATCCAAC ACTTTTGGGA TATGAGTATG CACGTAAAAC GCTGGATTCT
TTAGGATTTG ACTACATTGC CTTTGATGAT CATCACTTCC TAGAAGATTT GCAGTGGGCT
GATGCAGTTC CTATGCTTGA GCGTCTTATG AAACTGTGTG CAGAAAAGGG TGTTGAGTTT
GGTGTCAAAC TTACCAATAC CTTCCCAGTA GATGTTACTC GTGGCGAGTT GCCAAGCGAA
GAAATGTATA TGTCTGGTCG TTCTCTTTGG ACCTTGTCAC TTTCACTTGC TAAGCGTCTT
TCTGAGCAGT TTGATGGTAA GCTTCGTATT TCGTACTCTG GCGGCGCAGA CTATAACAAC
ATTAAAGATC TTGTTGATAC AGGTATTTGG CCTGTTACTA TGGCAACTAC CATTTTGAAG
CCTGGCGGCT ATGAGCGCAT GACGCAGATT GCTGGACTCT TTGCAGATGA AAATACGGAA
GCATTTTCTG GCGTTGATGT GGCAAAGGTA TCTGCCTTGC TTGAAAGTTC GCTCAAAAAT
GCACGTTATC ACAAAGAGAT TAAGCCACAA CCTGATCGTC ACGTACGCGG GGCGCTTCCG
CTGACTGACT GCTTTATTGC GCCATGTCGT GATTCTTGTC CAATTCATCA GGACATTCCA
GGATATCTTA AGGCAGTAGA TGAAGGCCGT TATGCTGACG CCTTACACAT CATTCTTGAG
CGCAATGCCC TTCCTTTTAT TACCGGTACT ATTTGTCCTC ATACCTGCAC AAATTCCTGC
ATGCGTAATT ACGTTGATGA GCATGTTCAT ATTCGTAGCT GCAAACTTAC CGCCGCAGAG
AACGGCCTTA TGGAAGTCCT TCCAACACTT GCCTCTCGTG GTGTGGTTAA GGATAAGAAG
GTTGCTATTA TTGGCGGTGG TCCCGCTGGT CTTTCGGCGG CATCTTTCTT ATCACGCGCA
GGTATTGAAG TAGTTGTCTT TGAGCGTACT AATAAACTGG GCGGTATTGT TCGTCACGTT
ATTCCTGGTT TCCGTATTTC TGAGGAAGCT ATCGATAACG ATGTCAAGCT GTGCCAGGCA
TATGGTGCAA CCTTTAAGAC AGGCGTTGAG GTTACAGATG TAAATACTCT GCTTTCTGAG
GGCTATACCG ACGTTGTAGT AGCAATTGGT GCATGGGCTC CTGGTCGTAA GACCTTGAAG
TCCGGCGAGG CTCTGGATGC TCTTGAGTTC CTCGAGGAGT TTAAGCGTGC ACCTGAGTCC
GTTAACCTTG GCACTGACGT AGTTGTTATC GGTGCCGGTA ATACTGCTAT GGATGTGGCT
CGTGCAGCAA AGCGTGTTGC TGGTGTACAA AATGTACGTC TGGTATATCG TCGTACCAAG
CGCTATATGC CTGCTGATGA GGAAGAGCTC CAGATGGCCA TTGATGATGG CGTGGAGTTC
ATGGAGCTTC TCGCCCCAGG AGATTTGTCC AATAATCATC TTACCTGTGA GGTAATGAAA
CTTGGCGCTC CTGATGAGTC TGGTCGTCGT CGTCCCGAGG GTACGGGCGA GTTTGTGACG
GTTCCTGCAA CCGCTGTTAT TACAGCAGTT GGTGAGCAGA TTGAGTCTAA TCTTTATACC
ACCTCTGGAA TTGCCCTTGA CGAGAAGGGC CGTCCAGTTG TGGACAATAA TCTTCAGACT
TCTCTTGCGC ACGTCTATGC AGTTGGTGAT GGACGTCGTG GTCCAGCAAC TGTTGTCAAG
GGTATTGCAG ACGCTATGAC CGTAGCCGTT GCCATTGCAG ATTTCAATTT CTCAAGCAAG
GAACTTTCCA ACCTTGAGGA AGACGTTGCT AAGATTTTTG GTAAGCGTGG ACAGCTTTGC
GGTGATACTG ATGCGTGCGA GCAGTCACGT TGTCTTTCTT GTCCTTCGGT ATGTGAAGCA
TGCGTTGAGG TATGTCCGAA CCGTGCTAAT GTTGCTATTA AGGTTCCTGG TGTTCAACAG
CAGCAGATTA TTCACGTTGA TGGTATGTGT AATGAGTGCG GTAACTGTGC CGTGTTCTGC
CCATACTCTG AAGGTCGTCC TTATAAAGAT AAGTTCACTG CATTTTGGAG TCGTGAAGAC
TTTGACAACT CTGAGAATGA AGGCTTCCTT CCAACTGCAG ACGGTTTCTT AGTACGTCTT
GATGGCAGCA CAGCAATTTA TAACGTTGAC GATGAAGCTT GTGGTCTTCC AGAAAATATC
CGCAAGATGA TTGTCACGGT AAGAGATTCT TATAGGTATC TTTTAAATGC ATAG
 
Protein sequence
MSDIMRPMKF DHLMNWILDE YENQKTIFGI HAFAKTAGAA RPIFNEKIET PFGPAAGPNT 
QLAQNIVASY VTGARFFELK TVQKMDGEEL SACVNKPCIL ASDEGYNCEW STELTVPQAF
DEYVKAWVIC HILSRELGLG DADGFVFNMS VGYDLEGIKT PKVDKYINDM KDASETPVFK
EAIAWAKANI NRFHNVDEAF IDSIPSRISD SITESTLHGC PPDEIERIAS YLITEKHLNT
FIKCNPTLLG YEYARKTLDS LGFDYIAFDD HHFLEDLQWA DAVPMLERLM KLCAEKGVEF
GVKLTNTFPV DVTRGELPSE EMYMSGRSLW TLSLSLAKRL SEQFDGKLRI SYSGGADYNN
IKDLVDTGIW PVTMATTILK PGGYERMTQI AGLFADENTE AFSGVDVAKV SALLESSLKN
ARYHKEIKPQ PDRHVRGALP LTDCFIAPCR DSCPIHQDIP GYLKAVDEGR YADALHIILE
RNALPFITGT ICPHTCTNSC MRNYVDEHVH IRSCKLTAAE NGLMEVLPTL ASRGVVKDKK
VAIIGGGPAG LSAASFLSRA GIEVVVFERT NKLGGIVRHV IPGFRISEEA IDNDVKLCQA
YGATFKTGVE VTDVNTLLSE GYTDVVVAIG AWAPGRKTLK SGEALDALEF LEEFKRAPES
VNLGTDVVVI GAGNTAMDVA RAAKRVAGVQ NVRLVYRRTK RYMPADEEEL QMAIDDGVEF
MELLAPGDLS NNHLTCEVMK LGAPDESGRR RPEGTGEFVT VPATAVITAV GEQIESNLYT
TSGIALDEKG RPVVDNNLQT SLAHVYAVGD GRRGPATVVK GIADAMTVAV AIADFNFSSK
ELSNLEEDVA KIFGKRGQLC GDTDACEQSR CLSCPSVCEA CVEVCPNRAN VAIKVPGVQQ
QQIIHVDGMC NECGNCAVFC PYSEGRPYKD KFTAFWSRED FDNSENEGFL PTADGFLVRL
DGSTAIYNVD DEACGLPENI RKMIVTVRDS YRYLLNA
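
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA into protein. It is an illustration, not the method used to build this record. The codon table is the standard genetic code written in the usual TCAG order. To my knowledge translation table 11 assigns the same amino acids and differs only in which start codons it permits, so it is used here as a stand-in. All function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCGACA TTATGCGTCC CATGAAATTC GACCATCTGA TGAATTGGAT TTTAGATGAG"
dna = clean(first_line)
print(len(dna), translate(dna))  # prints: 60 MSDIMRPMKFDHLMNWILDE
```

The translated residues match the first two blocks of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the GC content field.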