Gene Apar_0497 details

Gene Information

Locus tag: Apar_0497
Symbol:
ID: 8413346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 566845
End bp: 568323
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 46%
IMG OID: 645022065
Product: metal dependent phosphohydrolase
Protein accession: YP_003179519
Protein GI: 257784302
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
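
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(566845, 568323)
print(length)                           # 1479
print(expected_protein_length(length))  # 492
```

Both results agree with the Gene Length (1479 bp) and Protein Length (492 aa) fields of this record.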


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTGTG AAGGATGTAT ACTTTTCTCA TTAGTAGTAG CGGAAGGAAT GGTCGTGGGC 
ATAGAGAAAA ACAGCTCTTC ACCATGTCTT TCTCGTCCTG TGGATTTACA AGTTCCCACG
TATGCTTTAC GCGTATTAGA AGTTCTCGAA AATGCTGGGT TTGAGGCATG GATTGTTGGC
GGCTGGGTAC GAGACGCCCT GCGGGGTTCA TTTGCTCATG ATATTGACAT CACAACTTTA
GCTACTTGGC AGCAGAGTAA GGCGGCCTTT GTAGCTGCAG GTATTCCTGT ACATGAGACG
GGTATTGCAT ACGGAACTGT TACTGCTGTT GTTGAGCAAC ATCCCATTGA AGTTACAACG
TATCGCTGTG ACGGAGAATA TCTGGATGGT CGTCGTCCTG ATTCTGTGCA GTTTGTCTCT
TGTATAGAGA AAGATCTTGC TCGTAGAGAC TTCACCATAA ACGCAATGGC ATATCATCCT
AAGAGGGGAC TGCTAGATCT TTATGGCGGT CAAGAAGATT TATCTGCACA TGTGATTCGC
TCTGTTGGGG AGCCAAAAGC TCGCTTCACT GAGGATGCAT TGCGTATGCT ACGTGCATTA
AGGTTTGCGT GCAGATTCTC GTTTTCAGTT GAGGAGAAAA CACATCAAGC GCTCATTGAA
TGTGCGCCGT TGCTCTCACA AGTTGCTTCT GAGCGTATTG GCTTAGAAGT GGCTCAAATT
GTTGAAGGGG GTCATATTGC TCATGCTATC AAGTTGGGTT TCTCTGTTTT AGCAGTAGCA
ATTCCAGAAC TCTTACTGCT ACAAAACTTT GATCAAAGGA GCCCTTATCA CGCCTATGAT
GTATTGAAGC ACACTGCTCG GGTGTGCTCT GCAACAGAGG CATTTACGGT AGGTTGTGCT
ACTCCGATAC TTAGGTGGGC TGCGCTTCTA CACGATATTG CAAAGCCGGA AATGTTTAGC
GTTGATGCAG ATGGTAGAGG TCATTTTTAT GGTCATCCAG AAAAGGGTGC AGATGTAGCA
AAAGTCCTGC TTAAACGCTT AGCTTTACCG CAGCGCTTGA TTAATGAGAT TTGTGCCCTT
GTGGCTCTGC ATGACTATGA CGTGGATGTT ACAACGGCGT CGCTTAGGCA TATGGTTGCC
CTGTTAGCCG AGGTAAGTAA ATCAGACGGC ATCACCCTTG CGTATGATCT TTTGACGTTA
AAGCAGGCAG ATGCCTTGGC AAAAGCGGTT CCATATCGCG GATATGCTGT TGCTTTAGAG
CGTATGTTTA GCTTATTGAA GAACGAAGCT AAGAAAGGTA TCGCTCAAAG ACCTCAGGAC
TTGTGTATTT CGGGAGCCGA TATTATGCAG GCACTCTCAA TTACTCCTGG TCCTATTGTT
GGTGCATATC AACATAAGCT TTTTGAGGCA TATCTTATTG GAACGGTAGA AAATAGGCGA
GAAGAGCTGC TTGCCCTTCT CGCCCAATTA GCCAAATAA
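
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. As a minimal sketch, the helpers below (hypothetical names) strip the whitespace and compute a GC percentage, which is one plausible way to obtain a figure like the GC content field of this record. Only the first line of the sequence is used here; paste all lines to process the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (illustration, not the whole gene).
first_line = "ATGACGTGTG AAGGATGTAT ACTTTTCTCA TTAGTAGTAG CGGAAGGAAT GGTCGTGGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```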
 
Protein sequence
MTCEGCILFS LVVAEGMVVG IEKNSSSPCL SRPVDLQVPT YALRVLEVLE NAGFEAWIVG 
GWVRDALRGS FAHDIDITTL ATWQQSKAAF VAAGIPVHET GIAYGTVTAV VEQHPIEVTT
YRCDGEYLDG RRPDSVQFVS CIEKDLARRD FTINAMAYHP KRGLLDLYGG QEDLSAHVIR
SVGEPKARFT EDALRMLRAL RFACRFSFSV EEKTHQALIE CAPLLSQVAS ERIGLEVAQI
VEGGHIAHAI KLGFSVLAVA IPELLLLQNF DQRSPYHAYD VLKHTARVCS ATEAFTVGCA
TPILRWAALL HDIAKPEMFS VDADGRGHFY GHPEKGADVA KVLLKRLALP QRLINEICAL
VALHDYDVDV TTASLRHMVA LLAEVSKSDG ITLAYDLLTL KQADALAKAV PYRGYAVALE
RMFSLLKNEA KKGIAQRPQD LCISGADIMQ ALSITPGPIV GAYQHKLFEA YLIGTVENRR
EELLALLAQL AK
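
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a standard-library-only illustration, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACGTGTG AAGGATGTAT ACTTTTCTCA"))  # MTCEGCILFS
```

The final nine bases of the gene, GCCAAATAA, translate to AK followed by a stop codon, which matches the end of the protein sequence.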