Gene Apar_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0149 
Symbol 
ID8412995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp169975 
End bp174729 
Gene Length4755 bp 
Protein Length1584 aa 
Translation table11 
GC content51% 
IMG OID645021719 
ProductCoA-substrate-specific enzyme activase 
Protein accessionYP_003179176 
Protein GI257783959 
COG category[S] Function unknown 
COG ID[COG3581] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00241] CoA-substrate-specific enzyme activase, putative 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGTC CAAAGTTTGA GCTGGTAGAA AATGGTGCAG AGCTTGTACC AAACGTTCCA 
CTTGACCTGT CTCAGGCCCA CCTTATGCTA GGTATTGATG TTGGCTCCAC AACCGTAAAG
CTGTCCGTCA TCGATGAGGA TGGCACGCTG GTTTACGCAA ATTACGAGCG TCACCATACT
GATGTTCGCG CAACTGCGCG TTCCTTGTTT GTTAAAGCTC AGAAGTACAT TGGTGATACA
CCTATGTATG CCGCAATCAC CGGTTCCGGC GGTATGCTGC TGGCACAATG GCTTGACTTG
GAGTTTGTCC AGGAGGTTAT CGCTTCTAAG CGTGCTGTTG AGACGTTGAT TCCTCAGACC
GACGTTGCCA TTGAGCTGGG CGGCGAGGAC GCAAAGATTA TTTACTTTGA CAACGGCATT
GAGCAGCGCA TGAATGGTAC ATGTGCTGGT GGTACTGGCG CATTCATCGA CCAGATGGCA
TCTCTTCTTA AGACTGATGC AACCGGTCTT AACGAGCTGG CAAAAGACGT TAAGCAGATT
TACCCAATTG CTTCTCGCTG CGGTGTTTTT GCTAAGTCAG ACGTTCAGCC TCTGCTTAAT
GAGGGAGCTG CTCCCGCTGA CATTGCCGCT TCTATCTTCC AGGCTGTTGC AAACCAGACC
GTCTCTGGTT TGGCTTGTGG TCACCCTATC CGCGGTTACG TTGCGTTTCT TGGCGGTCCT
TTACAGTATC TTTCTGAGCT GCGCCGCCGC TTCTACATTA CGCTTAACCT GGATGACGAG
CATATCGTTC TGCCAAAGAA CGCACACCTC TTTGTTGCAA CGGGTGCTGC TCTGGCAGGC
GAGTCCGACC GACCAATCAC GTTTACCCAG GTCATGGAGG CGCTGGATAA TCTCAAGGAC
CTCCAGGGTT CCGAGGTTGC TCGTTTGGAT CCACTCTTTG CTACTCAGCA GGACTATGAT
GACTTTAAGA CTCGTCACGA CCAGGAGGTA GTGCCCAAGG GTCAGCTTGC CAACTATCAC
GGTCGTGTTT TCATCGGTAT TGACGCAGGT TCTACCACCA TGAAGGCTGC TGTTGTTGGC
GAGAAGGGCG AGCTGCTTTA CACCTGGTAT GACAATAACA ACGGCGATAT TTTGGGCACT
GCCCGCAAGA TCATGGACTC TATCTATGAT GAGATGCCTT CTGACTGCAT TATTGGTCAC
GTTACTACCA CTGGTTATGG TGAAGGTATT CTGATTGAGG CTCTGCGTGC AGACTCCGGA
GAGATTGAGA CTGTCGCTCA TCTGCGTGGT GCAAAAGCGT TTGTCCCAGA CGTTGATTTC
ATTCTTGATA TTGGCGGCCA GGACATGAAG TGTCTGCAGG TCAAAGATGG CGTCATTGAG
CACATCATGC TCAACGAGGC ATGCTCTGCA GGCTGCGGTT CCTTTATTGC ATCCTTCGCT
GACTCTATGA AGATGGATGT CCGTGAGTTT GCAACTGCTG CGACTAAGGC AAAACTGCCT
GTAGACCTGG GCTCTCGTTG TACCGTCTTT ATGAACTCCC GCGTCAAGCA GGCGCAGAAG
GAGGGTGCAA CCATCGGTGA CGTTGCTGCT GGTCTTTCGT ACTCCGTCAT CAAGAACGCC
CTGTTCAAGG TCATTAAGCT TCGCGACTTC AGCGAGATTG GTGAGCACTG CATCGTTCAG
GGCGGTACCT TTATGTCAGA CGCAACGCTT CGCGCCTTTG AGCTGCTTAC CGGCAGAGAT
GTTATCCGAC CTGACATTGC AGGCGTTATG GGTGCCTACG GCGCCGCTCT TCTCGCCCGA
GACCGCGCAG GTATTGATGG AGTGTCCACC TTGCTTGATC GAGAGTCCAT CGACAATCTG
CAGGTCAAGC TTACCAATAC ACACTGCCGC ATTTGTCCTA ACAGCTGCAT GCTTACTATC
AACGACTTTG GCGGCGGTCA CAGATTTATC ACCGGCAACC GTTGCGAGAA AGGCGCTGGT
AAGAAGCGTG GTGCAAAGAA GAATCAGGCA CCAAACCTCT TTGCGTATAA GAACAAGCTG
CTGTTTGATC GCGAGTCGCT TCCTGTAGAT GAGGCTTCAC GTGGCACTGT CGGCATTCCT
CGTGCGTTGA ACATGTACGA GAACTATCCG TTCTGGCACA CGTTTTTCAC TAAGCTGGGC
TTCTCGGTCA TTTTGTCCGA CCAGACCACC GCAAAGACAT ACGACGCGGG TATTGAATCC
ATGCCTTCCG AGTCTGCTTG CTATCCTGCA AAGCTTTCCC ATGGCCATAT CATGAACTTG
CTGGCTAAGG ACCCAGATTT CATTTGGATG CCATGTATTC GTTGGGAGCG CAAGGAGGAC
GATTCCGCAA CTAACCACTA TAACTGCCCA ATTGTCATGA GCTACCCTCA GGCGCTGGGC
CTCAACGTGG ACGAGCTCTC CGATCCGTCC ATTCAGTACC TGGCTCCGTT TATTCCATAC
GACAAGAAAA ATGAGCTTAA GCGTCGTTTG TACGAGCTCA TCAGCGAACA GCGCGAGAAG
GACGCCCAGG CAGGTAAGGG TCGTTTCCGC GGCGAGCACA TTACGCGTGC TGAAATTGAT
GCCGCTGTTG AGGCTGCTTG GCAGGAAGAC AGTAACTTTA AGGATCAGAT GCACCGTGCG
GGTGATGAGG CTCTTGCTTG GATTGAGGAG CATGACGCTC ACGGCATTGT TCTTGCAGGT
CGCCCTTATC ACAACGATCC AGAAATTAAT CACGCTATTC CTGAGCTGGT CTCTTCGTTT
GGTTTTGCTG TTCTTACCGA GGATTCCATT GCGCATAAGA TGCTGCCCGA GCGTCCAATT
CGCATTGTTG ACCAGTGGAT GTATCACTCA CGTCTGTATC GTGCGGCTCG CTTTGTTGCA
TCTCGAAACG ATCTTGACCT TATTCAGCTC TTCTCGTTTG GTTGCGGCCT TGATGCACTT
ACCACCGATC AGGTTCAAGA GATTCTTGAG GCTTCGGGCA AGATTTACAC CATGCTTAAG
GTTGATCAGG TTTCCAACTT AGGTGCTGCA CGAATTCGTA TCCGCTCTCT AATGGCAGCC
CTGAATGAGC AGCAAGCAGA GCTAGAGCGA CTTGCAGCTG CCGGCTTGGT TACCGAGGCC
GTTCCACAGG GCGTTCGCAT GGCCGATGGT TCTTTGGAAA AGGCTAGGAG TGCAAGTTCT
TCTCGCCGTG CGCCAGTGTA TCGTGAGGCA GAATCTGCAG CTTACGAGAA GGTCCGCTAC
ACCAAGGAGA TGCAGGAAGC GGGTTACACC ATTCTGGCTC CACAGATGGC ACCGATTCAC
TTTGAGCTGG TAGAGGAGTT GCTGAAGGGT GCGGGCTATA ACGTTGTGCT GTTGCCTTCA
GTTGACCAGG GTGCCGTTGA CATGGGTCTT CGTTACGTTA ACAACGATAT TTGCTACCCA
TCTATTCTGG TTACAGGTCA GATTATGGAG GCGGTGCTTT CGGGCAAGTA TGATCTGACT
AAGACTGCAG TCCTGATTTC GCAGACTGGT GGTGGCTGCC GTGCAACCAA CTACATTGCG
CTGATTCGTA AGGCGCTTAA AGATGCAGGC CATCCAGAGA TTCCAGTTAT TTCTATCTCC
GCTGCTTCTG GTCTTGACGA GGACAACCCG GGATTCAAGC TCTTCAAGCC TGATTTGCTG
ATCAAGGCAG TCTATGCGTT GCTTTACGGC GATCTAATCA TGCAGCTACT GTATCGCGTA
CGTCCATACG AAGCTGTCAA GGGCTCTGCA AACGAGTTAT ATGACCAGCT CATGGCCGAT
ATGCGTTCCA AGATCAATAG AATTTCTCGC AAGGAATTCT ACAAGCAGTG CCAGCGTACC
ATTGAGTTGT TTGACAGTCT GCCAGTGGTG AACGATCGAC AGAAGCCACG TGTTGGCGTT
GTCGGTGAGA TTCTGGTTAA GTTCCACCCA ACAGCCAACA ATGAGCTGAT CAAGGTCATT
GAGTCCGAGG GTTGCGAAGC AAACGTTCCT GGTCTGGTGG ACTTCTTCCT GTTTGGTCTC
TCTAATGCAA TCAATACGCA TAAGGAGCTA GGCACCACGT TCAAGAGCCG CATGACGCAT
ATCGCAGGAA TTAAGATGGT TCAGGGTCTG CGCGCGCCAA TCAATAAGAT GCTGGAGAAG
TCAGAGCGCT TTGAGCCGTA CCCCAACATT GATGAATTGG CCGAGAAGGC TGGTCAGATT
CTTTCGCTCT GTAACACGAT GGGCGAGGGT TGGCTGCTTA CCGCAGAGAT GTGCGACCTT
ATTGAAACGG GTACACCAAA CATTGTCTGT GCTCAGCCGT TTGCCTGTCT GCCAAACCAC
GTTGTTGGTA AGGCGGTTAT TAAGCGCTTG CGTCAGATGC ATCCTGAGTC CAACATTGTT
GCCGTTGACT ACGATCCGGG TGCGTCTGAG GTTAACCAGC TTAATCGTAT TAAGTTGATG
ATTTCGGTTG CTAAGGAGAA TTACAAAAAC GGTGTGAATG GCGAGTTTAA GCTAGAAAAC
GCTGATGACC CTGTTACTAA TGACGCAACT ATGCCGTATA CCGGCCGAGA TAAGTATGGC
CTAGATTCCG TTGTTCGCGA AGGCGGACAT TCTTGCTCGT CGCATCATGT ATCTGCATTG
GAAGCAACTG GCGCTAATTT GGGTGATCAT GGTCGCACGG CATCTGACCA CGGAAAAGAT
TACGGTTCTA TTAGGCTTTC TGAGGAGCAG ATAGCTGCAA TTGAGCGTGC CAAGAAGAAG
GCAGGCGTCA AATAG
 
Protein sequence
MISPKFELVE NGAELVPNVP LDLSQAHLML GIDVGSTTVK LSVIDEDGTL VYANYERHHT 
DVRATARSLF VKAQKYIGDT PMYAAITGSG GMLLAQWLDL EFVQEVIASK RAVETLIPQT
DVAIELGGED AKIIYFDNGI EQRMNGTCAG GTGAFIDQMA SLLKTDATGL NELAKDVKQI
YPIASRCGVF AKSDVQPLLN EGAAPADIAA SIFQAVANQT VSGLACGHPI RGYVAFLGGP
LQYLSELRRR FYITLNLDDE HIVLPKNAHL FVATGAALAG ESDRPITFTQ VMEALDNLKD
LQGSEVARLD PLFATQQDYD DFKTRHDQEV VPKGQLANYH GRVFIGIDAG STTMKAAVVG
EKGELLYTWY DNNNGDILGT ARKIMDSIYD EMPSDCIIGH VTTTGYGEGI LIEALRADSG
EIETVAHLRG AKAFVPDVDF ILDIGGQDMK CLQVKDGVIE HIMLNEACSA GCGSFIASFA
DSMKMDVREF ATAATKAKLP VDLGSRCTVF MNSRVKQAQK EGATIGDVAA GLSYSVIKNA
LFKVIKLRDF SEIGEHCIVQ GGTFMSDATL RAFELLTGRD VIRPDIAGVM GAYGAALLAR
DRAGIDGVST LLDRESIDNL QVKLTNTHCR ICPNSCMLTI NDFGGGHRFI TGNRCEKGAG
KKRGAKKNQA PNLFAYKNKL LFDRESLPVD EASRGTVGIP RALNMYENYP FWHTFFTKLG
FSVILSDQTT AKTYDAGIES MPSESACYPA KLSHGHIMNL LAKDPDFIWM PCIRWERKED
DSATNHYNCP IVMSYPQALG LNVDELSDPS IQYLAPFIPY DKKNELKRRL YELISEQREK
DAQAGKGRFR GEHITRAEID AAVEAAWQED SNFKDQMHRA GDEALAWIEE HDAHGIVLAG
RPYHNDPEIN HAIPELVSSF GFAVLTEDSI AHKMLPERPI RIVDQWMYHS RLYRAARFVA
SRNDLDLIQL FSFGCGLDAL TTDQVQEILE ASGKIYTMLK VDQVSNLGAA RIRIRSLMAA
LNEQQAELER LAAAGLVTEA VPQGVRMADG SLEKARSASS SRRAPVYREA ESAAYEKVRY
TKEMQEAGYT ILAPQMAPIH FELVEELLKG AGYNVVLLPS VDQGAVDMGL RYVNNDICYP
SILVTGQIME AVLSGKYDLT KTAVLISQTG GGCRATNYIA LIRKALKDAG HPEIPVISIS
AASGLDEDNP GFKLFKPDLL IKAVYALLYG DLIMQLLYRV RPYEAVKGSA NELYDQLMAD
MRSKINRISR KEFYKQCQRT IELFDSLPVV NDRQKPRVGV VGEILVKFHP TANNELIKVI
ESEGCEANVP GLVDFFLFGL SNAINTHKEL GTTFKSRMTH IAGIKMVQGL RAPINKMLEK
SERFEPYPNI DELAEKAGQI LSLCNTMGEG WLLTAEMCDL IETGTPNIVC AQPFACLPNH
VVGKAVIKRL RQMHPESNIV AVDYDPGASE VNQLNRIKLM ISVAKENYKN GVNGEFKLEN
ADDPVTNDAT MPYTGRDKYG LDSVVREGGH SCSSHHVSAL EATGANLGDH GRTASDHGKD
YGSIRLSEEQ IAAIERAKKK AGVK