Gene Apar_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0656 
Symbol 
ID8413516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp731785 
End bp733941 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content49% 
IMG OID645022233 
Productformate acetyltransferase 
Protein accessionYP_003179676 
Protein GI257784459 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0887776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA CACCAGCACA GGAGGCTTGG GACGGCTTTG TCGGTGGTAA CTGGCAAAAA 
GCAATTGACG TCCGAGACTT CATCCAGAGG AATTACACCC CTTACGATGG TGATGATTCC
TTCCTCGCTG GTCCAACTGA AGCTACTACA AAGCTTTGGG CGGATGTTAT GGATCTTTTT
GCTCAAGAAA CCGCAAATGG TGGCGTACTT GATATGGATA CCAAGCTGGT TTCCACTATC
ACTTCCCACG AGGCTGGCTA CATTGAGAAG CCACTTGAGC AGATTGTCGG CCTTCAAACG
GACAAGCCAC TTAAGCGCGC CCTTATGGTA GACGGTGGCA TTCGTATGGC AATGGCTGCC
TGCAAGGCAT ATGGATACGA GGTTGACCCA GAGATTGTTA ACTTCTACAC CTATCGTCGC
AAGACTCACA ACGCTGGCGT TTTTGATGTC TATACCGAGG AAATGCGTAA ATGCCGTCAC
TCCCACATCA TCACTGGTCT GCCTGACGCA TATGGCCGTG GCCGCATCAT CGGCGATTAT
CGTCGTGTTG CACTTTACGG CGTTGATGCT CTTATTGCAG ACAAGACAAA TCAGAAAAAT
GGCACCGGCT CCATCATGGA CGAGAAGACC ATTCGTCATC GCGAAGAGCT CTCTGAGCAG
ATTCGTGCGC TCAAAGAACT TAAGCAGCTT GGTGAGATTT ACGGCTTTGA TTTGGGCCGT
CCTGCTGAGA ACTTCAAAGA GGCTGTTCAG TGGCTCTACC TGGGTTATCT TGCCGCTGTA
AAAGAGCAGA ATGGCGCTGC AATGTCCATT GGTCGCAACA CCACCTTCTT GGACATCTAT
GCAGAGCGCG ATCTTGCTCG TGGTACCTTC ACTGAATCTG AGATTCAGGA GATTGTTGAT
CACCTGGTTA TGAAGCTTCG CATGGTCAAG TTTGCCCGTA CCCCTGAGTA CAACGAGCTC
TTCTCCGGAG ACCCTCAGTG GGTCACTGAG TCCATTGGTG GCATTGGCAT TGATGGCCGC
TCTATGGTTA CCAAGTCCAG CTTCCGCTGG CTCCACACTC TTGAAAACAT GGGTACCAGC
CCAGAGCCTA ATCTGACTGT TCTGTGGTCC ACCAAGCTTC CAGTTGGCTT CAAACGCTAC
TGCGCAAAGA TTTCTATTAC CACCAGCTCC ATCCAGTATG AGAACGATGA TCTTATGCGC
GTTTATCATG GCGATGACTA CGCAATTGCT TGCTGCGTCA GCTCCATGCG CATTGGTAAA
GAGATGCAGT TCTTTGGCGC TCGTGCAAAC CTTGCTAAAT GTCTTCTTTA CGCAATCAAT
GGTGGTCGTG ACGAGAAGAC CGGCGAGCAG ATTGGTCCTA AGTATCGTGC AGTTGAAGGC
GAGTATCTTG ATTACGACGA TGTCTTCTCT AAGTATATGG ACATGATGCG CTGGCTTGCA
GGCGTTTACG TCAACGCTCT TAATGCAATC CACTACATGC ACGATAAGTA CAGCTACGAG
CGCATTCAAA TGGCTCTACA CGATGAGCAC GTTCATCGCT GGTTTGCAAC AGGCATTGCA
GGACTTTCTG TTGTTGCTGA CTCCCTCTCT GCAATCAAGT ACGCAAAGGT AAGGGTTGTC
CGCGATGAGA CTGGTCTTGT CACCGATTAC ATTATTGAGG GAAACTTCCC TAAATACGGT
AACGATGATG ACCGCGTTGA CACTATTGCT CATGACATCG TTGAGATCTT CATGAAGATG
ATTCGTCAGA ACCACACCTA CCGCGATTCC GTTCCAACCA CTTCTATCCT GACCATTACC
TCTAACGTTG TGTATGGTAA AGCTACCGGC AACACCCCTG ATGGTCGTCG CGCTGGTGTT
CCTTTTGCTC CTGGTGCTAA CCCAATGCAC CGACGTGACA CTTATGGCGC GGTTGCATCT
CTAGCCTCCG TTGCTAAGCT TCCATTCAAC GACGCACAGG ACGGTATATC CAATACCTTC
TCAATCATTC CTAATGCTCT TGGTAAGGGC TCTGATGTCT ACTTCCACGG CAGTGAGCTT
AACCTTGACC ACATCGACTT CTCTGACATG AATCTTGACA TTCAGATTGA GAACAAGATT
GATTGCGCCT GTGAGGCAGA TCCATCCGCA GCTGAAGGTG CTGAGGATCG TAAATAA
 
Protein sequence
MQETPAQEAW DGFVGGNWQK AIDVRDFIQR NYTPYDGDDS FLAGPTEATT KLWADVMDLF 
AQETANGGVL DMDTKLVSTI TSHEAGYIEK PLEQIVGLQT DKPLKRALMV DGGIRMAMAA
CKAYGYEVDP EIVNFYTYRR KTHNAGVFDV YTEEMRKCRH SHIITGLPDA YGRGRIIGDY
RRVALYGVDA LIADKTNQKN GTGSIMDEKT IRHREELSEQ IRALKELKQL GEIYGFDLGR
PAENFKEAVQ WLYLGYLAAV KEQNGAAMSI GRNTTFLDIY AERDLARGTF TESEIQEIVD
HLVMKLRMVK FARTPEYNEL FSGDPQWVTE SIGGIGIDGR SMVTKSSFRW LHTLENMGTS
PEPNLTVLWS TKLPVGFKRY CAKISITTSS IQYENDDLMR VYHGDDYAIA CCVSSMRIGK
EMQFFGARAN LAKCLLYAIN GGRDEKTGEQ IGPKYRAVEG EYLDYDDVFS KYMDMMRWLA
GVYVNALNAI HYMHDKYSYE RIQMALHDEH VHRWFATGIA GLSVVADSLS AIKYAKVRVV
RDETGLVTDY IIEGNFPKYG NDDDRVDTIA HDIVEIFMKM IRQNHTYRDS VPTTSILTIT
SNVVYGKATG NTPDGRRAGV PFAPGANPMH RRDTYGAVAS LASVAKLPFN DAQDGISNTF
SIIPNALGKG SDVYFHGSEL NLDHIDFSDM NLDIQIENKI DCACEADPSA AEGAEDRK