Gene Apar_1274 details

Gene Information

Locus tag: Apar_1274
Symbol:
ID: 8414154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1429888
End bp: 1431900
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 51%
IMG OID: 645022866
Product: VacB and RNase II family 3'-5' exoribonuclease
Protein accession: YP_003180289
Protein GI: 257785072
COG category: [K] Transcription
COG ID: [COG0557] Exoribonuclease R
TIGRFAM ID: [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
            [TIGR02063] ribonuclease R
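
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, whose final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


length = gene_length_bp(1429888, 1431900)
print(length)                     # 2013
print(protein_length_aa(length))  # 670
```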


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCTAGAA AACGCATACA TAAAGGAAAT CGCCGTCAGG GACGTGCAAA AGTCTCAAAG 
CGACCATCAA CGCTGCTTAC AGGTACGTTG AGCCTTTCTC GCCCAGGAGT TGCTACCGTA
AGCACCCCTG AGGGAATGTA TGCGCTGGCA AAGCACGGTA TTCATGAAGC TATGCACGGA
GATGAGGTTC AGGTTTCGCT TGTCTCGGCA AAGGGAAAGG CTCCTCTGGC AAACGTGCGC
TTTGTGCTTA CGCGTGCTAC TCAGACGTTT TTGGGCATCT ACCATGACGC CGGACCACTT
GGCGCCGTGG TGCCGCTGGA TAGTCGCATT CAGCGAGATT TCTTTGTACT CCCTGAAGAT
CCTTCTGTTG GACGTCATTA TGTTTTTGAA GGTGACGTGG TTGTTGCACG TATTACTGAG
TACCCAACAC GCAAAAGTGC TGCTGTGGTT ACTATTGAGC GTCGTGTGGG ATCTGCCGAA
GATTTGGACA TGAACGTTGA GGCCACTATC GCTTCATATG GTTTGGCAAC AGAATTTCCC
ATTAAGGCAC TCAGACAAGC TGAAAAAATC TCTGTTGACG CAGATAAAGC TCTTGCAGAG
GACGCTTCAA GACACGATTT GCGAGAAGAG CTGTGCATTA CCGTAGATCC TGCGGATGCT
AAAGACTTTG ATGACGCTGT TGGCGCGCGC AAGCTTGAAG ATGGCAGCTT TGAGCTCTGG
GTTCATATTG CAGATGTGGC ACACTACGTG AAATGGGATT CTCCCATTGA CCTGGAAGCT
CGCATGCGCA CCTGCTCGGC ATATTTAGTT GACCGCGTGC TGCCCATGCT GCCAGAAAAG
CTCTGTAACG ACGTATGCTC TCTGCGACCA GCTGAAGACA GGCTGGCCAT GTCGGTAAAG
ATGAAATTGA GTAGCTCAGG AAAGATTCTT GGCGCAACCG CTATGAATTC TGTCATTCGT
TCAAGGGCAA GGCTTAGCTA CGACCAGGTA GATAGCTATC TTCAAGGAGA TGTCTCGGCA
CTTGACTCCG CCGTGTCTAG AGAAGACGCT GGTGCGATCA AAGAAATGAT CGACGTGCTC
AATCAAATCA GAGCACTGAG AGAAGAAATT CGCGAGAAAC GCGGCTCAGT AGATTTTGAG
AGCGTGGAGA CTCGCGTGGT ACTTGATGAG AATAACAAGC CTGTTGGCGT GTCGGTGCGA
GAGCGCACTC AGGCAACAGG TCTTATCGAG GAGGCAATGC TTGCAGCTAA CGAAAGCGTT
GCTCACATGC TCAGTCAGCA TGACCTAGAG TCAGCGTATC GCGTACATGA GCAGCCTTCT
CCAGAGTCAC TTAAGCTTGC GATTACTCCG CTTGTGGCGA TGGGCGCTCT TGAGCCAGAT
GTTGCGTCTC GTATTGCTAT TGGTGATCAA ACAGCATTGC AAGAAGCACT GGAGTCAGTT
CATGGCACAC GTTATTCTCG CGTGGTAAAT GCGCAGCTGC TCAGGGCTCA AAAACGAGCC
ATCTATCTGC CTACAAACCA GGGACACTTT GCCTTGGGAG CAGATGCGTA CTGTCACTTT
ACTTCACCAA TTAGGCGCTA TCCTGACGTG ATAGTACACC GCACATTGAA ACGATTGCTT
TGCGGTCAGA CAGCAGCTCG CGCGGAGCTG AGTGCGCTAG CGGATATCTG CAGCACGTGT
TCTGAGCAGG AGCGTAAGGC AGACGCGGCA GCTCGTGCTA CGCAAAAGAT TAAGCTGGCT
GAGTACTATC AGAGCCGTCT GGGAGAAGAG ACATGGGGCA CCATTGACGG CTGCGAGCGT
TTTGGTCTGT TCATTACGTT AGATGATACC TACGCCGATG GGCTTCTCAC CGTTAGGGAT
TTGGGCCACG AATGGTACGT GTATGATCAG GAGACGTTGG CGTTGATAGG GGAATCAACG
GGTAAAACAT ACCGCATTGG TGGACGTGTA CGCGTGAAAA TTTCTGGCGT AAATGTGGCT
CGTGGTCAGA TTGATTTGGC ACTTGTGGAA TAA
 
Protein sequence
MARKRIHKGN RRQGRAKVSK RPSTLLTGTL SLSRPGVATV STPEGMYALA KHGIHEAMHG 
DEVQVSLVSA KGKAPLANVR FVLTRATQTF LGIYHDAGPL GAVVPLDSRI QRDFFVLPED
PSVGRHYVFE GDVVVARITE YPTRKSAAVV TIERRVGSAE DLDMNVEATI ASYGLATEFP
IKALRQAEKI SVDADKALAE DASRHDLREE LCITVDPADA KDFDDAVGAR KLEDGSFELW
VHIADVAHYV KWDSPIDLEA RMRTCSAYLV DRVLPMLPEK LCNDVCSLRP AEDRLAMSVK
MKLSSSGKIL GATAMNSVIR SRARLSYDQV DSYLQGDVSA LDSAVSREDA GAIKEMIDVL
NQIRALREEI REKRGSVDFE SVETRVVLDE NNKPVGVSVR ERTQATGLIE EAMLAANESV
AHMLSQHDLE SAYRVHEQPS PESLKLAITP LVAMGALEPD VASRIAIGDQ TALQEALESV
HGTRYSRVVN AQLLRAQKRA IYLPTNQGHF ALGADAYCHF TSPIRRYPDV IVHRTLKRLL
CGQTAARAEL SALADICSTC SEQERKADAA ARATQKIKLA EYYQSRLGEE TWGTIDGCER
FGLFITLDDT YADGLLTVRD LGHEWYVYDQ ETLALIGEST GKTYRIGGRV RVKISGVNVA
RGQIDLALVE
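
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 uses the standard codon assignments, with the alternative start codons (including the TTG that opens this gene) read as methionine at the first position only. The function name `translate_cds` is hypothetical, and the example translates only the first 30 bases of the gene sequence.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Spaces and newlines (as in the blocked layout above) are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any accepted start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


print(translate_cds("TTGGCTAGAA AACGCATACA TAAAGGAAAT"))  # MARKRIHKGN
```

The output matches the first ten residues of the protein sequence listed above.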