Gene Apar_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0871 
Symbol 
ID8413737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp969257 
End bp972385 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content46% 
IMG OID645022454 
Producthypothetical protein 
Protein accessionYP_003179891 
Protein GI257784674 
COG category[L] Replication, recombination and repair 
COG ID[COG3857] ATP-dependent nuclease, subunit B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.124792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGC TTCTGTACAA ACAACAAACA GGTGCTTTCT TATCAAACTC TGTTCAAGAG 
CAGCTAAGGG CTTCACTTGA GAGGTGGGGC TCTGCTTGGC TTATAGTGCC TTCTCCCGAT
GCTGTTTTGT TAGTTAAGAG ACAACTTGGA TCCATCCAAG AGCTTTCTGT TGGCGTGAAT
GTTTCAACTT TTGACGAGTG GATGCGCGAC CAATGGAAGT TGTACGGAAG CTCTGATCGT
CTGCTTTCAA CTACGCTTCG CAAGGTATTT TTCCAGCAGA TTTTGGATGG AATGTCTGCT
GATGAGCTAG GCACTTTGAA TAATAGCAAG GGTACTGTGG AGCTTCTGTC TAAGCTTGCT
CCAGAGTATC TTACAGAGCT TGATCAGATT ATGGCCTCTG GTCAACTTTC TGCTGGTCAG
ATGAGTGCTT GTAAGGTGCT AGAGCGCTAC AAGGCACTGC TTGAAGAAAA GTCTTATGTC
GAGGTGTGCC AGTGTCTGGA TTATCTACTG CAGTCCATTC CAGCGCAAGG ACCTGCACTG
ATTTTCTCGC GAGTTGAAGA TCTTTCGGAG GCGCGTCTTA AGTTTGTGCG TAAGCTAGCG
CAGAAGCGTG ATGTGACGTT TTCTCTCTAC GTGCCAGAAG GACCTGCAGG TTATGCAGCA
GAGCAGCAGC TGGAGTTGGT CGGAGGCCCG GGATGCGATT GTCGTGTGGA TGCGGAGCCT
GCGCCAGCCG CAAAAAGCCA GGAGCTCAAC GATCTTCTCG CCAGTGTGTT TAGGGCCAAG
GAGGGCAATA AGATTACGCC GAGCGGTGCG GTAACGTTTT TGTCGCCGCT GGGCCAGCAT
GCCGAGGCAG AGTCTATAAG CAGGTATATC TCTCAGTGCG TAGAGTCTGG CAGCAAGAGC
TTTGTTGTGT ACACTTCAAA TCCGCAAAGG GTTTGGGATG CGCTTTCGCA AAAACTTGCA
GCTAAGGGAA TTGCTGTTCA CTATAGGCGT TCGGTGCGTA TTCAGGATTC ACTTGCAGGT
CGTGCATTTG CCAGCTTGAT TGATGCGTAT GTTACGCTCA GTGAACGCGC AGAGCTCGAG
AAAAACATTG ACTACCAGGC ATCCGATCAC CAGATGGGGG ATATGTCGTG GTGGCCTCCG
CATACCCTTA CGGATTATTT GATTTCTCCT ATTTCAGGCA TAAGCGTTGA GCGCGCATGG
ATGCTGGATA AGAGTTGGCG TGGTAACAGA ACGCTATATG CTACTAGAGT GCTTGAGACG
CTTTCTAAAG CAGCTATGAG TTCACGTCTG TGCGCAGAGA CCATCAAGAG CCTTGAGCTG
GGCAGAATTG GTTCTGCAGC CCAGCGTATT ATTGAACATC TTTCAGCAGA AATATCTGAT
GAGCAGTCAG AGGTTGCTCA GTCTAAAGAG TTGACTCTTC AGTCTCTTCA AAATCAAGAG
TCGCTTAAGG TAATGAGCAA GATTGTTTCT TCTGCTCAAG AGCTTCATGA GGCAGGATTA
AAGCTCACTC CTCAGACACT CAAATCGTTT ATGGATCTCT GCAAAGATCA GGCGGTCATG
ATGCCTGTTT CTAATGGTAT TAAGTCTGAT ATTCAGGTGT TAATCGCTCC CGTGAGCCAA
GCTCATTCTT TTGATGCCGT CATTTTCCAG GGCATGGATA CGTTGAATTT TGGCGTTAAG
GCATCAGACG GTGCTCTTCA AGAGTTTGTT CGCCATGCTA GTAAAACGCC TAAGGTATCA
GAGTTTGCAC GGTACCAAAG AGATTTTTAT ACGGCGCTTG CAACCGCTTC TACCAGCGTT
GCATTTGAAA AAGTAGAGCA AAAAGATGTC TTTAATGCGG TGGCTCTCAG TGAAGTCAAA
GCATGCTATC CAAAAGACTA TGCAAAGAAG ACGGGGCTTG TGCGCGGAGA AGAAGAGGTG
CTGGTAAACC TGTTGCCTCA AGCTTCAGAT CTAAAACGTG TTGCTGAACT TCCAACAATC
GAATCTGGTG AGATTGACTC TAAACTCAAA AATATTGTGG TACTCCCACG TCATTTGACT
AGGGAGACTC TTGAGCAGGA ATTACAGGGT CTTATTGAGG TCTCGCGAGA AGGACTTCCG
CTTCTTTCCG CTTCTCAAAT TGAAACGTAT CTTGAGTGCC CCTACAAATG GTTCACACAA
CGTCGCATCA AGATTTCTCA GGTAGACACT GAGTTTGCTC CAATGCAGAT GGGTACTTTT
ATCCATCGTG TACTAGAGCT CACACACGCA ACGCTTTTAG CAGAAGCGCT TGGTTGTGAT
GTGACTGAGG TTGATACGGC AGTTGAATCC GTACTTTTGC AAGACGTTGC CGGATCTAGG
ATTACAACTG ATAACTTAGA TCATGCAAAA CAAGTGTTAG ACAGCTGTTT TGCTCAGGTG
TGGGATGAGC AGTTTAACAA CATTAACCGA GCATCTTCAA ATGAGCTTAT TCCGCATAGC
ATTCAAGAAA GAAAACAGGT TGAGAATATT CGAGAAAATC TTAAGGATCT TCTCGAGTTT
GAAGCTTCGC ACTTTATTGG TTATCAACCC AGATTCTTTG AGCTTCGTTT TGGTAGAGAA
GAAAATGTTG TTGAGTATGC AGGAGCTCAG TTTACTGGTT CGATCGACCG CGTAGATGTA
AACGCTCATG GTCAGGCACT TATTATTGAC TATAAGCACA AGGGGACAAA AGATCTGAAG
GCTTATTCAG CAAAGTTAAG TCTGGATAGT GAAGTTTCAA AAGAGGTCTT GCCAAGGCAT
GTGCAGTCCG CAATATATGC TCAGATTATG AGAAAACAGC TCACCAAGTA TGAGCTTGAG
TCAGTTGCCG CAATTTATCT GGGCACCAAA GAGCAAAAAG ATAAGCCTTC ATTTGCTCTT
GCGGGTATGG CAACAGAAGC GGCAACAGAA CATATTTGGA ACATACATCC GGAAGATAAA
AAGCTCAGGG ACCAGGCGGT TATGGTTGTG TCTCAAAATT CTGCAGAGTT TGCAGACTTT
TTGGACGCTT GGGAGAATTT AATTGCGCAG AAGGTTCAAG CTATGCTTTC TGGAGATGTC
CGAGCCAATC CTTGCGATAA GGATGCGTGT AAGTATTGCC CAGTAAAACT ATGTGACAAG
AGGAGGTAA
 
Protein sequence
MSLLLYKQQT GAFLSNSVQE QLRASLERWG SAWLIVPSPD AVLLVKRQLG SIQELSVGVN 
VSTFDEWMRD QWKLYGSSDR LLSTTLRKVF FQQILDGMSA DELGTLNNSK GTVELLSKLA
PEYLTELDQI MASGQLSAGQ MSACKVLERY KALLEEKSYV EVCQCLDYLL QSIPAQGPAL
IFSRVEDLSE ARLKFVRKLA QKRDVTFSLY VPEGPAGYAA EQQLELVGGP GCDCRVDAEP
APAAKSQELN DLLASVFRAK EGNKITPSGA VTFLSPLGQH AEAESISRYI SQCVESGSKS
FVVYTSNPQR VWDALSQKLA AKGIAVHYRR SVRIQDSLAG RAFASLIDAY VTLSERAELE
KNIDYQASDH QMGDMSWWPP HTLTDYLISP ISGISVERAW MLDKSWRGNR TLYATRVLET
LSKAAMSSRL CAETIKSLEL GRIGSAAQRI IEHLSAEISD EQSEVAQSKE LTLQSLQNQE
SLKVMSKIVS SAQELHEAGL KLTPQTLKSF MDLCKDQAVM MPVSNGIKSD IQVLIAPVSQ
AHSFDAVIFQ GMDTLNFGVK ASDGALQEFV RHASKTPKVS EFARYQRDFY TALATASTSV
AFEKVEQKDV FNAVALSEVK ACYPKDYAKK TGLVRGEEEV LVNLLPQASD LKRVAELPTI
ESGEIDSKLK NIVVLPRHLT RETLEQELQG LIEVSREGLP LLSASQIETY LECPYKWFTQ
RRIKISQVDT EFAPMQMGTF IHRVLELTHA TLLAEALGCD VTEVDTAVES VLLQDVAGSR
ITTDNLDHAK QVLDSCFAQV WDEQFNNINR ASSNELIPHS IQERKQVENI RENLKDLLEF
EASHFIGYQP RFFELRFGRE ENVVEYAGAQ FTGSIDRVDV NAHGQALIID YKHKGTKDLK
AYSAKLSLDS EVSKEVLPRH VQSAIYAQIM RKQLTKYELE SVAAIYLGTK EQKDKPSFAL
AGMATEAATE HIWNIHPEDK KLRDQAVMVV SQNSAEFADF LDAWENLIAQ KVQAMLSGDV
RANPCDKDAC KYCPVKLCDK RR