Gene Information |
Locus tag | Apar_0871 |
Symbol | |
ID | 8413737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 969257 |
End bp | 972385 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645022454 |
Product | hypothetical protein |
Protein accession | YP_003179891 |
Protein GI | 257784674 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3857] ATP-dependent nuclease, subunit B |
TIGRFAM ID | |
|
|
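The coordinate and length fields in the Gene Information table can be checked against each other with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 969257
END_BP = 972385


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3129, matches "Gene Length | 3129 bp"
print(protein_length(length_bp))  # 1042, matches "Protein Length | 1042 aa"
```

Under these assumptions, both computed values agree with the table.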
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.124792 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGC TTCTGTACAA ACAACAAACA GGTGCTTTCT TATCAAACTC TGTTCAAGAG CAGCTAAGGG CTTCACTTGA GAGGTGGGGC TCTGCTTGGC TTATAGTGCC TTCTCCCGAT GCTGTTTTGT TAGTTAAGAG ACAACTTGGA TCCATCCAAG AGCTTTCTGT TGGCGTGAAT GTTTCAACTT TTGACGAGTG GATGCGCGAC CAATGGAAGT TGTACGGAAG CTCTGATCGT CTGCTTTCAA CTACGCTTCG CAAGGTATTT TTCCAGCAGA TTTTGGATGG AATGTCTGCT GATGAGCTAG GCACTTTGAA TAATAGCAAG GGTACTGTGG AGCTTCTGTC TAAGCTTGCT CCAGAGTATC TTACAGAGCT TGATCAGATT ATGGCCTCTG GTCAACTTTC TGCTGGTCAG ATGAGTGCTT GTAAGGTGCT AGAGCGCTAC AAGGCACTGC TTGAAGAAAA GTCTTATGTC GAGGTGTGCC AGTGTCTGGA TTATCTACTG CAGTCCATTC CAGCGCAAGG ACCTGCACTG ATTTTCTCGC GAGTTGAAGA TCTTTCGGAG GCGCGTCTTA AGTTTGTGCG TAAGCTAGCG CAGAAGCGTG ATGTGACGTT TTCTCTCTAC GTGCCAGAAG GACCTGCAGG TTATGCAGCA GAGCAGCAGC TGGAGTTGGT CGGAGGCCCG GGATGCGATT GTCGTGTGGA TGCGGAGCCT GCGCCAGCCG CAAAAAGCCA GGAGCTCAAC GATCTTCTCG CCAGTGTGTT TAGGGCCAAG GAGGGCAATA AGATTACGCC GAGCGGTGCG GTAACGTTTT TGTCGCCGCT GGGCCAGCAT GCCGAGGCAG AGTCTATAAG CAGGTATATC TCTCAGTGCG TAGAGTCTGG CAGCAAGAGC TTTGTTGTGT ACACTTCAAA TCCGCAAAGG GTTTGGGATG CGCTTTCGCA AAAACTTGCA GCTAAGGGAA TTGCTGTTCA CTATAGGCGT TCGGTGCGTA TTCAGGATTC ACTTGCAGGT CGTGCATTTG CCAGCTTGAT TGATGCGTAT GTTACGCTCA GTGAACGCGC AGAGCTCGAG AAAAACATTG ACTACCAGGC ATCCGATCAC CAGATGGGGG ATATGTCGTG GTGGCCTCCG CATACCCTTA CGGATTATTT GATTTCTCCT ATTTCAGGCA TAAGCGTTGA GCGCGCATGG ATGCTGGATA AGAGTTGGCG TGGTAACAGA ACGCTATATG CTACTAGAGT GCTTGAGACG CTTTCTAAAG CAGCTATGAG TTCACGTCTG TGCGCAGAGA CCATCAAGAG CCTTGAGCTG GGCAGAATTG GTTCTGCAGC CCAGCGTATT ATTGAACATC TTTCAGCAGA AATATCTGAT GAGCAGTCAG AGGTTGCTCA GTCTAAAGAG TTGACTCTTC AGTCTCTTCA AAATCAAGAG TCGCTTAAGG TAATGAGCAA GATTGTTTCT TCTGCTCAAG AGCTTCATGA GGCAGGATTA AAGCTCACTC CTCAGACACT CAAATCGTTT ATGGATCTCT GCAAAGATCA GGCGGTCATG ATGCCTGTTT CTAATGGTAT TAAGTCTGAT ATTCAGGTGT TAATCGCTCC CGTGAGCCAA GCTCATTCTT TTGATGCCGT CATTTTCCAG GGCATGGATA CGTTGAATTT TGGCGTTAAG GCATCAGACG GTGCTCTTCA AGAGTTTGTT CGCCATGCTA GTAAAACGCC TAAGGTATCA GAGTTTGCAC GGTACCAAAG AGATTTTTAT ACGGCGCTTG CAACCGCTTC TACCAGCGTT 
GCATTTGAAA AAGTAGAGCA AAAAGATGTC TTTAATGCGG TGGCTCTCAG TGAAGTCAAA GCATGCTATC CAAAAGACTA TGCAAAGAAG ACGGGGCTTG TGCGCGGAGA AGAAGAGGTG CTGGTAAACC TGTTGCCTCA AGCTTCAGAT CTAAAACGTG TTGCTGAACT TCCAACAATC GAATCTGGTG AGATTGACTC TAAACTCAAA AATATTGTGG TACTCCCACG TCATTTGACT AGGGAGACTC TTGAGCAGGA ATTACAGGGT CTTATTGAGG TCTCGCGAGA AGGACTTCCG CTTCTTTCCG CTTCTCAAAT TGAAACGTAT CTTGAGTGCC CCTACAAATG GTTCACACAA CGTCGCATCA AGATTTCTCA GGTAGACACT GAGTTTGCTC CAATGCAGAT GGGTACTTTT ATCCATCGTG TACTAGAGCT CACACACGCA ACGCTTTTAG CAGAAGCGCT TGGTTGTGAT GTGACTGAGG TTGATACGGC AGTTGAATCC GTACTTTTGC AAGACGTTGC CGGATCTAGG ATTACAACTG ATAACTTAGA TCATGCAAAA CAAGTGTTAG ACAGCTGTTT TGCTCAGGTG TGGGATGAGC AGTTTAACAA CATTAACCGA GCATCTTCAA ATGAGCTTAT TCCGCATAGC ATTCAAGAAA GAAAACAGGT TGAGAATATT CGAGAAAATC TTAAGGATCT TCTCGAGTTT GAAGCTTCGC ACTTTATTGG TTATCAACCC AGATTCTTTG AGCTTCGTTT TGGTAGAGAA GAAAATGTTG TTGAGTATGC AGGAGCTCAG TTTACTGGTT CGATCGACCG CGTAGATGTA AACGCTCATG GTCAGGCACT TATTATTGAC TATAAGCACA AGGGGACAAA AGATCTGAAG GCTTATTCAG CAAAGTTAAG TCTGGATAGT GAAGTTTCAA AAGAGGTCTT GCCAAGGCAT GTGCAGTCCG CAATATATGC TCAGATTATG AGAAAACAGC TCACCAAGTA TGAGCTTGAG TCAGTTGCCG CAATTTATCT GGGCACCAAA GAGCAAAAAG ATAAGCCTTC ATTTGCTCTT GCGGGTATGG CAACAGAAGC GGCAACAGAA CATATTTGGA ACATACATCC GGAAGATAAA AAGCTCAGGG ACCAGGCGGT TATGGTTGTG TCTCAAAATT CTGCAGAGTT TGCAGACTTT TTGGACGCTT GGGAGAATTT AATTGCGCAG AAGGTTCAAG CTATGCTTTC TGGAGATGTC CGAGCCAATC CTTGCGATAA GGATGCGTGT AAGTATTGCC CAGTAAAACT ATGTGACAAG AGGAGGTAA
|
Protein sequence | MSLLLYKQQT GAFLSNSVQE QLRASLERWG SAWLIVPSPD AVLLVKRQLG SIQELSVGVN VSTFDEWMRD QWKLYGSSDR LLSTTLRKVF FQQILDGMSA DELGTLNNSK GTVELLSKLA PEYLTELDQI MASGQLSAGQ MSACKVLERY KALLEEKSYV EVCQCLDYLL QSIPAQGPAL IFSRVEDLSE ARLKFVRKLA QKRDVTFSLY VPEGPAGYAA EQQLELVGGP GCDCRVDAEP APAAKSQELN DLLASVFRAK EGNKITPSGA VTFLSPLGQH AEAESISRYI SQCVESGSKS FVVYTSNPQR VWDALSQKLA AKGIAVHYRR SVRIQDSLAG RAFASLIDAY VTLSERAELE KNIDYQASDH QMGDMSWWPP HTLTDYLISP ISGISVERAW MLDKSWRGNR TLYATRVLET LSKAAMSSRL CAETIKSLEL GRIGSAAQRI IEHLSAEISD EQSEVAQSKE LTLQSLQNQE SLKVMSKIVS SAQELHEAGL KLTPQTLKSF MDLCKDQAVM MPVSNGIKSD IQVLIAPVSQ AHSFDAVIFQ GMDTLNFGVK ASDGALQEFV RHASKTPKVS EFARYQRDFY TALATASTSV AFEKVEQKDV FNAVALSEVK ACYPKDYAKK TGLVRGEEEV LVNLLPQASD LKRVAELPTI ESGEIDSKLK NIVVLPRHLT RETLEQELQG LIEVSREGLP LLSASQIETY LECPYKWFTQ RRIKISQVDT EFAPMQMGTF IHRVLELTHA TLLAEALGCD VTEVDTAVES VLLQDVAGSR ITTDNLDHAK QVLDSCFAQV WDEQFNNINR ASSNELIPHS IQERKQVENI RENLKDLLEF EASHFIGYQP RFFELRFGRE ENVVEYAGAQ FTGSIDRVDV NAHGQALIID YKHKGTKDLK AYSAKLSLDS EVSKEVLPRH VQSAIYAQIM RKQLTKYELE SVAAIYLGTK EQKDKPSFAL AGMATEAATE HIWNIHPEDK KLRDQAVMVV SQNSAEFADF LDAWENLIAQ KVQAMLSGDV RANPCDKDAC KYCPVKLCDK RR
|
| |
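The Gene sequence and Protein sequence entries can be cross-checked by translating the nucleotide sequence, and the GC content field can be recomputed from it. The sketch below is one possible way to do this in plain Python, not the method the database itself uses. It assumes the gene sequence is given on the coding strand (it begins with ATG even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table, differing only in which codons may act as starts. Alternative start codons are not handled, and the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence entry.
print(translate("ATGAGCTTGC TTCTGTACAA ACAACAAACA"))  # MSLLLYKQQT
```

The printed residues match the first block of the Protein sequence entry. Passing the complete Gene sequence to `translate` and `gc_content` allows the same comparison against the full Protein sequence and the GC content field.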