Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0941 |
Symbol | |
ID | 8413812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1056462 |
End bp | 1059356 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645022529 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003179961 |
Protein GI | 257784744 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAA GCAAAATTGC TATCCGTGGT GCACGAGAGC ACAACCTGCA AGATATTGAT ATTGATATTC CTCGTGATCA ACTGGTAGTT ATTACGGGAC TTTCTGGCTC AGGTAAATCG AGTCTTGCCT TTGACACAAT TTATGCAGAA GGACAGCGTC GCTACGTTGA ATCTTTGTCC AGCTATGCCC GCCAGTTTTT GGGACAGATG GATAAGCCAG ACCTTGACTC TATTGACGGT CTTTCTCCAG CTGTGTCTAT TGATCAGAAG ACAACTTCTA GAAACCCCCG CTCAACCGTA GGTACTGTTA CAGAAATTTA TGATTATTTG CGTCTTTTGT ATGCTCGCAT GGGTACTCCT CACTGCCCTG AGTGTGGACG CGTTATTGAG CGCCAGACAA CTGATCAAGT TGCCGATAAA ATCCTCGAGG CAGGTCAAGG CCGCAGAGCT TATGTTTTAG CTCCTGTTGT TTTGGGCCGC AAGGGTGAGT ACGTCAAGCT TTTTGAGGAC CTGCGCAAAG AAGGATTTTC TCGCGTTCGT GTTGACGGTG TGGTGCGTGA GCTTGATGAA GAGATCATAC TGGGCAAAAC GCTCAAGCAC GATATTGAGG TAGTTGTTGA CCGTATCGTC ATCCGTCCTG ATTCTTTGGG TCGCATTGTT GAGGGAGTCG AGCAGGCAAC TAAGCTTGCC CAGGGCAAGG TAGGCATTTT ACTTCTTGCT GATAAGTCCA ATCCAGAAAC CATGCCAGAA GAGCTTTTTC AGTACTCACT GGCGCTTGCT TGTCCTATCC ATGGTCACTC CATGGATGAC CTACAGCCTC GTGATTTCTC GTTTAACGCC CCATACGGCG CTTGTCCTGA CTGCGATGGT TTGGGAACTC GTAAAATTAT TGATGCTGCA GCACTCATTG CTGATCCAAA GCTGTCCGTA TCAGAGGGCG TTTTTGGAAG TCTCTTTGGT CACTCAAATT ACTATCCTCA GATTCTGTCT GCAGTCTGTA AGCATTTTGA CGTATCTGAT ACAACACCTT GGAACAAGCT GCCTAAGAAG GTACAAGATG CCCTTCTCGG TGGTCTTGGT TCTACTAAGA TTCGCGTTGA CTACAAGACG CGCGACGGGC GCAATACACA CTGGTTTACC ACGTTCTCTG GTGTCAGAAA GATTCTTTTT GACAAGTATC AAGAGACCAC GTCAGAAAAT ATGAAGACAC ATCTTGAGAA GTATATTCGC GAGATGCCCT GTACCACCTG TCATGGAGCT CGCTTAAAGC CAGAAATTCT TTCCGTTACA GTTGGTAAGA AAAATATCTG GGAAGTCTGT GAACTTTCTT GTAAAGAATC TTTGGAGTTC TTCAAGCAGC TAACTATTAC CGATCGCCAA AAGGTTATTG CAGGTCCTAT TGTTAAAGAG ATTGTGGCCA GGCTGCAGTT CTTGGTGAAT GTTGGCTTGG ACTATCTCAC GCTTTCTCGT GCGGCCGCAT CGCTTTCTGG TGGAGAAGCC CAGCGTATTC GCTTGGCCAC TCAGATTGGT GCTGGTCTTA TGGGCGTCCT TTACATTTTG GACGAGCCTT CTATTGGTCT TCATCAAAGA GATAACAATC GCCTTATCGA GACGCTCAAG CAGCTTAGAG ATCGTGGTAA CACCGTGCTT GTTGTTGAGC ACGATGAAGA CACCATTCGC GCGGCTGATT ACGTTATTGA TATGGGTCCC GGTGCCGGTG AGCTTGGTGG CTACGTTGTT GCTGCAGGAA CCCCAGAAGA TATTGTTAAA AATCCTGATT CCATTACAGG TGCTTACCTT ACGGGAAAGA AGCAGATCAA GCTACCGGAG GCCCGTCGTA AACCTGGTCG TGGAAAGATT AAGATTACGG GAGCTAGCGC TAACAACCTC AAGAATGTTT CTGCTTTTAT TGAGCTGGGT ACGCTGACGG TAGTTACGGG TGTTTCTGGA TCTGGTAAGT CTTCTCTAGT TACCGATACC CTTGCGCCTG CGCTTACCAA TGCAGTTCAG CATTCAAAAC GCGTGGTAGG AGAGTATAAA AAGCTCGAGG GCGTTGATCT TATCGATAAG GTCATTGATA TTGATCAGAG TCCTATTGGC AGGACCCCGC GTTCTAACCC AGCAACGTAT ATTGGCCTTT GGGATGATCT GCGCGCACTG TATGCTTCAG TCCCAGAGTC CCGTGCACGC GGCTACTCGG CTGGTCGTTT CTCGTTTAAC GTCCAAGGAG GTCGCTGCGA GGCCTGTAAG GGCGACGGCC AGATCAAGAT CGAGATGAAC TTCCTGCCTG ACGTTTATGT TCCCTGTGAG GTTTGCCACG GTAAGCGTTA TAACCGCGAG ACGCTAGAGA TTCTCTACCA CGGCAAGTCT GTCTCTGACG TACTTGATAT GACTGTTCAC GAGGCACTAG CGTTTTTTGC AAACATTCCT CGCATTAAGA ATAAGCTACA GACCCTCCAT GATGTTGGTC TTGGCTACAT TCATCTTGGT CAGCCAGCAA CCACGCTATC TGGTGGCGAG GCGCAGCGCG TCAAGCTTGC AAAAGAGCTT CACCGTCAGC AGACTGGTAA AACTCTTTAT ATTCTGGATG AGCCAACAAC TGGTCTTCAT TTTGAGGATG TCAGGCAACT TATTGTTGTT CTTGAGCGCC TTGTTGATGC TGGCAACACG GTTCTTGTCA TTGAGCACAA TCTAGATGTC ATTAAAATGG CTGATCGCAT TATTGACATG GGTCCAGAAG GCGGCGACGG TGGAGGAACC GTAGTTGTTT CTGGTACGCC AGAAAAGGTT GCCGCAACTC CAGAAAGCCA CACAGGCAAG TTCCTTAAAG AGATTCTTGA CCGTGATAAT GCACGCCTCG CTGCAGAGAA AAAAGCTCAG AAGAAACGTG CATAA
|
Protein sequence | MASSKIAIRG AREHNLQDID IDIPRDQLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS SYARQFLGQM DKPDLDSIDG LSPAVSIDQK TTSRNPRSTV GTVTEIYDYL RLLYARMGTP HCPECGRVIE RQTTDQVADK ILEAGQGRRA YVLAPVVLGR KGEYVKLFED LRKEGFSRVR VDGVVRELDE EIILGKTLKH DIEVVVDRIV IRPDSLGRIV EGVEQATKLA QGKVGILLLA DKSNPETMPE ELFQYSLALA CPIHGHSMDD LQPRDFSFNA PYGACPDCDG LGTRKIIDAA ALIADPKLSV SEGVFGSLFG HSNYYPQILS AVCKHFDVSD TTPWNKLPKK VQDALLGGLG STKIRVDYKT RDGRNTHWFT TFSGVRKILF DKYQETTSEN MKTHLEKYIR EMPCTTCHGA RLKPEILSVT VGKKNIWEVC ELSCKESLEF FKQLTITDRQ KVIAGPIVKE IVARLQFLVN VGLDYLTLSR AAASLSGGEA QRIRLATQIG AGLMGVLYIL DEPSIGLHQR DNNRLIETLK QLRDRGNTVL VVEHDEDTIR AADYVIDMGP GAGELGGYVV AAGTPEDIVK NPDSITGAYL TGKKQIKLPE ARRKPGRGKI KITGASANNL KNVSAFIELG TLTVVTGVSG SGKSSLVTDT LAPALTNAVQ HSKRVVGEYK KLEGVDLIDK VIDIDQSPIG RTPRSNPATY IGLWDDLRAL YASVPESRAR GYSAGRFSFN VQGGRCEACK GDGQIKIEMN FLPDVYVPCE VCHGKRYNRE TLEILYHGKS VSDVLDMTVH EALAFFANIP RIKNKLQTLH DVGLGYIHLG QPATTLSGGE AQRVKLAKEL HRQQTGKTLY ILDEPTTGLH FEDVRQLIVV LERLVDAGNT VLVIEHNLDV IKMADRIIDM GPEGGDGGGT VVVSGTPEKV AATPESHTGK FLKEILDRDN ARLAAEKKAQ KKRA
|
| |