Gene Apar_0941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0941 
Symbol 
ID8413812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1056462 
End bp1059356 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content48% 
IMG OID645022529 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003179961 
Protein GI257784744 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAA GCAAAATTGC TATCCGTGGT GCACGAGAGC ACAACCTGCA AGATATTGAT 
ATTGATATTC CTCGTGATCA ACTGGTAGTT ATTACGGGAC TTTCTGGCTC AGGTAAATCG
AGTCTTGCCT TTGACACAAT TTATGCAGAA GGACAGCGTC GCTACGTTGA ATCTTTGTCC
AGCTATGCCC GCCAGTTTTT GGGACAGATG GATAAGCCAG ACCTTGACTC TATTGACGGT
CTTTCTCCAG CTGTGTCTAT TGATCAGAAG ACAACTTCTA GAAACCCCCG CTCAACCGTA
GGTACTGTTA CAGAAATTTA TGATTATTTG CGTCTTTTGT ATGCTCGCAT GGGTACTCCT
CACTGCCCTG AGTGTGGACG CGTTATTGAG CGCCAGACAA CTGATCAAGT TGCCGATAAA
ATCCTCGAGG CAGGTCAAGG CCGCAGAGCT TATGTTTTAG CTCCTGTTGT TTTGGGCCGC
AAGGGTGAGT ACGTCAAGCT TTTTGAGGAC CTGCGCAAAG AAGGATTTTC TCGCGTTCGT
GTTGACGGTG TGGTGCGTGA GCTTGATGAA GAGATCATAC TGGGCAAAAC GCTCAAGCAC
GATATTGAGG TAGTTGTTGA CCGTATCGTC ATCCGTCCTG ATTCTTTGGG TCGCATTGTT
GAGGGAGTCG AGCAGGCAAC TAAGCTTGCC CAGGGCAAGG TAGGCATTTT ACTTCTTGCT
GATAAGTCCA ATCCAGAAAC CATGCCAGAA GAGCTTTTTC AGTACTCACT GGCGCTTGCT
TGTCCTATCC ATGGTCACTC CATGGATGAC CTACAGCCTC GTGATTTCTC GTTTAACGCC
CCATACGGCG CTTGTCCTGA CTGCGATGGT TTGGGAACTC GTAAAATTAT TGATGCTGCA
GCACTCATTG CTGATCCAAA GCTGTCCGTA TCAGAGGGCG TTTTTGGAAG TCTCTTTGGT
CACTCAAATT ACTATCCTCA GATTCTGTCT GCAGTCTGTA AGCATTTTGA CGTATCTGAT
ACAACACCTT GGAACAAGCT GCCTAAGAAG GTACAAGATG CCCTTCTCGG TGGTCTTGGT
TCTACTAAGA TTCGCGTTGA CTACAAGACG CGCGACGGGC GCAATACACA CTGGTTTACC
ACGTTCTCTG GTGTCAGAAA GATTCTTTTT GACAAGTATC AAGAGACCAC GTCAGAAAAT
ATGAAGACAC ATCTTGAGAA GTATATTCGC GAGATGCCCT GTACCACCTG TCATGGAGCT
CGCTTAAAGC CAGAAATTCT TTCCGTTACA GTTGGTAAGA AAAATATCTG GGAAGTCTGT
GAACTTTCTT GTAAAGAATC TTTGGAGTTC TTCAAGCAGC TAACTATTAC CGATCGCCAA
AAGGTTATTG CAGGTCCTAT TGTTAAAGAG ATTGTGGCCA GGCTGCAGTT CTTGGTGAAT
GTTGGCTTGG ACTATCTCAC GCTTTCTCGT GCGGCCGCAT CGCTTTCTGG TGGAGAAGCC
CAGCGTATTC GCTTGGCCAC TCAGATTGGT GCTGGTCTTA TGGGCGTCCT TTACATTTTG
GACGAGCCTT CTATTGGTCT TCATCAAAGA GATAACAATC GCCTTATCGA GACGCTCAAG
CAGCTTAGAG ATCGTGGTAA CACCGTGCTT GTTGTTGAGC ACGATGAAGA CACCATTCGC
GCGGCTGATT ACGTTATTGA TATGGGTCCC GGTGCCGGTG AGCTTGGTGG CTACGTTGTT
GCTGCAGGAA CCCCAGAAGA TATTGTTAAA AATCCTGATT CCATTACAGG TGCTTACCTT
ACGGGAAAGA AGCAGATCAA GCTACCGGAG GCCCGTCGTA AACCTGGTCG TGGAAAGATT
AAGATTACGG GAGCTAGCGC TAACAACCTC AAGAATGTTT CTGCTTTTAT TGAGCTGGGT
ACGCTGACGG TAGTTACGGG TGTTTCTGGA TCTGGTAAGT CTTCTCTAGT TACCGATACC
CTTGCGCCTG CGCTTACCAA TGCAGTTCAG CATTCAAAAC GCGTGGTAGG AGAGTATAAA
AAGCTCGAGG GCGTTGATCT TATCGATAAG GTCATTGATA TTGATCAGAG TCCTATTGGC
AGGACCCCGC GTTCTAACCC AGCAACGTAT ATTGGCCTTT GGGATGATCT GCGCGCACTG
TATGCTTCAG TCCCAGAGTC CCGTGCACGC GGCTACTCGG CTGGTCGTTT CTCGTTTAAC
GTCCAAGGAG GTCGCTGCGA GGCCTGTAAG GGCGACGGCC AGATCAAGAT CGAGATGAAC
TTCCTGCCTG ACGTTTATGT TCCCTGTGAG GTTTGCCACG GTAAGCGTTA TAACCGCGAG
ACGCTAGAGA TTCTCTACCA CGGCAAGTCT GTCTCTGACG TACTTGATAT GACTGTTCAC
GAGGCACTAG CGTTTTTTGC AAACATTCCT CGCATTAAGA ATAAGCTACA GACCCTCCAT
GATGTTGGTC TTGGCTACAT TCATCTTGGT CAGCCAGCAA CCACGCTATC TGGTGGCGAG
GCGCAGCGCG TCAAGCTTGC AAAAGAGCTT CACCGTCAGC AGACTGGTAA AACTCTTTAT
ATTCTGGATG AGCCAACAAC TGGTCTTCAT TTTGAGGATG TCAGGCAACT TATTGTTGTT
CTTGAGCGCC TTGTTGATGC TGGCAACACG GTTCTTGTCA TTGAGCACAA TCTAGATGTC
ATTAAAATGG CTGATCGCAT TATTGACATG GGTCCAGAAG GCGGCGACGG TGGAGGAACC
GTAGTTGTTT CTGGTACGCC AGAAAAGGTT GCCGCAACTC CAGAAAGCCA CACAGGCAAG
TTCCTTAAAG AGATTCTTGA CCGTGATAAT GCACGCCTCG CTGCAGAGAA AAAAGCTCAG
AAGAAACGTG CATAA
 
Protein sequence
MASSKIAIRG AREHNLQDID IDIPRDQLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
SYARQFLGQM DKPDLDSIDG LSPAVSIDQK TTSRNPRSTV GTVTEIYDYL RLLYARMGTP
HCPECGRVIE RQTTDQVADK ILEAGQGRRA YVLAPVVLGR KGEYVKLFED LRKEGFSRVR
VDGVVRELDE EIILGKTLKH DIEVVVDRIV IRPDSLGRIV EGVEQATKLA QGKVGILLLA
DKSNPETMPE ELFQYSLALA CPIHGHSMDD LQPRDFSFNA PYGACPDCDG LGTRKIIDAA
ALIADPKLSV SEGVFGSLFG HSNYYPQILS AVCKHFDVSD TTPWNKLPKK VQDALLGGLG
STKIRVDYKT RDGRNTHWFT TFSGVRKILF DKYQETTSEN MKTHLEKYIR EMPCTTCHGA
RLKPEILSVT VGKKNIWEVC ELSCKESLEF FKQLTITDRQ KVIAGPIVKE IVARLQFLVN
VGLDYLTLSR AAASLSGGEA QRIRLATQIG AGLMGVLYIL DEPSIGLHQR DNNRLIETLK
QLRDRGNTVL VVEHDEDTIR AADYVIDMGP GAGELGGYVV AAGTPEDIVK NPDSITGAYL
TGKKQIKLPE ARRKPGRGKI KITGASANNL KNVSAFIELG TLTVVTGVSG SGKSSLVTDT
LAPALTNAVQ HSKRVVGEYK KLEGVDLIDK VIDIDQSPIG RTPRSNPATY IGLWDDLRAL
YASVPESRAR GYSAGRFSFN VQGGRCEACK GDGQIKIEMN FLPDVYVPCE VCHGKRYNRE
TLEILYHGKS VSDVLDMTVH EALAFFANIP RIKNKLQTLH DVGLGYIHLG QPATTLSGGE
AQRVKLAKEL HRQQTGKTLY ILDEPTTGLH FEDVRQLIVV LERLVDAGNT VLVIEHNLDV
IKMADRIIDM GPEGGDGGGT VVVSGTPEKV AATPESHTGK FLKEILDRDN ARLAAEKKAQ
KKRA