Gene Apar_0519 details

Gene Information

Locus tag: Apar_0519
Symbol:
ID: 8413370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 598042
End bp: 601404
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 46%
IMG OID: 645022089
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_003179541
Protein GI: 257784324
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
            [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
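The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 598042
end_bp = 601404
gene_length_bp = 3363
protein_length_aa = 1120


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3363
print(expected_protein_length(gene_length_bp))  # 1120
```

Both results agree with the Gene Length and Protein Length fields listed in the record.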


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.388344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00158278
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGGCA TCGAGAAAAG CACTCGCTGG AGTGTTCTTA CGCAAGACGT ATATCTAGAG 
TCATTGTTTG AAAAAGAATT GCACGTTACC CCACTTGTTG CTCGTGTGTT GGTAGCACGT
GGATTTACCG ATGTAGCTGA AGCTCAAGAG TTTCTCTCAC CATCCCTGCA AAGAGATTGG
CTTGATCCAC AGTGTATTCC AGGAATGGCT GAGGTTGCCA ATCGTGTTCA AAAGGCCATT
GTTGAACAGC AAAAGATTGC TGTTTTTGGT GACTTTGACG TTGATGGAAT GAGTTCAACT
TGTTTGCTCA CCTTGGCTTT GCGCCGATTG GGTGCTGATG TTTTGCCGTA TATTCCACAT
CGATTTGGTG AAGGATACGG ACTTTCTAAA GAGGCTTTAG CAAGAGTTTT ACTAGATAGA
AAGCCTGATT TAATTATTAC GGTAGACAAT GGTATTGCTG CTGCTCAAGA GGTTGCATGG
CTTTTGGAAC AGGGGATTGA TGTTGTTATT ACCGATCACC ATGAACCCGC AGATTTGGTG
CCCCAGGGTG TCCCTGTAAC TGATCCAAAA CTTATTGAGG ATTGTCCTTC CAGAGAGCTT
GCTGGTGCAG GCGTTGCCCT TAAGCTAGTA CAAGTTTTAG GGCAACTTCA AAATCAGCCT
CAGCTTTGGC TTGATTACAT TGATGTTGCC ATGTTGGGCA CGCTGTCTGA CATGATGATG
CTCAATAAAG AAAATCGTGC ACTGGTCTCT GAAGGTGTTA AACGTCTACA GAAAGGCCTT
CGTCCTGGTC TGGTGGCTCT TGCTGCTGTT GCCGGTCAAG ATATTGCACA AATTTCTGCA
GATAATCTGC CATTTTCTAT TATTCCACGT CTTAACGCGG CAGGTCGTAT GGGTACTACC
GATATTGCCC TTGACCTTCT TCTCACAGAA GATGCAGAGG AGGCAACTAT TCTGGCTGGC
AAGCTGGAAG AGATTAATGC TGAACGTCGT GCTATTGAGG CAAAACTCAC TGATGAAGCC
CTTGAGCAGG CAAAAGAAAT CTACGACGGT GGTCACGCTA TTGTGCTTGC CAAAGAGGGT
TGGCACGAGG GTGTAAAGGG TATTGTTGCT TCAAGAATTG TTAATCGCTA TCACGTACCT
TGCATTCTGT TTACCATCCA AGATGGCGTA GCTCGTGGTT CTGGACGCTC AGTTGGCTCC
GTTGATTTAT TCCATGCGGT AGAGCAGTGT GCAGATCTTA CTGTTCGTTT TGGTGGTCAC
CAGGGTGCTG TTGGCGTAAC GGTAGAAACA TCAAAGATTG ATGCATTTAG AAAGCGCTTA
TCTGAGGTTA TGGCAGAGCT TCCAGAAGAA GAGTTTGAGT CTTCAGGAGA AGTAACTGCC
TTGGTAAACC TTGACGAGAT TACTATTAAC TCTATTGATG CACTTGAGGC GCTGCAGCCT
TTTGGTCAGG GTAACAAAAA ACCGCTCTTT GGCGTTAAGG GCGTAGTGAT GAAGAATCGT
TCGCGCGTTG GCGCAGGCGG AGCTCATCTT TGTTTTATGG CTTCAAATGG CATCTCTTCA
ATTTCATCCA TTATGTTCAG AACACCTCAT GTAGAGGCGG CAGCTGAGTA TGACGGTGCT
GTTGATTTGA TTTTTGAGGC AGTTAACGAG ACTTGGCAAG GTAGAACCAA ACCAAAACTC
ATGGTTAGAG ACATTCTCTA CAGAGACTTT ACCGATGAAG ATGATGGTCC CATTGAAGCG
GACCTTCTCG GCAGTGCTGT TGACTTGCCA CTTGTGGTTA CCAAAGAGGC TACAGAGACT
GAAGAAGCGT CTCAAGAACA GTCGCAACAA AAACGCCATG AGCTGGAGAG CCTGCCTTAT
GAGGAGCTCA CTCAGACCCT GGTTACATCC ATGATTGGTT CAAGAGAACT CTTGCCTCTA
CAAAAGAAAA CATTAGAGGT TCTCTCTCAA GGAGAATCCT GTCTCTCTGT TATGGCAACA
GGTAGAGGTA AATCACTCAT TTTCCAAGTT TACGCTGCTT GTGTTGCACT TCTGCAACAC
AAGATGAGCG TATTTGTTTA CCCACTGCGT GCCCTCATCA ACGACCAGGT TCAAAGTATG
AAGAATACCT TTGAGCCTCT GGGAATTTCT GTTGAGGTGC TTAACGGAGA AACAGACCTT
ACCAGTCGAG AAGACCTGTT TGCTCGCATG GCAAACAATG AGCTTGATAT TGTGCTGACT
ACACCAGAGT TTTTCTCGCT CCATGCTGAT CAGTTTGCTG CAGCGCATAA CATTTCGTTT
GTGGTGTTTG ACGAGGCCCA TCACGTGGGA ATGAATGCTT CTGAGGGCAG ACTCGCCTAC
GCACAGATGC CACAGGTACT TAAGATGCTT GGAAATCCTC AGGTACTTGC CACAACGGCT
ACGGCTACCA CGCAGGTAGC CCAGCGTATC TGTGAACTTC TCTCTATTGA TGCAGACCAC
GTATATAAAG ACAAAACTGC TCGTACTAAC CTTGAACTTA AGGATCTTCG TAGTGCAAAA
GATAGAGAGT GTGCTCTACT TTCTCTTGTG TCTGATGGCA CAAAGACGGT TGTCTACGTT
AATTCAAGGG AGCAAGCGCA GGCTTTGACG CGCATGCTGC GTCATCGCAT TCCAGAAATT
GGTCATAAGG TTGCGTTTTA TCATGCGGGT TTAACGCCTC AGATACGTAA GAACGTTGAA
CGTGCATTTA GACGCGGTGA CGTTTGCTGC ATTATTGCTA CAAGTGCGTT TGGCGAGGGC
GTTAATATCT CCGATATTCG CCAGGTCATT TTGTATCACC TGCCGTTTGG TCGTGTTGCA
TTTAATCAGA TGAGTGGACG TGCAGGTAGA GATGGCAAGC CTTCTAAGAT TTATATGCTT
TTTGATGCAC ATGATATGCG CGTTAACGAG CGTATTCTTT CTTCTCGTGC GCCTCAGCGA
GAGGCCCTTG TTGCAGTTTA TCGTGCGCTT ACTGCATTGC AACCAAAGCT TGTCGCACAG
CCAACTAATC CCGCACTTAC TGATGAAGAT ATTGCACAGA TGGCGCTTGA AATTGATCCA
AGATCTAAGG CCGATGAACA AATCGTTCGT GTTGCACTTG ACGTTTTTGT TGAGCTAGGG
TTTATTAGCA TTGAGGGATT TGATCAAACA CGAGTAATTT CCGTCAATAG TCAAGCTTCG
CACATGGACT TGTTGGAGTC TGCTCGCTAC GCTGAAGGAC TTAAAGCGCA CACTGAGTTT
GAATCATTTG CTCAGTGGGT CTTTTCTGCT TCACTAGATG AGCTGGCAGA TGCTATAACA
AAACCCATCG TTCCAGAGAT TGGTCCAATA TTTGATGGGG GGAGGGAGTC CTCTAATGAG
TGA
 
Protein sequence
MPGIEKSTRW SVLTQDVYLE SLFEKELHVT PLVARVLVAR GFTDVAEAQE FLSPSLQRDW 
LDPQCIPGMA EVANRVQKAI VEQQKIAVFG DFDVDGMSST CLLTLALRRL GADVLPYIPH
RFGEGYGLSK EALARVLLDR KPDLIITVDN GIAAAQEVAW LLEQGIDVVI TDHHEPADLV
PQGVPVTDPK LIEDCPSREL AGAGVALKLV QVLGQLQNQP QLWLDYIDVA MLGTLSDMMM
LNKENRALVS EGVKRLQKGL RPGLVALAAV AGQDIAQISA DNLPFSIIPR LNAAGRMGTT
DIALDLLLTE DAEEATILAG KLEEINAERR AIEAKLTDEA LEQAKEIYDG GHAIVLAKEG
WHEGVKGIVA SRIVNRYHVP CILFTIQDGV ARGSGRSVGS VDLFHAVEQC ADLTVRFGGH
QGAVGVTVET SKIDAFRKRL SEVMAELPEE EFESSGEVTA LVNLDEITIN SIDALEALQP
FGQGNKKPLF GVKGVVMKNR SRVGAGGAHL CFMASNGISS ISSIMFRTPH VEAAAEYDGA
VDLIFEAVNE TWQGRTKPKL MVRDILYRDF TDEDDGPIEA DLLGSAVDLP LVVTKEATET
EEASQEQSQQ KRHELESLPY EELTQTLVTS MIGSRELLPL QKKTLEVLSQ GESCLSVMAT
GRGKSLIFQV YAACVALLQH KMSVFVYPLR ALINDQVQSM KNTFEPLGIS VEVLNGETDL
TSREDLFARM ANNELDIVLT TPEFFSLHAD QFAAAHNISF VVFDEAHHVG MNASEGRLAY
AQMPQVLKML GNPQVLATTA TATTQVAQRI CELLSIDADH VYKDKTARTN LELKDLRSAK
DRECALLSLV SDGTKTVVYV NSREQAQALT RMLRHRIPEI GHKVAFYHAG LTPQIRKNVE
RAFRRGDVCC IIATSAFGEG VNISDIRQVI LYHLPFGRVA FNQMSGRAGR DGKPSKIYML
FDAHDMRVNE RILSSRAPQR EALVAVYRAL TALQPKLVAQ PTNPALTDED IAQMALEIDP
RSKADEQIVR VALDVFVELG FISIEGFDQT RVISVNSQAS HMDLLESARY AEGLKAHTEF
ESFAQWVFSA SLDELADAIT KPIVPEIGPI FDGGRESSNE
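
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup with a simple GC-fraction helper. The function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCCGGCA TCGAGAAAAG CACTCGCTGG AGTGTTCTTA CGCAAGACGT ATATCTAGAG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same helpers to the full gene sequence should give values comparable to the Gene Length and GC content fields, assuming the listed GC content was computed over the gene sequence alone.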