Gene Apar_0938 details

Gene Information

Locus tag: Apar_0938
Symbol:
ID: 8413809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1052258
End bp: 1054192
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 49%
IMG OID: 645022526
Product: excinuclease ABC, C subunit
Protein accession: YP_003179958
Protein GI: 257784741
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
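
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1052258, 1054192)
print(length_bp)                  # 1935
print(protein_length(length_bp))  # 644
```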


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCATTA GAGAGCAGGT TGATCAGGTT CCCACTGATC CAGGCTGTTA TCTCTGGAAA 
GACGGCTCTG GCAAGGTAAT CTATGTAGGC AAGGCTAAAA ACCTGCGTGC TCGCATGAGA
CAATATGTCA CGCTTCAAGA TGACCGTGCC AAGATTCCTC TGATGATGCA GGTCGTGCGC
AGTTTTGACT ACATTGTGGT AGAAAACGAG CATGAGGCAT TGGTGCTGGA GCGCAATCTC
ATCGCTCAGT ATCGTCCATA CTTTAACGTG GACTTTAAAG ACGATAAGAG CTATCCCTAC
ATTGCTATTA CCGAGTCAGA CACCTTCCCG GCAATTAAAT ACACGCGCGA GAAACACAAG
AAGGGTACGC GCTATTTTGG TCCCTATACC GATTCTTACG CGGCAAGACA AACCATCGAG
ACGCTCAGAA AAGTAGTGCC CATCTGCTCT GCTACGTGCG TGGAGTGGAA GCGTGCCAAG
CGCCTGCTTG AAAAAGATCC CGATGGCGCA GCTGTTGCTA ACTTGCTGCT GGCAAAGAAG
GGGAGACCTT GCTTTGATTA CCACGTAGGC AAGGGTCCTG GCGTGTGCGT TGGTGCTATT
GACACCGTTT CTTATGCTAA GAACGTTAGA CAGGTAGAAA ACTTCCTTCG CGGAAATCGC
TCCGAGATTG TTTCTGAGCT CAAAGATCAG ATGCAAGAAG CCGCTGCTGA CTTGGATTTT
GAAAAGGCAG CTAGGCTCAA GTCCCGTCTG CGTTCACTTT CTGATCTTGA TGACCGTCAG
CAAGTCACGT TTCCTACCTC GGTTAATATT GACCTCATTG GCATTTACCG AGAAGAAACC
ATCTCTGCAG CCTGCGTGTT TGTTGTGCGA GAAGGCCGTA CCATTCGCTC GGTTGAGTTT
ATCCTGGACA AAGGTTTGGA CGTTTCCGAG GAAGAGCTGG TTTCGGGCTT CCTAAAGCGC
TACTACGACG AAACCGCTGA CATTCCCGCA GAGGTTAACC TGCCCATCGA TCTTCTCGAT
GCGGAGGTTT TGTCCGAGTG GCTGACACAA AAGCGAGGTC ACAACTGCGT ACTTCACCAC
CCACAACGTG GCGAGAAATT CCGTCTACTT CAGATGGCTT CTGCTAATGC TCGTCACGCC
CTAATGCGCC ATATGATTCG TACAGGCTAT GCCGATGACC GCACTAACCA GGCGCTTCTT
CAGCTTGAAA GTGCACTTGC GCTTGATGCG CCGCCACTGC GCATTGAGTG CTTTGATATC
TCTACGCTTC ATGGAAACTT TACCGTTGCG TCTATGGTGG TCTTTACTAA CGGAAAGGCT
GATAAAAGCC AGTATCGACG CTTTAAAATC CGCGCTGAGC TTGATGAAGC AAACGACTTT
GTCTCAATGA CAGAGGTTCT GGGCAGAAGA TATAGTCCGG AGCGCATGGC AGACGAACGT
TTTGGCTCAA GGCCTGATTT GCTGGTAGTT GATGGCGGCA AACCTCAGCT GACCGCCGCA
ATCAATCAGC TCAATGAACT GGGATTAGAT ATTCCTGTCT GCGGCCTTGC TAAGTCTGAT
GAGGAAGTCT TTGTACCTTG GGATGACACG CCAATCGTGC TGCCTACAGG GTCAGCTTCT
CTGTATCTTA TTAAGCAAGT CCGTGATGAG TCTCACCGTT TTGCAATTAC CTTTCATCGT
GAGCTTAGAG ATAAGGCAAT GACCGTCTCT ATTCTGGACG AAATCCCTGG TGTAGGACCT
AAACGTAAAA AAGATATCAT GCGCCATTTT GGCTCTTTTA AGCGCTTAAA GGCTGCAAGT
GTTGAGGACA TCTCTCAGGT AAAAGGTGTT TCTGTGAACT TAGCAGAAAC CATCTACAAA
GAACTTAAAG CTTGGGAAGA ATCTTCAACA GCTGTACATG AGAAGTTAGA TGCAAGGGGA
GGAACTCATG AGTAA
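
A GC content figure like the 49% reported for this gene can be recomputed from the sequence text. The sketch below is one plausible way to do it, not necessarily how the database derived its value; `gc_percent` is a hypothetical helper, and for brevity it is applied only to the first 60 bases of the gene sequence rather than to all 1935.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between 10-base groups."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence: 29 of its 60 bases are G or C.
first_line = (
    "GTGTCCATTA GAGAGCAGGT TGATCAGGTT CCCACTGATC CAGGCTGTTA TCTCTGGAAA"
)
print(round(gc_percent(first_line), 1))
```

Passing the complete gene sequence to the same function gives the gene-wide figure.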
 
Protein sequence
MSIREQVDQV PTDPGCYLWK DGSGKVIYVG KAKNLRARMR QYVTLQDDRA KIPLMMQVVR 
SFDYIVVENE HEALVLERNL IAQYRPYFNV DFKDDKSYPY IAITESDTFP AIKYTREKHK
KGTRYFGPYT DSYAARQTIE TLRKVVPICS ATCVEWKRAK RLLEKDPDGA AVANLLLAKK
GRPCFDYHVG KGPGVCVGAI DTVSYAKNVR QVENFLRGNR SEIVSELKDQ MQEAAADLDF
EKAARLKSRL RSLSDLDDRQ QVTFPTSVNI DLIGIYREET ISAACVFVVR EGRTIRSVEF
ILDKGLDVSE EELVSGFLKR YYDETADIPA EVNLPIDLLD AEVLSEWLTQ KRGHNCVLHH
PQRGEKFRLL QMASANARHA LMRHMIRTGY ADDRTNQALL QLESALALDA PPLRIECFDI
STLHGNFTVA SMVVFTNGKA DKSQYRRFKI RAELDEANDF VSMTEVLGRR YSPERMADER
FGSRPDLLVV DGGKPQLTAA INQLNELGLD IPVCGLAKSD EEVFVPWDDT PIVLPTGSAS
LYLIKQVRDE SHRFAITFHR ELRDKAMTVS ILDEIPGVGP KRKKDIMRHF GSFKRLKAAS
VEDISQVKGV SVNLAETIYK ELKAWEESST AVHEKLDARG GTHE
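
The gene begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons and is read as methionine at the start position. The following minimal sketch illustrates this on the first 30 and last 15 bases of the gene sequence. The `translate` function is a hypothetical helper written for this illustration; it treats the first codon of whatever string it receives as the initiation position, which is a simplification.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11; each is read as Met at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGTCCATTA GAGAGCAGGT TGATCAGGTT"))  # MSIREQVDQV
print(translate("GGAACTCATG AGTAA"))                  # GTHE
```

Both outputs agree with the first ten and last four residues of the protein sequence above.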