Gene Apar_0946 details

Gene Information

Locus tag: Apar_0946
Symbol:
ID: 8413817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1063910
End bp: 1066150
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 42%
IMG OID: 645022534
Product: excinuclease ABC, B subunit
Protein accession: YP_003179966
Protein GI: 257784749
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
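
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_span` and `expected_protein_length` are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
START_BP = 1063910
END_BP = 1066150
GENE_LENGTH_BP = 2241
PROTEIN_LENGTH_AA = 746


def cds_span(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(cds_span(START_BP, END_BP))               # 2241
print(expected_protein_length(GENE_LENGTH_BP))  # 746
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.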


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTCAT CTCATGCTGA AGATCAACAT AAAGAGTATA TAGGCGCTGA GCTGCGTCGA 
TTTGGTCGTG AAGCAGGAGA AGAGAAGCTT CAGGTAGTAT CTCCTTTTGA GCCAGCAGGT
GATCAGCCGC AGGCAATAGA AAAGTTGGCT CAAGGAATTG AAGATGGGCT TAGGTATCAG
ACACTCTTAG GTGTTACGGG CTCAGGTAAG ACTTTCTCTA TGGCTAAGAC TATCGAGAAA
CTCAATCGCC CCACGCTTAT TATGGAACCT AATAAAACGC TGGCCGCTCA GGTTGCTTCA
GAGATGAAGG AACTGTTTCC TAACAATGCT GTTGTGTACT TTGTCTCATA TTATGATTAT
TACCAGCCAG AAGCGTATGT TCCGCAGTCA GATACGTATA TAGAAAAAGA TGCTTCCATT
AATGAAGAGG TAGAGAAGCT CCGTCACCAG GCAACTTCGT CTCTTTTGTC CAGGCGTGAT
GTCATTGTTG TTGCATCTGT ATCGTGTATT TACGGTATTG GTAGTCCACA AGATTATGCT
GGTCTTGCTC CTAATGTAGA TAAATCTGTT CCTCTTGAGC GCGATGATTT TATTAAAGAT
CTTATTAATG TTCAGTACGA TAGAAATGAT TATGACTTGC AGCGTGGCAT GTTTAGGGTT
CGTGGAGACG TTGTTGATGT ATTTCCGCCA TATGCAGAAA ATCCGTTGCG TATTGAGTTT
TTTGGTGACG AAGTAGAAAG CATTTCAGAG GTTAGTACTG TAACAGGCGA GGTGCTTAGA
GAGTTTGACG CTATCCCAAT TTGGCCAGCT TCTCACTATG TTACTGAGCG CCCAAAGATT
ACTCATGCTC TTACAACCAT CTCTGAAGAG ATGGAAGCTC GTGTTAAAGA GCTTAAAGAA
AACGATAAAC TATTAGAAGC CCAACGTCTT GCTCAGAGAA CAAACTATGA TTTAGAGATG
CTTGAGACCA TGGGTTACTG CAATGGTATT GAGAACTATT CCCGCCATTT GGACGGCCGT
GCTCCAGGTG AGCCTCCTTA TACACTCATT GACTATTTCC CTAAAGATAT GATTTGCATT
ATCGATGAGT CACACGTTAC TGTTCCTCAA ATTAGAGGTA TGTATGAGGG TGATAGATCT
CGTAAAGTTA CGCTGGTAGA TCATGGATTT AGACTTCCTT CTGCGCTTGA CAACAGACCA
CTCAGATTTG ATGAGTTTGA AGGACGTATC CCTCAGTTTA TTTACGTATC TGCAACTCCT
GGAGACTATG AAGAGACAGT GGCTCAGCAA CAAGTTGAGC AGATCATCCG TCCAACTGGT
CTTTTGGATC CAAAGATTGA TGTCAGGCCA GTAAGAGGTC AGATTGACGA TCTTATTTCA
GAGGTTAAAG AGCGTGTGGC AAAAAAAGAA CGCGTTCTTG TTACTACACT TACTAAACGT
ATGGCAGAAG ACCTTACAGA TCACCTGTTA GATGAGGGTA TTAAAGTCAA CTATATGCAC
TCAGATACTG CCACACTTGA CCGTGTAGAA ATCATTAGAG ATTTGCGTCA GGGTAAGATT
GACGTACTGG TTGGAATTAA CCTTTTACGA GAAGGACTTG ATATTCCTGA GGTTTCTCTT
GTTGCTATTC TTGATGCAGA TAAAGAAGGC TTCTTAAGAA ATAGAAGATC TCTCATTCAG
ACTATGGGTA GAGCAGCAAG AAATGCTTCT GGTCAGGTTA TTATGTACGC CGATAAAATT
ACCGATTCCA TGCGTATTGC TATTGATGAG ACAAAAAGGC GTCGAGAGCT TCAGGAAGCA
TTTAACAAAG AGCATGGTAT TGTGCCTAAA ACTGTTAAGA AATCTATTAC TGATATTGCA
GGCTTTATCG CTGAAGCCTC AGAGAATATT GATAAGCGTA AGCGCAAAAA TGGAGAGTTT
TATACCGCTT CCAATGATGA AGATTCGCTT GAGGAACAGC AAGAATCAAT TCTTGAGATG
CCTGCAGAGT TGCTTACTGA AGAGCTTCAA AATCTCCCTC GTTCAGAAGT TGAAGCTATG
CTTTCTGGTA TGGAAGCAGA GATGGCAGAA GCTTCTGCGT CTATGGATTA TGAGCACGCG
GCAGAACTCA GAGACCAGAT TGTTGCCATC AGAAGCCAGC TTGAGGGTAC TACCTCAGAT
GATGTTATTA AAAGGCTTAA GACAGGTGCT AGAAAGGGTA GCGCACACGC TACTCGTAGA
CGCTATCGAG GTAAGCATTA G
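
The GC content field in the gene information can in principle be recomputed from a nucleotide sequence like the one above. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the percentage is taken over the gene sequence alone, which the record does not state explicitly.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used here only as a short demo input.
first_line = "ATGGACTCAT CTCATGCTGA AGATCAACAT AAAGAGTATA TAGGCGCTGA GCTGCGTCGA"
print(f"{gc_content(first_line):.1f}%")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the rounded percentage in the record.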
 
Protein sequence
MDSSHAEDQH KEYIGAELRR FGREAGEEKL QVVSPFEPAG DQPQAIEKLA QGIEDGLRYQ 
TLLGVTGSGK TFSMAKTIEK LNRPTLIMEP NKTLAAQVAS EMKELFPNNA VVYFVSYYDY
YQPEAYVPQS DTYIEKDASI NEEVEKLRHQ ATSSLLSRRD VIVVASVSCI YGIGSPQDYA
GLAPNVDKSV PLERDDFIKD LINVQYDRND YDLQRGMFRV RGDVVDVFPP YAENPLRIEF
FGDEVESISE VSTVTGEVLR EFDAIPIWPA SHYVTERPKI THALTTISEE MEARVKELKE
NDKLLEAQRL AQRTNYDLEM LETMGYCNGI ENYSRHLDGR APGEPPYTLI DYFPKDMICI
IDESHVTVPQ IRGMYEGDRS RKVTLVDHGF RLPSALDNRP LRFDEFEGRI PQFIYVSATP
GDYEETVAQQ QVEQIIRPTG LLDPKIDVRP VRGQIDDLIS EVKERVAKKE RVLVTTLTKR
MAEDLTDHLL DEGIKVNYMH SDTATLDRVE IIRDLRQGKI DVLVGINLLR EGLDIPEVSL
VAILDADKEG FLRNRRSLIQ TMGRAARNAS GQVIMYADKI TDSMRIAIDE TKRRRELQEA
FNKEHGIVPK TVKKSITDIA GFIAEASENI DKRKRKNGEF YTASNDEDSL EEQQESILEM
PAELLTEELQ NLPRSEVEAM LSGMEAEMAE ASASMDYEHA AELRDQIVAI RSQLEGTTSD
DVIKRLKTGA RKGSAHATRR RYRGKH
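
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table; the names `CODON_TABLE` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Since this gene begins with ATG, the sketch makes no special case for alternative starts.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G,
# with the third base changing fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, dropping a single trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGGACTCAT CTCATGCTGA AGATCAACAT"))  # MDSSHAEDQH
print(translate("GGTAAGCATTAG"))                      # GKH
```

The two outputs match the first ten and the last three residues of the protein sequence.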