Gene Apar_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0804 
Symbol 
ID8413669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp885786 
End bp888173 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content40% 
IMG OID645022386 
ProductComEC/Rec2-related protein 
Protein accessionYP_003179824 
Protein GI257784607 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.569588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTAAGA CGATGGAACA GCCCTTTCGT CCTTTTATTC CGTATAGCCT ATGTGCGCTG 
CTTTTTGTTT GTTTATGGCT TCAAGTTTTT ATTGCAAGAA GCGCAACGCT CGATACAGTT
CTGATTCATA TTCTAGGAAT CTGTGCGGTG GTTGCTTTGG GATTTTGGTA TGTAAAGCAT
GCGCATACCA GCATTATGAC TGTTGGGGTA GTTTTACTCT TACTTGTAAT TATCCTTATC
AGGTCTTTCT TTATATATGA CAGTCAAATG GCGTCAGTGA ATCTTTTAGA GTCAACTTCC
GTACATGATT TTGAACTAGT TGTATCAAGA GATTTAGTCT ACAAAAATCA CACATGGATT
GGTCAAGCAA ACGTCATCTA CCAAGGTAGA TGCATTGGAT GTGTGGGACT ACATACCAAG
GAACAGTTCT TACGAGAAAC ACATCTGTGC TGTGAAGGTA GGTTTACACG ATATAAAGAT
AAAGATTTTT CAGAAGAACA ATTTAAGAGG GGAGTACTTG GTTCAATTCA GGTAACCAAG
GTGAATTCAA AAACCTATGA AGAGGGAATT GTTGGGGAAG TTGCCCAATT TAGAGAAAAC
TGTATTTCTC AACTTCAACC AGAGCTGAGC TTTGGGCGAG CGCTAGCTGC ATGCGGTCTT
TGTGCCTATA GGCCCAGCCT GTATAGCTTT AACATTCCGA GAATATTTAT GCGCTGCGGC
CTTATGCACC TCATTGCAAT TTCTAGTTGT CACATTGTAA TTCTCTCTGC CTATATTGAT
GCTTTGTTTA AAAAACTTAC ATTAAAGCCT CTTTTGAGGG GAGTGTTAAA TTTATTTCTA
TTAAGCAGTT ATGCACTTTT CTGTGGAGTG CCTGCGTCTG CATTGAGGGC AATACTCCTT
GTTGAGCAAA AATACATTAT GCAGTGTGTT GGAAGGAAGA ATCATACGTT ATCTGCTGTT
TCTATTGTTG CATTGCTTAT GCTTTGTGTT GATCCACAAC TGAGCGTAAA TATTGCTTTT
ACATTGTCTT TAACTTGTAT TTTAGGAATG AATATATATG GCGGTCTTGC TCAATATTAC
GCTAAAACTG CATTTCATTT ATCTGGATAT GGAGCAGTAC AAAAGGTGCT GAGAAAGCCA
TTTTCTAGTG TACGAAACAC AATGTGCGCA ACAACAGTTG CTCAGTTTTC ATGCCTTCCA
ATTTCATGTG TGTGCTTTGG CCATTTTTCA TTACTCGCAC CTCTTTCAAG TGCACTGATT
ACAAGTCCAT TTTCACTTCT CAGTCTATTT GGAATTGGAG CAATAATTTT AAGCAGTATT
AAACCAGCTC AAGATACTGT TCTCTTTTTT ATAGATTTTC TTGGGAAGCT TATAGAGACT
CTTGCATCTT TCTTATCAGA AAGACCTTTT GCAAGTGTGT TTGTAGCCGA CATTGGTTGG
ATAGTCTCAT TAGCTTTTAT TGCAGTAATG GTGGCGCTCT ATGTGTTATG GCCAAAGGGA
AGAAGAATAG TTTTAGTTGG CATATGTTTG GCAGTATCTA TGCTTATTGT CGGATTAAGT
ATCTATTGGA GGTTTTTCTC GCCAACAAGA ATTTGTGTGA TGGATATAGG TCAGGGAGAC
GCCATATTGC TGAGCGATGG CGTGCATTCT CTGCTTGTGG ATACTAGTGC AGGAGATGTG
GTTAACGATG CTCTGGAGCG ACAGCACGTC TCATATCTTG ATGCAATTTT GCTTACACAT
CTTGATGAAG ACCATGCTGG CGGCGTGAGA TATATGGTTG GCTCAGTAAA GGCAGGCCGT
GTATTAGTTG GAGAAGGAAT AACAAAACAA GAAAAGCCTG AGTTACAGAG AGCTATTCAG
AGGATTTCTG GTAGTGGTTC TTATGAAGTT TTATATGGCG ACGAATTTGA TGTTGGTCGC
TTTCATGTTC GTGTGGTGTG GCCACATAGG GGCTATAAAG CAAAAGAAGC CAATAATGCT
TCAGTCGAGC TGTATGTGAC ATATAACGAT GGACAAAATA CGTTAACTAC GCTTTTAACG
GGAGATGCTG AAAGAGATCA AACTAAAGAG ACGGTGACAT CGGGTGATGT GGGAGATATT
GATTTCTTAA AAGTTGGACA TCATGGTGCA GCAAAGTCAC TCTATCCAGC AACTGCTCAA
GTACTAAAAC CTGAGGTAGC GGTGGCAAGT GCTGGAAAGA ATAATCATTA TGGGCATCCT
AAACAGGAAG CGATAGATAT TTTAGAGGGT GTTGGTGCAA GGTTTTACTG CACAAAAGAT
TATGGAGACG TCACAGTATT TCCTGGAGAA CATGGACCTA AGGTGAGCGT GCAACATGCT
AAAGTAGATA CAGATTTAGA GGAGGAAAAA GATGGCGGAG CAGGCTAA
 
Protein sequence
MSKTMEQPFR PFIPYSLCAL LFVCLWLQVF IARSATLDTV LIHILGICAV VALGFWYVKH 
AHTSIMTVGV VLLLLVIILI RSFFIYDSQM ASVNLLESTS VHDFELVVSR DLVYKNHTWI
GQANVIYQGR CIGCVGLHTK EQFLRETHLC CEGRFTRYKD KDFSEEQFKR GVLGSIQVTK
VNSKTYEEGI VGEVAQFREN CISQLQPELS FGRALAACGL CAYRPSLYSF NIPRIFMRCG
LMHLIAISSC HIVILSAYID ALFKKLTLKP LLRGVLNLFL LSSYALFCGV PASALRAILL
VEQKYIMQCV GRKNHTLSAV SIVALLMLCV DPQLSVNIAF TLSLTCILGM NIYGGLAQYY
AKTAFHLSGY GAVQKVLRKP FSSVRNTMCA TTVAQFSCLP ISCVCFGHFS LLAPLSSALI
TSPFSLLSLF GIGAIILSSI KPAQDTVLFF IDFLGKLIET LASFLSERPF ASVFVADIGW
IVSLAFIAVM VALYVLWPKG RRIVLVGICL AVSMLIVGLS IYWRFFSPTR ICVMDIGQGD
AILLSDGVHS LLVDTSAGDV VNDALERQHV SYLDAILLTH LDEDHAGGVR YMVGSVKAGR
VLVGEGITKQ EKPELQRAIQ RISGSGSYEV LYGDEFDVGR FHVRVVWPHR GYKAKEANNA
SVELYVTYND GQNTLTTLLT GDAERDQTKE TVTSGDVGDI DFLKVGHHGA AKSLYPATAQ
VLKPEVAVAS AGKNNHYGHP KQEAIDILEG VGARFYCTKD YGDVTVFPGE HGPKVSVQHA
KVDTDLEEEK DGGAG