Gene Apar_0961 details


Gene Information

Locus tag: Apar_0961
Symbol:
ID: 8413832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1083159
End bp: 1086218
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 42%
IMG OID: 645022549
Product: LPXTG-motif cell wall anchor domain protein
Protein accession: YP_003179981
Protein GI: 257784764
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID:
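
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon; the record does not state either convention explicitly, so treat both as assumptions.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1083159
end_bp = 1086218

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (3060 bp) and Protein Length (1019 aa).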


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.796478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAAA AACGTCTATG TAACTTCCTG GGACTTGTGC TTGCACTCAG TCTCATGATT 
CCAAGCGGTG CCATTAAGGC AATTGCTGAG GAAGTTACTC CGGCTCAGAC AAGTTCTGAG
ACTACGGCAA GCAGTCTAGT TACTACAACT TCTGAGTCTG AGAAAAAAGA AGTTGACGCT
AACGCTGTTT CATCTGAAGA GAACTCTCAG ACAGTTTCTG AGAATCAACC CGTACAGTCT
GTACAAGAAG ATGAGCAGGC TAATGGAACG CGAGCAGGCC CTGTAACAGA ACTTGTTCCT
GATGCTTCTT TCTTTAGCGC TTCTGCTCTT ACTACTTCAC TTTTAGCTGT TTCTGGTACA
CCTAAAGCGG TTGATGTAAA TATTACAGAT TTTAAAATTC AGAATGTAGA TCATCAAGAT
GTAGATACCG TGTTTCATTC TGATACTTTT CTTTTAAACA TGAAATGGAA TGCAACAGGA
CACGGTACTA ACTTGCATGA GGGAGATTAT TTTGATGTCG ATCTTCCAAA CAATATGAAA
TTCCCATCTG GTACAACAAA GACTGATTTT GATATTACCG ATGATGATGG AAACGTTATT
GCGCATGCGC ATATTACTCC TGGCCCTAAT GATACTGGTG GTAAAATTCA CGTTACCTTT
GGTCCTGCTG TTGAGAATAA ATACAATGTA AAAGGCACCA TGAACATTGC TGCGCGTTTT
GATCAAACAA AGATTAATAA AGATTCTGAA AATAAATTTG AAGTAACGGT TAATGGTGAC
GTACCAGGAA AAAAGCATAC TGCAAATACG GGCGTAAAAA TTAATGGTTC AAAGCCCATT
GATGAAGAAT ACCTTACTAA GTGGGGACAG GGCACAGGCA TTGCAGGTGA CACTAAAGCT
GAGTGGTGGT CTCGTATTAA TTTTAGTAAG GCCAATCTTA CTAATGCAGT AGTTACTGAC
ACTCTTGGCT CTTCCGGAAT GACTTATATC AAAGATTCTT TTAAACTAAG AAAAGTTGTG
TATGACCAAT ATGGTGAAAC AACTCAGGTT CTAAAAGAGT ATACGGTTGA CGAGCTTATC
AATAGTGGAA TGTTGACATT CTCTGCAGAT ATGAGTTCAT TTACTCTTAA TTTGGGTAAT
ACAAGTGATC AGTATCGATT GGTATATAGA TCTACTTATG TTCCTGGTAC TACACTTAAG
AATCGTATGA GGCTTGATTC CGATCAGAAA CAAAAACAAG TAATTGCGAG CCATAAGTCA
GCTGAGACTG GCGGACATGG TAGTGGTGAC CTTGCAAGTA AAATTAAGAT TGTTAAGGTA
GACGAAGACG GTACCACACC TCTTAAGGGT GCAGTATTTA CCGTCACTAA ACCAGACGGC
ACTACTTTTG AGCTCACTAC TGGTGCAGAC GGTACAATCG TCTCTGATTT ATTGGAGCAA
GGCACTTATA AGGTTAAAGA AAAGACCGCT CCACAGGGTT ATGAGCTTAG TGATGAAGAG
TACACCCTTG AAGTAACTCC TACAGGCGGT GCAATCAAAA AGATTTCCAA TAAACCAATT
AAGATTTCTG TAAACGTTAC CAAGAAGTGG ATTGGCCCTA AGGCAGGACC TGTCGCCGTA
CACTTACTTG CTGACGGTAC CGATACTGGC AAGACCTTGA CTCTTGATGA GGCAGGAAAC
TGGACGGGTA CTTTTGATAA CCTTCGTAAG TGCAAAGCTG ATGGTACAGA AATCGTCTAT
ACTGTCAAGG AAGATGATGT AACTAACTAC ACAGGTGCGG TAACTGGTGA TGCTGCTTCT
GGCTTTACTA TCACCAATAC CAATACCGAG AAGACCAGTA TCTCTGGTAC TAAGACATGG
GATGATAGTA ATAACCAAGA TGGTAAGCGT CCTTCATCTA TTACCGTTAA CCTTCTTAAG
AATGGCACTA AGGTAGACTC TAAGACAGTT ACTCCTGATG CATCCGGTAA TTGGACATAC
ACCTTTGATA ACTTGGATAA GTATGATTCA ACTACAGGAG CAGAGAATAC CTATACAGTT
TCTGAAGAAG CGGTAGATGG TTATACCTCA ACCGTGACAG GAACTGACAT CAAAAACTCT
TATACTCCTG AGGTTACAAC CGTCAAAGTT TCTAAGACTT GGGTTGGACC TAAGGCAGGA
CCTGTCACCG TACACTTACT TGCTGACGGT ACCGATACTG GCAAGACCTT GACTCTTGAT
GAGGCAGGAA ACTGGACGGG ATCTTTTGGC AATCTTCCTA AATATAAAGA TGGTAACGTT
ATTGCTTACA CCGTCAAAGA AGATGATGTA ACTAACTACA CAGGTACGGT AACGGGCGAT
GCTACTTCCG GTTTTACCAT CACTAATACT AATACCGAGA AGGTAGATGT TCCTGTAACC
AAGACTTGGG TTGGACCTAA GGCAGGACCT GTCACCGTAC ACTTACTTGC TGACGGTACT
GATACTGGAA AGACACTCAC ACTTGATGAG GCTGGCAACT GGACAGGTAC CTTCTCTGGC
TTGGATAAGT ACAATGCTGA TGGTACAGAG ATCGCCTATA CCGTCAAGGA AGATGACGTT
GCCAACTATA CGAGTGAAAT AACTGGTGAT GCTACTTCTG GCTTTACTAT CACCAATACC
AATACTGAGA AGGTTGACAT TTCTGTAACT AAGACTTGGG TTGGAGATAA GGGTTCATCT
GTCACCATCC ACTTACTCGC TGATGGTACT GATACGGGTA AAATTTTGAC TCTTGATGAG
GCAGGAAACT GGAAAGGCGC GTTCTCTGGC CTAGATAAGT ACAACGCTGA CGGTACAGAG
ATTGCTTACA CAGTCAAAGA GGATGAGGTA GCAGGTTATA CATCTGAGGT TTCTGGTGAT
GCCACTTCTG GATTTACCGT AACCAACACT CAGATTCCAC CTAATACACC TCCTAGCACT
CCTAAGAAGA AGCTTCCTTA CACTGGTGAT GTAAGCACAC TCGCATCTAT TGCTTCATTT
GTTGCAGGAA GTGTTGCACT ACTTGGCACA GGAATGGCTT TAGGAAGAAA GCGTAAGTAA
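
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before computing anything from it. As a minimal sketch, the hypothetical helper `gc_content` below (the name is an illustration, not part of the record) joins the blocks and returns the GC fraction; running it on the full sequence is one way to compare against the GC content field.

```python
def gc_content(seq_text: str) -> float:
    """Return the G+C fraction of a sequence given as whitespace-separated blocks."""
    # Remove spaces and line breaks, and normalise case.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, used here only as a small input.
first_line = "ATGATGAAAA AACGTCTATG TAACTTCCTG GGACTTGTGC TTGCACTCAG TCTCATGATT"
print(f"{gc_content(first_line):.1%}")
```

A single line is not expected to match the gene-wide figure; paste all sequence lines into one string to get the value for the whole gene.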
 
Protein sequence
MMKKRLCNFL GLVLALSLMI PSGAIKAIAE EVTPAQTSSE TTASSLVTTT SESEKKEVDA 
NAVSSEENSQ TVSENQPVQS VQEDEQANGT RAGPVTELVP DASFFSASAL TTSLLAVSGT
PKAVDVNITD FKIQNVDHQD VDTVFHSDTF LLNMKWNATG HGTNLHEGDY FDVDLPNNMK
FPSGTTKTDF DITDDDGNVI AHAHITPGPN DTGGKIHVTF GPAVENKYNV KGTMNIAARF
DQTKINKDSE NKFEVTVNGD VPGKKHTANT GVKINGSKPI DEEYLTKWGQ GTGIAGDTKA
EWWSRINFSK ANLTNAVVTD TLGSSGMTYI KDSFKLRKVV YDQYGETTQV LKEYTVDELI
NSGMLTFSAD MSSFTLNLGN TSDQYRLVYR STYVPGTTLK NRMRLDSDQK QKQVIASHKS
AETGGHGSGD LASKIKIVKV DEDGTTPLKG AVFTVTKPDG TTFELTTGAD GTIVSDLLEQ
GTYKVKEKTA PQGYELSDEE YTLEVTPTGG AIKKISNKPI KISVNVTKKW IGPKAGPVAV
HLLADGTDTG KTLTLDEAGN WTGTFDNLRK CKADGTEIVY TVKEDDVTNY TGAVTGDAAS
GFTITNTNTE KTSISGTKTW DDSNNQDGKR PSSITVNLLK NGTKVDSKTV TPDASGNWTY
TFDNLDKYDS TTGAENTYTV SEEAVDGYTS TVTGTDIKNS YTPEVTTVKV SKTWVGPKAG
PVTVHLLADG TDTGKTLTLD EAGNWTGSFG NLPKYKDGNV IAYTVKEDDV TNYTGTVTGD
ATSGFTITNT NTEKVDVPVT KTWVGPKAGP VTVHLLADGT DTGKTLTLDE AGNWTGTFSG
LDKYNADGTE IAYTVKEDDV ANYTSEITGD ATSGFTITNT NTEKVDISVT KTWVGDKGSS
VTIHLLADGT DTGKILTLDE AGNWKGAFSG LDKYNADGTE IAYTVKEDEV AGYTSEVSGD
ATSGFTVTNT QIPPNTPPST PKKKLPYTGD VSTLASIASF VAGSVALLGT GMALGRKRK
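
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is an illustration only: the function name `translate` is hypothetical, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch ignores).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    # Strip the block spacing and line breaks used in the record.
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATGAAAA AACGTCTATG TAACTTCCTG"))
```

The first ten codons translate to `MMKKRLCNFL`, which matches the first block of the listed protein sequence.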