Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0961 |
Symbol | |
ID | 8413832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1083159 |
End bp | 1086218 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 645022549 |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003179981 |
Protein GI | 257784764 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.796478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAA AACGTCTATG TAACTTCCTG GGACTTGTGC TTGCACTCAG TCTCATGATT CCAAGCGGTG CCATTAAGGC AATTGCTGAG GAAGTTACTC CGGCTCAGAC AAGTTCTGAG ACTACGGCAA GCAGTCTAGT TACTACAACT TCTGAGTCTG AGAAAAAAGA AGTTGACGCT AACGCTGTTT CATCTGAAGA GAACTCTCAG ACAGTTTCTG AGAATCAACC CGTACAGTCT GTACAAGAAG ATGAGCAGGC TAATGGAACG CGAGCAGGCC CTGTAACAGA ACTTGTTCCT GATGCTTCTT TCTTTAGCGC TTCTGCTCTT ACTACTTCAC TTTTAGCTGT TTCTGGTACA CCTAAAGCGG TTGATGTAAA TATTACAGAT TTTAAAATTC AGAATGTAGA TCATCAAGAT GTAGATACCG TGTTTCATTC TGATACTTTT CTTTTAAACA TGAAATGGAA TGCAACAGGA CACGGTACTA ACTTGCATGA GGGAGATTAT TTTGATGTCG ATCTTCCAAA CAATATGAAA TTCCCATCTG GTACAACAAA GACTGATTTT GATATTACCG ATGATGATGG AAACGTTATT GCGCATGCGC ATATTACTCC TGGCCCTAAT GATACTGGTG GTAAAATTCA CGTTACCTTT GGTCCTGCTG TTGAGAATAA ATACAATGTA AAAGGCACCA TGAACATTGC TGCGCGTTTT GATCAAACAA AGATTAATAA AGATTCTGAA AATAAATTTG AAGTAACGGT TAATGGTGAC GTACCAGGAA AAAAGCATAC TGCAAATACG GGCGTAAAAA TTAATGGTTC AAAGCCCATT GATGAAGAAT ACCTTACTAA GTGGGGACAG GGCACAGGCA TTGCAGGTGA CACTAAAGCT GAGTGGTGGT CTCGTATTAA TTTTAGTAAG GCCAATCTTA CTAATGCAGT AGTTACTGAC ACTCTTGGCT CTTCCGGAAT GACTTATATC AAAGATTCTT TTAAACTAAG AAAAGTTGTG TATGACCAAT ATGGTGAAAC AACTCAGGTT CTAAAAGAGT ATACGGTTGA CGAGCTTATC AATAGTGGAA TGTTGACATT CTCTGCAGAT ATGAGTTCAT TTACTCTTAA TTTGGGTAAT ACAAGTGATC AGTATCGATT GGTATATAGA TCTACTTATG TTCCTGGTAC TACACTTAAG AATCGTATGA GGCTTGATTC CGATCAGAAA CAAAAACAAG TAATTGCGAG CCATAAGTCA GCTGAGACTG GCGGACATGG TAGTGGTGAC CTTGCAAGTA AAATTAAGAT TGTTAAGGTA GACGAAGACG GTACCACACC TCTTAAGGGT GCAGTATTTA CCGTCACTAA ACCAGACGGC ACTACTTTTG AGCTCACTAC TGGTGCAGAC GGTACAATCG TCTCTGATTT ATTGGAGCAA GGCACTTATA AGGTTAAAGA AAAGACCGCT CCACAGGGTT ATGAGCTTAG TGATGAAGAG TACACCCTTG AAGTAACTCC TACAGGCGGT GCAATCAAAA AGATTTCCAA TAAACCAATT AAGATTTCTG TAAACGTTAC CAAGAAGTGG ATTGGCCCTA AGGCAGGACC TGTCGCCGTA CACTTACTTG CTGACGGTAC CGATACTGGC AAGACCTTGA CTCTTGATGA GGCAGGAAAC TGGACGGGTA CTTTTGATAA CCTTCGTAAG TGCAAAGCTG ATGGTACAGA AATCGTCTAT ACTGTCAAGG AAGATGATGT AACTAACTAC ACAGGTGCGG TAACTGGTGA TGCTGCTTCT GGCTTTACTA TCACCAATAC CAATACCGAG AAGACCAGTA TCTCTGGTAC TAAGACATGG GATGATAGTA ATAACCAAGA TGGTAAGCGT CCTTCATCTA TTACCGTTAA CCTTCTTAAG AATGGCACTA AGGTAGACTC TAAGACAGTT ACTCCTGATG CATCCGGTAA TTGGACATAC ACCTTTGATA ACTTGGATAA GTATGATTCA ACTACAGGAG CAGAGAATAC CTATACAGTT TCTGAAGAAG CGGTAGATGG TTATACCTCA ACCGTGACAG GAACTGACAT CAAAAACTCT TATACTCCTG AGGTTACAAC CGTCAAAGTT TCTAAGACTT GGGTTGGACC TAAGGCAGGA CCTGTCACCG TACACTTACT TGCTGACGGT ACCGATACTG GCAAGACCTT GACTCTTGAT GAGGCAGGAA ACTGGACGGG ATCTTTTGGC AATCTTCCTA AATATAAAGA TGGTAACGTT ATTGCTTACA CCGTCAAAGA AGATGATGTA ACTAACTACA CAGGTACGGT AACGGGCGAT GCTACTTCCG GTTTTACCAT CACTAATACT AATACCGAGA AGGTAGATGT TCCTGTAACC AAGACTTGGG TTGGACCTAA GGCAGGACCT GTCACCGTAC ACTTACTTGC TGACGGTACT GATACTGGAA AGACACTCAC ACTTGATGAG GCTGGCAACT GGACAGGTAC CTTCTCTGGC TTGGATAAGT ACAATGCTGA TGGTACAGAG ATCGCCTATA CCGTCAAGGA AGATGACGTT GCCAACTATA CGAGTGAAAT AACTGGTGAT GCTACTTCTG GCTTTACTAT CACCAATACC AATACTGAGA AGGTTGACAT TTCTGTAACT AAGACTTGGG TTGGAGATAA GGGTTCATCT GTCACCATCC ACTTACTCGC TGATGGTACT GATACGGGTA AAATTTTGAC TCTTGATGAG GCAGGAAACT GGAAAGGCGC GTTCTCTGGC CTAGATAAGT ACAACGCTGA CGGTACAGAG ATTGCTTACA CAGTCAAAGA GGATGAGGTA GCAGGTTATA CATCTGAGGT TTCTGGTGAT GCCACTTCTG GATTTACCGT AACCAACACT CAGATTCCAC CTAATACACC TCCTAGCACT CCTAAGAAGA AGCTTCCTTA CACTGGTGAT GTAAGCACAC TCGCATCTAT TGCTTCATTT GTTGCAGGAA GTGTTGCACT ACTTGGCACA GGAATGGCTT TAGGAAGAAA GCGTAAGTAA
|
Protein sequence | MMKKRLCNFL GLVLALSLMI PSGAIKAIAE EVTPAQTSSE TTASSLVTTT SESEKKEVDA NAVSSEENSQ TVSENQPVQS VQEDEQANGT RAGPVTELVP DASFFSASAL TTSLLAVSGT PKAVDVNITD FKIQNVDHQD VDTVFHSDTF LLNMKWNATG HGTNLHEGDY FDVDLPNNMK FPSGTTKTDF DITDDDGNVI AHAHITPGPN DTGGKIHVTF GPAVENKYNV KGTMNIAARF DQTKINKDSE NKFEVTVNGD VPGKKHTANT GVKINGSKPI DEEYLTKWGQ GTGIAGDTKA EWWSRINFSK ANLTNAVVTD TLGSSGMTYI KDSFKLRKVV YDQYGETTQV LKEYTVDELI NSGMLTFSAD MSSFTLNLGN TSDQYRLVYR STYVPGTTLK NRMRLDSDQK QKQVIASHKS AETGGHGSGD LASKIKIVKV DEDGTTPLKG AVFTVTKPDG TTFELTTGAD GTIVSDLLEQ GTYKVKEKTA PQGYELSDEE YTLEVTPTGG AIKKISNKPI KISVNVTKKW IGPKAGPVAV HLLADGTDTG KTLTLDEAGN WTGTFDNLRK CKADGTEIVY TVKEDDVTNY TGAVTGDAAS GFTITNTNTE KTSISGTKTW DDSNNQDGKR PSSITVNLLK NGTKVDSKTV TPDASGNWTY TFDNLDKYDS TTGAENTYTV SEEAVDGYTS TVTGTDIKNS YTPEVTTVKV SKTWVGPKAG PVTVHLLADG TDTGKTLTLD EAGNWTGSFG NLPKYKDGNV IAYTVKEDDV TNYTGTVTGD ATSGFTITNT NTEKVDVPVT KTWVGPKAGP VTVHLLADGT DTGKTLTLDE AGNWTGTFSG LDKYNADGTE IAYTVKEDDV ANYTSEITGD ATSGFTITNT NTEKVDISVT KTWVGDKGSS VTIHLLADGT DTGKILTLDE AGNWKGAFSG LDKYNADGTE IAYTVKEDEV AGYTSEVSGD ATSGFTVTNT QIPPNTPPST PKKKLPYTGD VSTLASIASF VAGSVALLGT GMALGRKRK
|
| |