Gene Apar_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0266 
Symbol 
ID8413114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp305471 
End bp309523 
Gene Length4053 bp 
Protein Length1350 aa 
Translation table11 
GC content46% 
IMG OID645021833 
Productcell wall/surface repeat protein 
Protein accessionYP_003179288 
Protein GI257784071 
COG category 
COG ID 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.808645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAA AAAATAAGAA CTTTGCAGGT AAAGGTGCTA AGACAGCTAT GTCTTTGGTA 
CTCGCACTCT CTTTAGTTCC TTCTTCAGCA TTAGCAGAAA TTGCTAACAC TCAAGAAACA
ACTCAAGCGC AAGAAGTTGC GGCAACCCAA ACAAGTGATA CAACAGCTTC AACTTCTGAA
GCAACAGCAG AAACAACAGC TGAATCTACC GTTAACACCG AAGCTACTAC TAACCAGCAG
ACTTCCACTA ACCAGCAAAC TTCCGAGCAG ACTACTACTC CTACAGATAG ATCTTCTCGC
CGTGTTGCAA GAAGCCTTTC AGTTACCTCA ACTGAGCTCT GGGTAGACGG AACCAATGGT
AACGACGCCA ATGATGGCTC TAATACAAAT GCTCTCAAAA CTCTACAGAA AGCACTGGAG
CTATGTTCAG AAACTCCATC CATCAATACC ATTCATCTTA AAGGCAATCT TGATTTGACC
TCTACGGTTA CTATTCCTGC TGGTCTCACA CTCAAAATTG CTGATGAAGG TGCAACTCTT
ACTGGCACTG GTAACAGAAT TGATGGTCTT GTTCTGAAGT CTGGTTCAAC GCTAACCGGC
ACTGGTACCC TTACTATGTC TGGCTTCAAA ACTGCCCTAA CTGCACAGCC TGGTTCAACT
ATCACCGACG GTACCTACGT CTTTAAAAAC AACGCAGGTT CTAGTGGCTC TCGCGGTATT
TCTCTTGGCG GAACCGTTAA AGGCACTACC AACAAAAACA GCGTTAGCAT CACTGCAGAC
GATAAGAGCA ATACAAACTT CTTTGAGTCC GGCTCTACTT TTGAAAACGC TACTATCAAC
GTCACTTCAC AGGCACGTAC CTGGAATGAT GCTCGTAATC TTACCCTCAA GAACACTGAT
CTTAAGGTTA AGGGCTTTGG TCAGACTTTC TATGTCAACC AGCTCAACAT GACTGATTCT
ACCCTGACCA TTAACCCATC TTACTGGGGA CAGACGGGTA TGACCATCCA GGGTCCTTCC
AATATTGTTA ACTCCACCAT CAATGCAAAT GCTGGTTCTA CTTCTGGCAT TTCTGTTGGT
GTAGCTGGAG GAACTGTAAA CGTTACTAAC TCAACCCTTA ACTTTACCAA TGGTGGAACT
GGTGGTCTGA ACGTCAACAC TGGTAATGTC ATCATTAACA ACTCAACCAT TAAAGGAGAT
GGTCGTAATT CTGGCGCTCT CTTTGGTGCA CAGACTAATG GCTCCATCCA GTTCACTGGA
AACAGCCTAG TAGAGACCCC TGCGACCAAG AACTCTGATA ACGGTGCTGG CCAGACTCGT
CAAAATTACA ATGTTGTTGG CGGTTCGTAT CTGCTCAAAT ATGCACCTGA CTACAACAGC
GGCTTTGGTA GCACTATTCC TACAAACGGT GCAGCTAACG GAAACGAAGC ACTCTCGCTT
TTTACACTTG CTGACGTATC AACTAACTCT CTAAGCCTCA TTAATGCTAA TGGTGCAACA
TACACCTACC CAGTTGCTAA CGCATCAAGT GATGGTCAGA AGCACGTCTG GGCACCTGCT
GCAACTGTCA CCTTTGATCT TAACGCCCCA GGCGTTACGG GCATTAATCC AACATTTGCT
GATGGCACTA CCGCTAACAA AACAGCTCTT GCCATGCGTG GCAACTCTCT TGCAACCGCT
TCTTCTGTAG CAGAAGGCTC CACTACCCTG CCTGCAGACC CCTATGCAAG CGGCCAGGAG
TTTGACGGCT GGTACTACAC TGACGCTTCT GGCACAGAAC ATCAATTTAC CGCTGATACT
CAGGTCACCT CTGACATGAC AGTCTATCCA CACTGGAAAG CTAACACCTC TTGGCTCTAC
GTCAGGTATC ACAATGGTAA CGGCGTCTCT GTCATTGACC GCGTTGCTAC TAACGCAAAC
CGTACCGTAA CAGTTCGCTC TAATACTGAG GTCAACAACC TCAACAATAA CTTCAATATT
GCTGGTAAGA CCTTCAAGCG CTGGACTACC CAGGCAAACG GCGCTGGCGA TGAGGTTGCA
GCTAACTCCA ACCTAGCTAT TCCTACTGGT ACCGATACCA TTGATTTGTA TGCAGACTGG
GAGGAGCAGC GCCTTACCGT GTCCTTCTCG GCAAATGGTG GAACCTTCTC TGCAAATAGC
GTATTTAAGC AGAACCCAAA CGTCTTTGAC ATCACTACCG ATGCAAACGG TGGAGAGGTT
GCTACTATCA AAACTCATCC AACCGTTGCT GAGCAGACTA ACATCAACGC ACTGCTTAGA
AACCTCAGTG GCAATACCCT AAGCGCTACA ACTGCTGGAA TTGCAAGTCC AACCGACTCT
AACACCAATA CTGCATACAC AAATATTGCA ACTCTTGAAA ACCATATCCT AGACAGCGAG
GATAAACCAA AGACCTTCTT TGGTTTTGTT GTTGGCCACG ACTATCACTA CTGGTTTACT
GACGCTGCAG GCAACTCCCC TGCAACTATC AACGGTGGTG CAACGCTTAC TAACGACGTC
ACTTACTACC TCAAGTGGAA AGATGATCCT TCCATCCAGA AAGTTGAGCT TACCAGTGAT
CTTCCTGCTG ATATGTGGAG CGATTCTCAG AACAACACCA CTCAGATTAA AGAAGTTTCT
AACGATAAGA GCTTCAGTCT TACGGGCGCC ATTGATGCTA CTTCCGTCAT TAACCAAATG
ACTAACTTTG AGAATAACAT CGCTGGTGGC CTTGACGACC TAACCAAGAT TACGCTTTCA
GGCACAACTT CTACCTTTAC TGCAAAACTT ACTCTTCCTG CTGGCGTTGT TGTCCCAGCT
AATCCAACAG TAACCACCTC TGGTCTTGGT GACCTCTTTG ATGTTTCAAA TGTCTCCGTT
AACGGCCAAG AAGTTACCGT TACCCTGAAG CTTAAGAATA CTTACAGCAA CTATAAGCAG
CTCAAAGATG CTGTTGAAAC AGTCGGCAAA GACGACGCAG CTACCGCAGA GATCGCAAGA
CCCCTTACCG TAACCGTTAA CGGCCTCACC CTTGACTCAA CTCAGGTTAC CAATGGTCAG
GAGCTTACCG CTACAGGTAC TCTTACTGGT ACTTTTGAGT CTTATGCAAA GAATACCGTC
ACTAACGTTA CCAAGAAGTA CAACCTTTCT TGGAACGGTG TTCAGATTAC TGCAAACAGA
GATCCTCGTG GTACAGATAT TCAGCAGACT CTGATTGTTA ACAACCCTAT TGATATGAAC
ATCCCTGCAG ATATGCTCGC AGACGAGAAT ACCGAGCATG ACCAGGTCAT CACCATGAAG
GCAGGCTCTA CCTTTAATCT CACTGGTTCA ATTCTTGCTT CCTCTATCCA AGAGCAGATG
AACAACATCG AGCGTGCCTA TCCTAATACC AACCACGACA CCATTACGCT GAGCAATCTT
AAGTTCAAGT TCACAGCAAC TCTGACCGTT CCTGAAGGAA TGACACTCCC AAGCAACCTT
GATGCCAGCA CTGTCCAGGC TTCTAACTTT GGTAGTGGCT TTAAGGTCAG CGATGTCCAG
GTTAACGGCC GTACCGTTAC TGTCACGTTT GAGCTCAGCG ATCCATCTTC CATTAGAACG
TACTCTGACC TTGAACGTAT TGTCGACGAA GCCAGTGCAA ACGGTGGTTG GATGAAGCTC
ACCATTCCTG GTGTCACCAT TGACTCCAAT GTTGCTGCAG GTACTCAGCT CACCGCTGTT
GGTACAGTTA CAGGCTCCTT TAGCGCTATT GCTGATTCCG CAGCCGGCAA CCGCAAAGCC
TTCTCGTTCA CTTGGAATGG TGTTCAATGG CCAGATGGTA AAGACGCTGT TGCAACTAAT
AACGACACCA TCCAGTTAAC TATTACTGTT GCAGAGGATA AGACCCCAGA GACACCTTCT
CCTTCCACTC CAACTAAGAA GAAGCCATCT AAGAAAAAGA AGACTCCATA TACAGGAGAC
GCTTCTGTTG CAGCACCTCT TGGTGTACTT CTCGCAGGTA TGACAACTGT AATGACTTCA
CTTGGCATTA CTAAGCGTCG TAAAAATAAG TAA
 
Protein sequence
MERKNKNFAG KGAKTAMSLV LALSLVPSSA LAEIANTQET TQAQEVAATQ TSDTTASTSE 
ATAETTAEST VNTEATTNQQ TSTNQQTSEQ TTTPTDRSSR RVARSLSVTS TELWVDGTNG
NDANDGSNTN ALKTLQKALE LCSETPSINT IHLKGNLDLT STVTIPAGLT LKIADEGATL
TGTGNRIDGL VLKSGSTLTG TGTLTMSGFK TALTAQPGST ITDGTYVFKN NAGSSGSRGI
SLGGTVKGTT NKNSVSITAD DKSNTNFFES GSTFENATIN VTSQARTWND ARNLTLKNTD
LKVKGFGQTF YVNQLNMTDS TLTINPSYWG QTGMTIQGPS NIVNSTINAN AGSTSGISVG
VAGGTVNVTN STLNFTNGGT GGLNVNTGNV IINNSTIKGD GRNSGALFGA QTNGSIQFTG
NSLVETPATK NSDNGAGQTR QNYNVVGGSY LLKYAPDYNS GFGSTIPTNG AANGNEALSL
FTLADVSTNS LSLINANGAT YTYPVANASS DGQKHVWAPA ATVTFDLNAP GVTGINPTFA
DGTTANKTAL AMRGNSLATA SSVAEGSTTL PADPYASGQE FDGWYYTDAS GTEHQFTADT
QVTSDMTVYP HWKANTSWLY VRYHNGNGVS VIDRVATNAN RTVTVRSNTE VNNLNNNFNI
AGKTFKRWTT QANGAGDEVA ANSNLAIPTG TDTIDLYADW EEQRLTVSFS ANGGTFSANS
VFKQNPNVFD ITTDANGGEV ATIKTHPTVA EQTNINALLR NLSGNTLSAT TAGIASPTDS
NTNTAYTNIA TLENHILDSE DKPKTFFGFV VGHDYHYWFT DAAGNSPATI NGGATLTNDV
TYYLKWKDDP SIQKVELTSD LPADMWSDSQ NNTTQIKEVS NDKSFSLTGA IDATSVINQM
TNFENNIAGG LDDLTKITLS GTTSTFTAKL TLPAGVVVPA NPTVTTSGLG DLFDVSNVSV
NGQEVTVTLK LKNTYSNYKQ LKDAVETVGK DDAATAEIAR PLTVTVNGLT LDSTQVTNGQ
ELTATGTLTG TFESYAKNTV TNVTKKYNLS WNGVQITANR DPRGTDIQQT LIVNNPIDMN
IPADMLADEN TEHDQVITMK AGSTFNLTGS ILASSIQEQM NNIERAYPNT NHDTITLSNL
KFKFTATLTV PEGMTLPSNL DASTVQASNF GSGFKVSDVQ VNGRTVTVTF ELSDPSSIRT
YSDLERIVDE ASANGGWMKL TIPGVTIDSN VAAGTQLTAV GTVTGSFSAI ADSAAGNRKA
FSFTWNGVQW PDGKDAVATN NDTIQLTITV AEDKTPETPS PSTPTKKKPS KKKKTPYTGD
ASVAAPLGVL LAGMTTVMTS LGITKRRKNK