Gene Information |
Locus tag | Apre_1411 |
Symbol | |
ID | 8398221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1518078 |
End bp | 1524458 |
Gene Length | 6381 bp |
Protein Length | 2126 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644995776 |
Product | sugar-binding domain protein |
Protein accession | YP_003153155 |
Protein GI | 257066899 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR01168] Gram-positive signal peptide, YSIRK family; [TIGR02331] Rib/alpha/Esp surface antigen repeat |
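
The coordinates and lengths listed above agree with each other under two assumptions that the record does not state explicitly: the start and end positions are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein. A minimal check in Python (the variable names are illustrative only):

```python
# Values copied from the Gene Information table.
start_bp = 1518078
end_bp = 1524458

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 6381 2126
```

Both results match the Gene Length and Protein Length fields.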
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.183423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACTA ATGACAAATT AGTGAAGATT ATTGAGTACA AGAAACTCAA AGGATCTATC AGAAAACCAA AGTATGCAAC CAGAAAGCTT TCTATAGGCT TAGTTTCATG CATGCTAGGT TACGCTTTAC TAGTTTCTCC AACATCTGTT GAAGCTACTG AAGGAAACGC AAATGCAAAA ACTGAAGTAG TAGAAAGAGG AGATATAGAA GAAGTAGAAA CTCCTGATGA AGGAGAAGCT AAGGTTGAAG TTCCAGTAGC TACCCCGGAA CCAGATGAAG AAGTTATAAC AGAAGCTGAT AATTTCAATG CAGAGCTTGA AAAGCTTACT GCAAAGGTAG GAGATAGCTC AATTGACTAC AAAAAAGCAA TCAAAAACCT ACCAGAAGAC GCTAAACTTA CAGTAAAAGC TCCAGCAGAC ACCAAAGAAG CAGGAGAAAA GACTGTTGAA GCTACAATTA CATTTGCTGA TGGATCTGAA AAAGATCTTA AAATCACAGT TGATGTAAAG GCTGAGGAAA AACTTGAAAA AAGCAATGAC GGAGCCGACA AAAAAGAAGT TCCAACAGAA GAACAAAGAA AACAAGACCT AGCTAACTAC CTAACAAGGA TAGAAAAATC TACAAATCAA GAAGAAATAG ATGCTATTCT TGAAGAAGCA GCAGATAAAT ATAAAGATAT TGATTTTGCT CCGTTATTAA ATGCTGGTAT TCAAATAAGT GAAAAGACAG AACCACAAGC CTTAAAAGAA TCTAATCCTG ATGAAAAAGA GTTTACAGAA GCAAAAGCAA AAGAAAATTG GGATCAAGAA CTAAAAGACA AAGGTTATTG GAAATTAGCT GAAGGCCAAA GGTTTAGAAG AGTTTCAGCT AGTGACCCAG TAAGTATATA TGATATTAAT TATGATGGAA CTTTTGCTGA TGCAGAAGGT AACACAAATT TAAGATTTAT ATATAATGAA TTAAATGGAG TTGGTTCCGG AGTCTGGCAT AGAATTGTAG TTAACTTTGG TGATTTGACA GATAAAATTG ATTTTGAAAA ATCTTATGTT GTTGGTGAAG ATGGAAAAAC AACAGAAAAA TTCTATGATG TTAATGGTGA AAAGCGTCTA GATATTACCA AATTATATAG TGGACCTAGG GCAGGCAAAA GAGTCAATTT CCCTATGAAT ATTGTTCTAA AGGATGGCAA ATCTATTAAT GAGATTCCAA AGGAAAATTA TATAGTACAG TTAAGGTTAG TAAACAAAGC AAGACCAAAT GTTGCAGAAG GAACAGAAAT TTATACCTAT GCACCTAAGG GAACTTCTGT TGACTACTCA TCCTATACAA AGGTAACTTC TGTGGACTTA TCAGATAACA TTAGTTCAAC AGTGTTAGAA GGACCAAAGC AAGAAAAAGG CGAAAGGATT GCCATCCAAA GGTCTTACAT GTCAGAATTT ATAGCAAATC CAAAAGAATA TGATGACTCT ACTCAAATAG GTATTTTAAG AACAGAATAT CTAGGAAAAA GAGGTGGATC TATTGGTGAA AACATTAAAG ATGTAGACGG TAAGCCGATT GGATTTGCAC AAGTATTTGA TGCAAAACTA GTTGATTATT TAAAAGAAGA TACTAATGGT AATGTAGCCT ATACAAATGT CCTTACTAAC GATAGAAAAC AAGGATCCTG GGTAAAGAAA ATAGGAATTA AAAAAGCTGA TATTACAATA AAAGATGGTC TAGCATATGT AGTTATTGCC AGAAATGATT CCGTAAATGC ATTCAAAAAT AATGAAATAA AGGCTGTTGG CGTTCCTACT 
CTTGACCAGT ATATCAATCA ATCTGGTATA TATTTTTCAT CTATAGATTA TTTAATTGAT AAGACAAAGT TGAACGAAGA GTTTACACCA GGCAAAAGAA AAACAGACTT TTCAATGGCT GCTGGTTGGG TAGAACCAAA CACAAAGGGA TGGACAATTT TTGAAAAAAC ATTTGACGAG GATTTTGTTG TACCTAAAGG GGAAAAGTAT ACTATCAACA CAACAAACGT TCCTGAAGGC GGTCAAGTTA TAATTCAAGT TGGGGATAAA AATCAGGCAA TACTAAGAAA TAAGCAGGGT TATTATAACA GTACAGTTTC TTTCTGGAAA CAAGGGGTTG ATTCAGTCGA TGAAACTAGC CCTGGAACCT ATGAATTCAC ATTAAGAGAA GGTGCCACAA TAAAGAAGGG TGAAGGAATT AGAGTTCTCT TACCTGACAC TCCTGATCAT ACTTCACCAG TAAGTTTTGT AAATCAACAT AATGCAGATG GTGGAACCGA ACTCAAAGTA GAATCAAATT CAGGAAATAT TAAGGTTAAA CTTAATAATT CTGCCAATGG AGATATCAAA TTAAAGTACA CTCTTAAAGG TGAAACAGAA CAAAGTGAAA TTGTATACAA AAAGAAGCTA GTAGGTTGGG AAAAGCCAAA TGATCCTGAT AATATCTTAG AAGGCTTAGG AAAAGCTTGG ATAACTAAAT CAAAGTTAGA GCCTGGTACA AAAATACTTG TAGAACATTA TGATGCTAGT GGAAAAAAAG TAGATAGTCT GGAATCTTAT ATTATTTATA AGGAGCTAGA AAAAAGCCCA GAGAGATATA CTAATATAGC TTGGGTTGAT TCTACAGACA CATTATCAGA AGTTTCAATG AGAAAATCAT TATATAAACC ATACCAAGTT ATCTTTACAA ATGACTATGC AGAAGGAACT GATGATTTCT ACAAAGACCC TAAGGCTTTA CCATCTGATA ACACTGAATT TATGAAAACT ACTGATAAGA TTCAAGGTTA TACAAAATAT GACGGCGGAT TAATCAGAAT GCGTACAGAG CTTTTAGATG ATATAGCACT TTTAGGTAAA ACTCAAGCCC TAGCTAATGA ATATGATAAA GATGGAAATG TCACAACAGA CAATAGTTCT AAGCTTACAT TAAAAGGAGC TAATGGAGAA AAAGAATATA GGGTTTATAG ATATGATATC GATCTTAATA AACTAGGAGA AATTTCAAAA GCTGGAACAT CAGCTAGAGA TGTTGATGCA GAAGGAAATA ACAAGCTAGT TCTTAAAAAA GATATGAAGC TCTATTTCAA TGCATCAGAC GGATCTTCCC TACCAACTGA GTTAGTTGAA TCAAGAGTAA GAACAAGAGT ATTATTCGAT ACAACAGATG GAAAGTTTGC TGATGCTAGC ACAAGATCAG TAAGAATTGC TCCAGATAAC GTGAAATATC TTGAGGATGC AGGATATACA GCAAACGGAT TTACAGGTGC AAATGTAGCA GAAGGTACAG GAGATAAATT TGCTGAAAAT CCAACAGCAG AAGGCAAGAC ATTCCTAGGT TGGGTAACTG AAGCAGGTAA AGCTGAACTA GGAGCAACAA CTGTTAAATC AGATGCCTTC AATAAATTAT CAGCAGATAA AAAATTCACA TCAGAAACTC CAATAACTAC TCACCAAGTT GTATATGCAA TTTACTCAGA TGAAAAATTA GTAACATTTG ATGCTAATGG CGGTAAGTTT GACGATAATT CAACTACAAA GACAGATGAT ATAACTGACG GAGTTCAAGC ACCAACACCA ACTCAAGAAG 
GAAAAGAATT CGTAGGTTGG GCAAGCAAAC CTGATGCAAT AGAAGCTGAA GCAGGAATCT TAGATAAAGT AACAGAAGGA CAAACTGTTT ATGCTGTTTG GAAAGACGCT AAGACTCAAA CAGATGCACA AAAGAATCCA GCAGTTGATC CAACTAAGAC AGAAGTTGCA AACAAAGATA AGTTAACTGA AGACGAAAAA GCTAAAGTAG TTGAAGAAGT TAAGAAAGCA AACCCAGAAG CTAAAGACGT AACAGTTGAT GATAAGGGTA ATGCAACATT AACTTACGAA GATGGAACAA CAAATGAAAT TCCTGGAGAA AAGACAGTAA CTGAAAAAGC AAAAACAGAT GCTGAAAAGA ACCCAGCAGT TGATCCAACT AAGACAGAAG TTGCAAACAA AGATAAGTTA ACTGAAGACG AAAAAGCTAA AGTAGTTGAA GAAGTTAAGA AAGCAAACCC AGAAGCTAAA GACGTAACAG TTGATGATAA GGGTAATGCA ACATTAACTT ACGAAGATGG AACAACAAAT GAAATTCCTG GAGAAAAGAC AGTAACTGAA AAAGCAAAAA CAGATGCTGA AAAGAACCCA GCAGTTGATC CAACTAAGAC AGAAGTTGCA AACAAAGATA AGTTAACTGA AGACGAAAAA GCTAAAGTAG TTGAAGAAGT TAAGAAAGCA AACCCAGAAG CTAAAGACGT AACAGTTGAT GATAAGGGTA ATGCAACATT AACTTACGAA GACGGAACAA CAAATGAAAT TCCTGGAGAA AAGACAGTAA CTGAAAAAGC AAAAACAGAT GCTGAAAAGA ACCCAGCAGT TGATCCAACT AAGACAGAAG TTGCAAACAA AGATAAGTTA ACTGAAGACG AAAAAGCTAA AGTAGTTGAA GAAGTTAAGA AAGCAAACCC AGAAGCTAAA GACGTAACAG TTGATGATAA GGGTAATGCA ACATTAACTT ACGAAGACGG AACAACAAAT GAAATTCCAG CAGACAAGAC AGTAACTGAA AAAGCAAAAA CAGATGTTGA TAAGAGACCA CTTCAAAATG AAGTAGATAA AAAAGATGAT ACAAAAGCAT CAGACAAGTA TAAGAATGCA GACCAAGATA AAAAGGATGC ATACGACAAA GCACTAGAAG ATGCTAAGAA AGTACTTGAA GATCCAAATG CAAGCCAAGA AGATGTCAAC AAAGCTAAAG ACGCTTTAAC ATCAGCAGAA GAAGCTTTAA ACGGCAAAAA AACTCCAGAA GTAGATAAAT CAGCACTTCA AAAAGAAGTA GCCAAAGAAA ATACTACTAA GGATACAGAC AAGTACAAAA ATGCAGACCA AAATAAAAAG GATGCATACG ACAAAGCACT AGAAGATGCT AAGAAAGTCC TTGAAAATCC AAATGCAAGC CAAGAAGATG TCAACAAAGC TAAAGACGCT TTAACAGCAG CAGAAGAAGC ATTAAATGGA GAAAAAACTC CAGAAGTAGA TAAATCAGCA CTTCAAAAAG AAGTAGACAA AGAAAATACT ACTAAGGATA CAGACAAGTA CAAGAATGCA GACCAAGACA AAAAGGATGC ATACAACAAA GCACTAGCAG ATGCCGAAAA AGTACTAAAA GATCCAAATG CAAGTCAAGA CGATGTCAAC AAAGCTAAGC AAGCACTAGA AGATGCAGAA AAAGCACTAA ACGGTGAATC AACATCAGTA GATAAAAAAG CACTAGAAGC TGAAGCAGCT AAAAAAGACA CAACAAAAGC ATCAGATAAG TACACAAATG CAGACCAAGA CAAAAAGGAT GCATACAACA AAGCATTAGC 
AGATGCCGAA AAAGTACTAC AAGATCCAAA TGCAAGTCAA GACGATGTCA ATAAAGCTAA GAAAGCGCTA GAAGATGCAG AAAAAGCACT AAATGGTGAA TCAACATCAG TAGATAAAAA AGCACTAGAA GCTGAAGCAG CTAAAAAAGA CACAACAAAA GCATCAGATA AGTATACAAA TGCAGACCAA GACAAAAAGG ATGCATACAA CAAAGCATTA GCAGATGCCG AAAAAGTACT ACAAGATCCA AATGCAAGTC AAGACGATGT TAACAAAGCT AAGAAAGCGC TAGAAGAAGC GGAAAAAGCA CTAAATGGTA AAGCTAGTGA CGTTTCTGAT AATATTAGTC CAAATCTTCC AGGAAAAACA GAAGTAGAAG ATAAAGACAA CTTAACTGAA GAGGAAAAGG GTAAGGTTAA GGAAGAAGTT AAGAAGGCAA ACCCTAAGGC TAAAGACGTT GACGTTGATA ACAAGGGTAA TGCAACTCTA ATTTACCCAG ATGGATCAAA GAATTATATC TCTTCTGACA AGACAGTAAG CGAAAAAGAA AAATCTATAA AAGATAAAAC AGATGCTGAA GGTAACTTAG CAGTTGCACC AAGCAAAAAG CTTGGAGTAG CCGATAAGGG TAACTTAACA GATGCTGAAA GAAGAGAAAT AGCTGATAAT GTGAAGAAAG CTAATCCAAA TGCAAAAGAA GTAATCGTTG ATGCACAAGG AAAGGCAACA TTAGTATATC CAGATGGATC AAGAAACTTC ATCCCAGCAA GCGAATTAAT CTACGAAAAA GCTAAGGGAC TTGTAGCAGA TAAAACTGTT CAAACAACAA ACAAGACAGG CAAGGCTGCT AATACAAATG TTAAGACAGG AGTAGAGTCA TTAACAGGAG TAATGGCAAC ACTAGCTACA GCAGTAGGTG GATTGTTCGT AAGCAAAAAA AGAAAAGACG ATGATAGATA A
|
Protein sequence | MDTNDKLVKI IEYKKLKGSI RKPKYATRKL SIGLVSCMLG YALLVSPTSV EATEGNANAK TEVVERGDIE EVETPDEGEA KVEVPVATPE PDEEVITEAD NFNAELEKLT AKVGDSSIDY KKAIKNLPED AKLTVKAPAD TKEAGEKTVE ATITFADGSE KDLKITVDVK AEEKLEKSND GADKKEVPTE EQRKQDLANY LTRIEKSTNQ EEIDAILEEA ADKYKDIDFA PLLNAGIQIS EKTEPQALKE SNPDEKEFTE AKAKENWDQE LKDKGYWKLA EGQRFRRVSA SDPVSIYDIN YDGTFADAEG NTNLRFIYNE LNGVGSGVWH RIVVNFGDLT DKIDFEKSYV VGEDGKTTEK FYDVNGEKRL DITKLYSGPR AGKRVNFPMN IVLKDGKSIN EIPKENYIVQ LRLVNKARPN VAEGTEIYTY APKGTSVDYS SYTKVTSVDL SDNISSTVLE GPKQEKGERI AIQRSYMSEF IANPKEYDDS TQIGILRTEY LGKRGGSIGE NIKDVDGKPI GFAQVFDAKL VDYLKEDTNG NVAYTNVLTN DRKQGSWVKK IGIKKADITI KDGLAYVVIA RNDSVNAFKN NEIKAVGVPT LDQYINQSGI YFSSIDYLID KTKLNEEFTP GKRKTDFSMA AGWVEPNTKG WTIFEKTFDE DFVVPKGEKY TINTTNVPEG GQVIIQVGDK NQAILRNKQG YYNSTVSFWK QGVDSVDETS PGTYEFTLRE GATIKKGEGI RVLLPDTPDH TSPVSFVNQH NADGGTELKV ESNSGNIKVK LNNSANGDIK LKYTLKGETE QSEIVYKKKL VGWEKPNDPD NILEGLGKAW ITKSKLEPGT KILVEHYDAS GKKVDSLESY IIYKELEKSP ERYTNIAWVD STDTLSEVSM RKSLYKPYQV IFTNDYAEGT DDFYKDPKAL PSDNTEFMKT TDKIQGYTKY DGGLIRMRTE LLDDIALLGK TQALANEYDK DGNVTTDNSS KLTLKGANGE KEYRVYRYDI DLNKLGEISK AGTSARDVDA EGNNKLVLKK DMKLYFNASD GSSLPTELVE SRVRTRVLFD TTDGKFADAS TRSVRIAPDN VKYLEDAGYT ANGFTGANVA EGTGDKFAEN PTAEGKTFLG WVTEAGKAEL GATTVKSDAF NKLSADKKFT SETPITTHQV VYAIYSDEKL VTFDANGGKF DDNSTTKTDD ITDGVQAPTP TQEGKEFVGW ASKPDAIEAE AGILDKVTEG QTVYAVWKDA KTQTDAQKNP AVDPTKTEVA NKDKLTEDEK AKVVEEVKKA NPEAKDVTVD DKGNATLTYE DGTTNEIPGE KTVTEKAKTD AEKNPAVDPT KTEVANKDKL TEDEKAKVVE EVKKANPEAK DVTVDDKGNA TLTYEDGTTN EIPGEKTVTE KAKTDAEKNP AVDPTKTEVA NKDKLTEDEK AKVVEEVKKA NPEAKDVTVD DKGNATLTYE DGTTNEIPGE KTVTEKAKTD AEKNPAVDPT KTEVANKDKL TEDEKAKVVE EVKKANPEAK DVTVDDKGNA TLTYEDGTTN EIPADKTVTE KAKTDVDKRP LQNEVDKKDD TKASDKYKNA DQDKKDAYDK ALEDAKKVLE DPNASQEDVN KAKDALTSAE EALNGKKTPE VDKSALQKEV AKENTTKDTD KYKNADQNKK DAYDKALEDA KKVLENPNAS QEDVNKAKDA LTAAEEALNG EKTPEVDKSA LQKEVDKENT TKDTDKYKNA DQDKKDAYNK ALADAEKVLK DPNASQDDVN KAKQALEDAE KALNGESTSV DKKALEAEAA KKDTTKASDK YTNADQDKKD 
AYNKALADAE KVLQDPNASQ DDVNKAKKAL EDAEKALNGE STSVDKKALE AEAAKKDTTK ASDKYTNADQ DKKDAYNKAL ADAEKVLQDP NASQDDVNKA KKALEEAEKA LNGKASDVSD NISPNLPGKT EVEDKDNLTE EEKGKVKEEV KKANPKAKDV DVDNKGNATL IYPDGSKNYI SSDKTVSEKE KSIKDKTDAE GNLAVAPSKK LGVADKGNLT DAERREIADN VKKANPNAKE VIVDAQGKAT LVYPDGSRNF IPASELIYEK AKGLVADKTV QTTNKTGKAA NTNVKTGVES LTGVMATLAT AVGGLFVSKK RKDDDR
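
The gene and protein sequences above can be cross-checked with a short translation routine. The sketch below is not the tool used to produce this record. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA, even though the gene lies on the minus strand) and that the blanks between 10-character blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so a standard codon table is enough for a gene that begins with ATG. The function names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the codon-to-amino-acid assignments are the
# same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display whitespace between sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGATACTA ATGACAAATT AGTGAAGATT"))  # prints: MDTNDKLVKI
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would extend the check to the whole entry.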
|
| |