Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0102 |
Symbol | |
ID | 8412945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 111583 |
End bp | 116928 |
Gene Length | 5346 bp |
Protein Length | 1781 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645021669 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003179129 |
Protein GI | 257783912 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.432951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGTC GTTTGCGCAA AGTATCTGCA ATATGTGCGG CATTAGCTTT TGTGTTAGGT GTGTTTTATA TAGCTCAACC CACGTATGCA GAAGAGCAAC CTGCTGAGGT GAGGTCTCAA TCAACAACGC AAATGAATTC TGCGCCTGAA AATGTTAACG TTAACATTAT TAATGACACC ACGAATCGTG TGCAGAACTT CAATTCAAAC TGGAAGTTCA AACTTGGTGA TGCATCAGGT GCAGAAAACA CAACGTACGA TGACTCTTCT TGGGAATCAG TTAATCTTCC TCACGACTAC AGCATAGATC AGCCCTATTC TCAGTCCGGA GAGGCAGAAA GCGCCTATAA ACCTGGTGGA GAGGGTTGGT ATAGAAAGAC TTTTGAAGTT GCTTCCAATC TTCAAGGAAA GCGTTTTCGT TTAGATTTTG ATGGCGTCTA CATGGATTCA ACTGTGTGGG TCAATGGCCA TATGTTGGGT ACTCATCCGT ATGGTTACAC GCCGTTTGCT TTTGATATCA CGCCATACAT TAAACCTGGC GAGCAAAACG TCATTACCGT TCGTGTAAAC GCACAAACAC CAAGTAGTCG TTGGTATTCG GGCGCTGGTA TTGGCCGTGA CGTTGATCTT GTGGTGACAA ATCCTGTTCA TGTTTCAAAA GATGGTGTAA AAGCAACTGC TCCAAATCTT GCATCTGAGG TTGGCGGAAG CGTTACCACT CAATTGACCA CTTCAGTAAC AAATTCATCT GATTCACCTG TAAATGTACA GGTAGTACAG ACTGTCTTTG CTCGCGGAAC TTCTCCCCAG CAGGCTATTG CATCCGTTAC CACAGAGCGT AGTATCAATG CAAACACAAC TGATACGTTT AACGCTTCTG CGATTACTTC ATCTTCACCA GCGCTTTGGG ATATTGATAA CCCCAATCTC TATACCGTAC GCACTGAAGT AAAGGTAGAC GGAAACGTTG TTGATACCTA TGACACAACC TTTGGATATC GTTATTTTAG CTTTGATGCC GAGAAGGGCT TCTTACTGAA CGGTATGCCT GTCAAGATAA AAGGTGTTTG TATGCACCAT GACCAGGGCG CACTTGGCTC TGTATCTACT GCCGATGCAG TTAGGAGACA GGTACAGATT CTTAAGAATA TGGGCGCCAA TGCCATTCGT ACTTCTCATA ACACACCCTC TCGCGAGCTT ATTGAGGCAT GTAACGAGCA AGGTGTCTTA TTAGACTATG AGTTCTTTGA CGGTTGGACT GCCGCAAAGA ACGGTAATAG CAAGGATTAC GCAAGGTTTT TCTCAACGGT GATGGGGGAA TCTGAGCTTA TTGGTGGCGA CGCAAACAAG ACGTGGGCAC AGTTTGATAT TGAGGCCAGT GTTGCACGTG ATTACAATGC GCCATCCATT GTGATGTGGT CGCTTGGTAA TGAGATGACT GAGGGTACCT ATGGTATTTA TGGTTTAGCT CAGGTCCAAA ATAGTCTAAT TGCCTGGACA CAGGCTGTTG ATCCAACTCG CCCTGTTACT ACAGGAGATA ACCGACTTAA GCGTGGATCA AATGAGCTAA ATCCTCAGGG AATTTCTGAT GCTGGCGGTA TTGTTGGCAT GAACTACGCT GGTGGTTCTA CGTACGATAG TATTCACAGC CAACATCCAG ATTGGAAACT TATTGGCTCT GAAACAGCTT CGTCAATTAA TAGTCGTGGT ATTTACAGTA CTCATAGTAG AGACAACTCT TCTCAGCAGC TTACTGCATA CGATTATTCT CGTGTTAATT GGGGTCATTA TGCTTCCCAA GCATGGTATG ACGTACTGAC ACGTGATTTT GTTGCTGGAG AATTTGTTTG GACTGGCTTT GATTATCTGG GTGAGCCAAC TCCTTGGAAC GGAGTTGATC CTGGCGCAAA AGGTAGATGG CCATCTCCAA AGAATTCTTA TTTTGGCATC ATTGATACTG CTGGTTTGCC AAAAGATTCG TATTATTTCT ATCAGAGCCA GTGGAATGAT GCTGTTCACA CATTGCACTT ACTTCCTGCA TGGAATGGTG ATGCTGTAAA GAAAAACCGT GATGGCACTG TTGACATATC GGTTTATACC GATGCTCACG CTGTCAGGCT TTATTTCACT CCTGCTGGCT CAACAGAGAA GCAAGATTTA GGTCTCAAGA CATTTACAAC AAAAACAACA CCAACTAATG GTTTTACGTA TCAGATTTAT GAAGGGGCCG ATAAGAGTAC TGACGAGTTC AGAAATCTAT ATCTGACCTG GCAAGTTCCG TATGCTGACG GCACTATCAC CGCCGAAGCA TACGATGAAG CAGGCAATGT TATTGATACT TCGAGCTGGG ATGGCCGTCA GAGTCTAACT ACCGCAGGTC AACCAAAGAA ACTTTCAGTA TCAGCAAACC GCTCTTCTAT GAGCGCAAAC GGTACCGATT TGACATATTT GACCGTAGAT GTTGTTGACG AGAATGGCAA TCGTGTTCCA AACGCTAACA ATAAGGTAAC GTTTGATGTT TTTGGATCTG GCAAATTAGC AGGAATTGAT AACGGCAGCG CGCCCGATCA TCAGTCATAT CGCGACGCTA ACCGAGACGC GTTCTCGGGA CAGGTTGTGG GAATTGTTCA GGCGGGAACA AAAGCGGGTG AGGTTACCGT ACGTGTTTCT GCTGACGGCT TAGAGCCAAC AGAGGTTACC ATTCCAGTTA CTCCTGCAAA TACAAGCGAT GATACTCCTC AGAAGACAGT TGGAAGTCTG TTCTATTCTC GTTATTACTA CGTTAAAACA GGCTCTTCAC TGACATTACC TAAAACAATT CAGGCTCGAT ATACTGACGG AACAGCTTCG GATGAGCCTG TTGTTTGGGA TTTATATGAT GCCGAGAAAC TCAACTCTGC AGGTACCTTT ACTGTTTCTG GAACAGTTGC AGGTGTTCGA GCTACCGTAA CGGTGACGGT ATTAGACAAT ATTGCCGCTC TTATGAACTA CTCGACAACA ACACCAGTTG GTCAAAACCC TATTCTTCCT GATGCACGAC CAGCAGTTCA GGCGGACGGC ACCGTTTTAC GGGCTAATTT CCCAGTAACT TGGAATGCTG TTCCCGAGGG CTCCTATAAC CAGGAGGGAA CAGTTACTGT CACTGGTACC GCAAACGTTT TTGGCCGTGA TATGTCCGTC ACAGCAACTG TGCGTGTGCA GCGAGAGACT GTAACGTTGG GTGAGAATGT TGCTCCTGTT GCATCAGTCT CGCAGGATAT TCCTGAAGAC AAACAGAGTG ACACACTTTC TGCTATTACT GACGGCTCTA CTTCAGTTGC AGCCAATCAA GGCGGTGGAG CTAATCCAAC TTGTTGGACA ACGTACAAGA ACGCACAGGC AGGAAACCAA ACTGCGTCCA TTACGTTCCG CTACGCAACT CAGCAGAGAA TTGGTCAGGC TCGCGTTCAC TTCTTTGTAG ACTCTTACTC CGCAAGGTTG CCTAAGCCAG GATCAACTAT CATCGAGGTT TCTGAGAATG GCGAGGATTG GACTAGGGTT GACGCACAAG AAACAGTTGG ACAAGCACAA GGTCGTGTAA CTCCTTACAC GTATAATTTT GCTCCGGTAA CCGCTACGTA TGTGCGTTTT ACCATTACCA ACTCTGATGA GGTTCTTGCT GGAAGAAAGC CGTGTACAGG TATTACTGAA GTTGAGCTCT TCAGTGCTGT TGGGTCATTT ACTACAAATA ATACGGCTTC ATTTGATTCG CTTTCTGTAA ATGGAAAACA GGTAAGTGAA GATGCTCTAG CAGCAGGGGA GTACAACACA CCAGCTCTTC TCACTAACGT AGAGGCGCAG ACAAAAGATA ACGCTGCACT GACTGTACTG CCTCCTTATC AAAATAAAGT TAAGATGCTC CTTGAATCTG AGGATCATAC TTCTAGGAGT ACGTTTACTG TTAATCTGGG AGTTGATCAG CCAATTACTG GCGATAGTGA TGCTCAAGAT TACCCGGTAG ATAAGATGCT TGTCTCGGCA GGTAGTGAAG TTGAGGATAG ATACGTAAGT CCAAATGAAG GCAAGGTTTT GCTGGCGTTT GACGGTAATC CTAGTACATA TTGGCACTCC ACTTGGGCAC CAAGTTCAAC CGATGACCAC TGGGTTCAGA TGGAACTTGA TGAGCCAACA ACTATTGAGG CGCTGAGATA TCTACCTCGT CCTAATAGCC CAGCAAATGG AACTGTTACT GAAGCGTTGG TTGAGTACAG TGACGATGGT GTTACTTGGC ATGAGGCGGG GCGTGCTACA TGGACTTCTC CTACCAGCCC TGATTACACT CCTGACTGGA AAATAGTCAA GTTTAATCAA CCAGTTACGG CTAAGTATTT CCGTCTAACT GGTGTCCATA CGTATGCAGA TGGTGGACGT AACGATAAAT TCATGAGTGC AGCAGAGATT CGCCTGCGTA CTACAAAGGA AACAACTGAT ATTTCTCATG CACGTATAGA AGCACCTAGT ACCCTGACAG TTGATTCTGT GAGTGAGTCT AACCCTGCGA TGTTTAATCC GTCTGACGTG CATGTGTATG TCCCATCAAC AAAAAGTGGA GAATCTCGAC GCACAAGGCG TTCAGTTACC GATGAGACTG AGCTGAGATA TGGCATTGAT TACGTTCTTG AGTACGAGAA CAATACATCT GAGGGAACCG CAACAGTACG TGCTCGGGGA ATTGATTCGT ATGCCGGCAC AACTGCGCCT ACACCGTTTA CGGTTGCTCT TACACAGGTG GTGGTTGATA GTGTATCGGT TGCTTCAACT CCTACTAAGA CTGCGTACAC TGTAGGTGAG AAGCTTGATC CGTCAGGGCT CAAGTTGACG CTTGCCATGA GTAACGGAAC GTCGCAGGAG GTAACATATA GCGAAGATAA CAAGGATGAC TTTACGTTTG ATCCTTCAGC AGAGGCTGCG TTTGATACTG CTGGTACACA TGAGATTACG GTGACCTATC AGGGTAAGTC TGCTACGTTT GAGGTTACTG TTACACAAGC AACAAACCCT GCCAATCCAA CAGATCCAAC AAATCCGACT AACCCATCGG ATCCAACAAA CCCAACGAAT CCAGCAGATC CTTCTAACCC CGCCAATCCA ACAGATCCAT CCAATCCAGA TGGCAATCAG AGCGCTGACA ATGGTGCCAA CAATGTTAAT AGCAATACTG ACCAGGGAAA TTCTTCTAAG AAGAAGCGCA CTTCTGCACT TCCTGGTATG GGTGACCCAG TTACGTTAGT TGCAGCATGC GGATTGCTTA CTATAGGTAT TACGTGCGCT GGCGGGGGAT ATTGGGTACG TAGGCGCAGG CATTAG
|
Protein sequence | MTSRLRKVSA ICAALAFVLG VFYIAQPTYA EEQPAEVRSQ STTQMNSAPE NVNVNIINDT TNRVQNFNSN WKFKLGDASG AENTTYDDSS WESVNLPHDY SIDQPYSQSG EAESAYKPGG EGWYRKTFEV ASNLQGKRFR LDFDGVYMDS TVWVNGHMLG THPYGYTPFA FDITPYIKPG EQNVITVRVN AQTPSSRWYS GAGIGRDVDL VVTNPVHVSK DGVKATAPNL ASEVGGSVTT QLTTSVTNSS DSPVNVQVVQ TVFARGTSPQ QAIASVTTER SINANTTDTF NASAITSSSP ALWDIDNPNL YTVRTEVKVD GNVVDTYDTT FGYRYFSFDA EKGFLLNGMP VKIKGVCMHH DQGALGSVST ADAVRRQVQI LKNMGANAIR TSHNTPSREL IEACNEQGVL LDYEFFDGWT AAKNGNSKDY ARFFSTVMGE SELIGGDANK TWAQFDIEAS VARDYNAPSI VMWSLGNEMT EGTYGIYGLA QVQNSLIAWT QAVDPTRPVT TGDNRLKRGS NELNPQGISD AGGIVGMNYA GGSTYDSIHS QHPDWKLIGS ETASSINSRG IYSTHSRDNS SQQLTAYDYS RVNWGHYASQ AWYDVLTRDF VAGEFVWTGF DYLGEPTPWN GVDPGAKGRW PSPKNSYFGI IDTAGLPKDS YYFYQSQWND AVHTLHLLPA WNGDAVKKNR DGTVDISVYT DAHAVRLYFT PAGSTEKQDL GLKTFTTKTT PTNGFTYQIY EGADKSTDEF RNLYLTWQVP YADGTITAEA YDEAGNVIDT SSWDGRQSLT TAGQPKKLSV SANRSSMSAN GTDLTYLTVD VVDENGNRVP NANNKVTFDV FGSGKLAGID NGSAPDHQSY RDANRDAFSG QVVGIVQAGT KAGEVTVRVS ADGLEPTEVT IPVTPANTSD DTPQKTVGSL FYSRYYYVKT GSSLTLPKTI QARYTDGTAS DEPVVWDLYD AEKLNSAGTF TVSGTVAGVR ATVTVTVLDN IAALMNYSTT TPVGQNPILP DARPAVQADG TVLRANFPVT WNAVPEGSYN QEGTVTVTGT ANVFGRDMSV TATVRVQRET VTLGENVAPV ASVSQDIPED KQSDTLSAIT DGSTSVAANQ GGGANPTCWT TYKNAQAGNQ TASITFRYAT QQRIGQARVH FFVDSYSARL PKPGSTIIEV SENGEDWTRV DAQETVGQAQ GRVTPYTYNF APVTATYVRF TITNSDEVLA GRKPCTGITE VELFSAVGSF TTNNTASFDS LSVNGKQVSE DALAAGEYNT PALLTNVEAQ TKDNAALTVL PPYQNKVKML LESEDHTSRS TFTVNLGVDQ PITGDSDAQD YPVDKMLVSA GSEVEDRYVS PNEGKVLLAF DGNPSTYWHS TWAPSSTDDH WVQMELDEPT TIEALRYLPR PNSPANGTVT EALVEYSDDG VTWHEAGRAT WTSPTSPDYT PDWKIVKFNQ PVTAKYFRLT GVHTYADGGR NDKFMSAAEI RLRTTKETTD ISHARIEAPS TLTVDSVSES NPAMFNPSDV HVYVPSTKSG ESRRTRRSVT DETELRYGID YVLEYENNTS EGTATVRARG IDSYAGTTAP TPFTVALTQV VVDSVSVAST PTKTAYTVGE KLDPSGLKLT LAMSNGTSQE VTYSEDNKDD FTFDPSAEAA FDTAGTHEIT VTYQGKSATF EVTVTQATNP ANPTDPTNPT NPSDPTNPTN PADPSNPANP TDPSNPDGNQ SADNGANNVN SNTDQGNSSK KKRTSALPGM GDPVTLVAAC GLLTIGITCA GGGYWVRRRR H
|
| |