Gene Apar_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0102 
Symbol 
ID8412945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp111583 
End bp116928 
Gene Length5346 bp 
Protein Length1781 aa 
Translation table11 
GC content46% 
IMG OID645021669 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003179129 
Protein GI257783912 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.432951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGTC GTTTGCGCAA AGTATCTGCA ATATGTGCGG CATTAGCTTT TGTGTTAGGT 
GTGTTTTATA TAGCTCAACC CACGTATGCA GAAGAGCAAC CTGCTGAGGT GAGGTCTCAA
TCAACAACGC AAATGAATTC TGCGCCTGAA AATGTTAACG TTAACATTAT TAATGACACC
ACGAATCGTG TGCAGAACTT CAATTCAAAC TGGAAGTTCA AACTTGGTGA TGCATCAGGT
GCAGAAAACA CAACGTACGA TGACTCTTCT TGGGAATCAG TTAATCTTCC TCACGACTAC
AGCATAGATC AGCCCTATTC TCAGTCCGGA GAGGCAGAAA GCGCCTATAA ACCTGGTGGA
GAGGGTTGGT ATAGAAAGAC TTTTGAAGTT GCTTCCAATC TTCAAGGAAA GCGTTTTCGT
TTAGATTTTG ATGGCGTCTA CATGGATTCA ACTGTGTGGG TCAATGGCCA TATGTTGGGT
ACTCATCCGT ATGGTTACAC GCCGTTTGCT TTTGATATCA CGCCATACAT TAAACCTGGC
GAGCAAAACG TCATTACCGT TCGTGTAAAC GCACAAACAC CAAGTAGTCG TTGGTATTCG
GGCGCTGGTA TTGGCCGTGA CGTTGATCTT GTGGTGACAA ATCCTGTTCA TGTTTCAAAA
GATGGTGTAA AAGCAACTGC TCCAAATCTT GCATCTGAGG TTGGCGGAAG CGTTACCACT
CAATTGACCA CTTCAGTAAC AAATTCATCT GATTCACCTG TAAATGTACA GGTAGTACAG
ACTGTCTTTG CTCGCGGAAC TTCTCCCCAG CAGGCTATTG CATCCGTTAC CACAGAGCGT
AGTATCAATG CAAACACAAC TGATACGTTT AACGCTTCTG CGATTACTTC ATCTTCACCA
GCGCTTTGGG ATATTGATAA CCCCAATCTC TATACCGTAC GCACTGAAGT AAAGGTAGAC
GGAAACGTTG TTGATACCTA TGACACAACC TTTGGATATC GTTATTTTAG CTTTGATGCC
GAGAAGGGCT TCTTACTGAA CGGTATGCCT GTCAAGATAA AAGGTGTTTG TATGCACCAT
GACCAGGGCG CACTTGGCTC TGTATCTACT GCCGATGCAG TTAGGAGACA GGTACAGATT
CTTAAGAATA TGGGCGCCAA TGCCATTCGT ACTTCTCATA ACACACCCTC TCGCGAGCTT
ATTGAGGCAT GTAACGAGCA AGGTGTCTTA TTAGACTATG AGTTCTTTGA CGGTTGGACT
GCCGCAAAGA ACGGTAATAG CAAGGATTAC GCAAGGTTTT TCTCAACGGT GATGGGGGAA
TCTGAGCTTA TTGGTGGCGA CGCAAACAAG ACGTGGGCAC AGTTTGATAT TGAGGCCAGT
GTTGCACGTG ATTACAATGC GCCATCCATT GTGATGTGGT CGCTTGGTAA TGAGATGACT
GAGGGTACCT ATGGTATTTA TGGTTTAGCT CAGGTCCAAA ATAGTCTAAT TGCCTGGACA
CAGGCTGTTG ATCCAACTCG CCCTGTTACT ACAGGAGATA ACCGACTTAA GCGTGGATCA
AATGAGCTAA ATCCTCAGGG AATTTCTGAT GCTGGCGGTA TTGTTGGCAT GAACTACGCT
GGTGGTTCTA CGTACGATAG TATTCACAGC CAACATCCAG ATTGGAAACT TATTGGCTCT
GAAACAGCTT CGTCAATTAA TAGTCGTGGT ATTTACAGTA CTCATAGTAG AGACAACTCT
TCTCAGCAGC TTACTGCATA CGATTATTCT CGTGTTAATT GGGGTCATTA TGCTTCCCAA
GCATGGTATG ACGTACTGAC ACGTGATTTT GTTGCTGGAG AATTTGTTTG GACTGGCTTT
GATTATCTGG GTGAGCCAAC TCCTTGGAAC GGAGTTGATC CTGGCGCAAA AGGTAGATGG
CCATCTCCAA AGAATTCTTA TTTTGGCATC ATTGATACTG CTGGTTTGCC AAAAGATTCG
TATTATTTCT ATCAGAGCCA GTGGAATGAT GCTGTTCACA CATTGCACTT ACTTCCTGCA
TGGAATGGTG ATGCTGTAAA GAAAAACCGT GATGGCACTG TTGACATATC GGTTTATACC
GATGCTCACG CTGTCAGGCT TTATTTCACT CCTGCTGGCT CAACAGAGAA GCAAGATTTA
GGTCTCAAGA CATTTACAAC AAAAACAACA CCAACTAATG GTTTTACGTA TCAGATTTAT
GAAGGGGCCG ATAAGAGTAC TGACGAGTTC AGAAATCTAT ATCTGACCTG GCAAGTTCCG
TATGCTGACG GCACTATCAC CGCCGAAGCA TACGATGAAG CAGGCAATGT TATTGATACT
TCGAGCTGGG ATGGCCGTCA GAGTCTAACT ACCGCAGGTC AACCAAAGAA ACTTTCAGTA
TCAGCAAACC GCTCTTCTAT GAGCGCAAAC GGTACCGATT TGACATATTT GACCGTAGAT
GTTGTTGACG AGAATGGCAA TCGTGTTCCA AACGCTAACA ATAAGGTAAC GTTTGATGTT
TTTGGATCTG GCAAATTAGC AGGAATTGAT AACGGCAGCG CGCCCGATCA TCAGTCATAT
CGCGACGCTA ACCGAGACGC GTTCTCGGGA CAGGTTGTGG GAATTGTTCA GGCGGGAACA
AAAGCGGGTG AGGTTACCGT ACGTGTTTCT GCTGACGGCT TAGAGCCAAC AGAGGTTACC
ATTCCAGTTA CTCCTGCAAA TACAAGCGAT GATACTCCTC AGAAGACAGT TGGAAGTCTG
TTCTATTCTC GTTATTACTA CGTTAAAACA GGCTCTTCAC TGACATTACC TAAAACAATT
CAGGCTCGAT ATACTGACGG AACAGCTTCG GATGAGCCTG TTGTTTGGGA TTTATATGAT
GCCGAGAAAC TCAACTCTGC AGGTACCTTT ACTGTTTCTG GAACAGTTGC AGGTGTTCGA
GCTACCGTAA CGGTGACGGT ATTAGACAAT ATTGCCGCTC TTATGAACTA CTCGACAACA
ACACCAGTTG GTCAAAACCC TATTCTTCCT GATGCACGAC CAGCAGTTCA GGCGGACGGC
ACCGTTTTAC GGGCTAATTT CCCAGTAACT TGGAATGCTG TTCCCGAGGG CTCCTATAAC
CAGGAGGGAA CAGTTACTGT CACTGGTACC GCAAACGTTT TTGGCCGTGA TATGTCCGTC
ACAGCAACTG TGCGTGTGCA GCGAGAGACT GTAACGTTGG GTGAGAATGT TGCTCCTGTT
GCATCAGTCT CGCAGGATAT TCCTGAAGAC AAACAGAGTG ACACACTTTC TGCTATTACT
GACGGCTCTA CTTCAGTTGC AGCCAATCAA GGCGGTGGAG CTAATCCAAC TTGTTGGACA
ACGTACAAGA ACGCACAGGC AGGAAACCAA ACTGCGTCCA TTACGTTCCG CTACGCAACT
CAGCAGAGAA TTGGTCAGGC TCGCGTTCAC TTCTTTGTAG ACTCTTACTC CGCAAGGTTG
CCTAAGCCAG GATCAACTAT CATCGAGGTT TCTGAGAATG GCGAGGATTG GACTAGGGTT
GACGCACAAG AAACAGTTGG ACAAGCACAA GGTCGTGTAA CTCCTTACAC GTATAATTTT
GCTCCGGTAA CCGCTACGTA TGTGCGTTTT ACCATTACCA ACTCTGATGA GGTTCTTGCT
GGAAGAAAGC CGTGTACAGG TATTACTGAA GTTGAGCTCT TCAGTGCTGT TGGGTCATTT
ACTACAAATA ATACGGCTTC ATTTGATTCG CTTTCTGTAA ATGGAAAACA GGTAAGTGAA
GATGCTCTAG CAGCAGGGGA GTACAACACA CCAGCTCTTC TCACTAACGT AGAGGCGCAG
ACAAAAGATA ACGCTGCACT GACTGTACTG CCTCCTTATC AAAATAAAGT TAAGATGCTC
CTTGAATCTG AGGATCATAC TTCTAGGAGT ACGTTTACTG TTAATCTGGG AGTTGATCAG
CCAATTACTG GCGATAGTGA TGCTCAAGAT TACCCGGTAG ATAAGATGCT TGTCTCGGCA
GGTAGTGAAG TTGAGGATAG ATACGTAAGT CCAAATGAAG GCAAGGTTTT GCTGGCGTTT
GACGGTAATC CTAGTACATA TTGGCACTCC ACTTGGGCAC CAAGTTCAAC CGATGACCAC
TGGGTTCAGA TGGAACTTGA TGAGCCAACA ACTATTGAGG CGCTGAGATA TCTACCTCGT
CCTAATAGCC CAGCAAATGG AACTGTTACT GAAGCGTTGG TTGAGTACAG TGACGATGGT
GTTACTTGGC ATGAGGCGGG GCGTGCTACA TGGACTTCTC CTACCAGCCC TGATTACACT
CCTGACTGGA AAATAGTCAA GTTTAATCAA CCAGTTACGG CTAAGTATTT CCGTCTAACT
GGTGTCCATA CGTATGCAGA TGGTGGACGT AACGATAAAT TCATGAGTGC AGCAGAGATT
CGCCTGCGTA CTACAAAGGA AACAACTGAT ATTTCTCATG CACGTATAGA AGCACCTAGT
ACCCTGACAG TTGATTCTGT GAGTGAGTCT AACCCTGCGA TGTTTAATCC GTCTGACGTG
CATGTGTATG TCCCATCAAC AAAAAGTGGA GAATCTCGAC GCACAAGGCG TTCAGTTACC
GATGAGACTG AGCTGAGATA TGGCATTGAT TACGTTCTTG AGTACGAGAA CAATACATCT
GAGGGAACCG CAACAGTACG TGCTCGGGGA ATTGATTCGT ATGCCGGCAC AACTGCGCCT
ACACCGTTTA CGGTTGCTCT TACACAGGTG GTGGTTGATA GTGTATCGGT TGCTTCAACT
CCTACTAAGA CTGCGTACAC TGTAGGTGAG AAGCTTGATC CGTCAGGGCT CAAGTTGACG
CTTGCCATGA GTAACGGAAC GTCGCAGGAG GTAACATATA GCGAAGATAA CAAGGATGAC
TTTACGTTTG ATCCTTCAGC AGAGGCTGCG TTTGATACTG CTGGTACACA TGAGATTACG
GTGACCTATC AGGGTAAGTC TGCTACGTTT GAGGTTACTG TTACACAAGC AACAAACCCT
GCCAATCCAA CAGATCCAAC AAATCCGACT AACCCATCGG ATCCAACAAA CCCAACGAAT
CCAGCAGATC CTTCTAACCC CGCCAATCCA ACAGATCCAT CCAATCCAGA TGGCAATCAG
AGCGCTGACA ATGGTGCCAA CAATGTTAAT AGCAATACTG ACCAGGGAAA TTCTTCTAAG
AAGAAGCGCA CTTCTGCACT TCCTGGTATG GGTGACCCAG TTACGTTAGT TGCAGCATGC
GGATTGCTTA CTATAGGTAT TACGTGCGCT GGCGGGGGAT ATTGGGTACG TAGGCGCAGG
CATTAG
 
Protein sequence
MTSRLRKVSA ICAALAFVLG VFYIAQPTYA EEQPAEVRSQ STTQMNSAPE NVNVNIINDT 
TNRVQNFNSN WKFKLGDASG AENTTYDDSS WESVNLPHDY SIDQPYSQSG EAESAYKPGG
EGWYRKTFEV ASNLQGKRFR LDFDGVYMDS TVWVNGHMLG THPYGYTPFA FDITPYIKPG
EQNVITVRVN AQTPSSRWYS GAGIGRDVDL VVTNPVHVSK DGVKATAPNL ASEVGGSVTT
QLTTSVTNSS DSPVNVQVVQ TVFARGTSPQ QAIASVTTER SINANTTDTF NASAITSSSP
ALWDIDNPNL YTVRTEVKVD GNVVDTYDTT FGYRYFSFDA EKGFLLNGMP VKIKGVCMHH
DQGALGSVST ADAVRRQVQI LKNMGANAIR TSHNTPSREL IEACNEQGVL LDYEFFDGWT
AAKNGNSKDY ARFFSTVMGE SELIGGDANK TWAQFDIEAS VARDYNAPSI VMWSLGNEMT
EGTYGIYGLA QVQNSLIAWT QAVDPTRPVT TGDNRLKRGS NELNPQGISD AGGIVGMNYA
GGSTYDSIHS QHPDWKLIGS ETASSINSRG IYSTHSRDNS SQQLTAYDYS RVNWGHYASQ
AWYDVLTRDF VAGEFVWTGF DYLGEPTPWN GVDPGAKGRW PSPKNSYFGI IDTAGLPKDS
YYFYQSQWND AVHTLHLLPA WNGDAVKKNR DGTVDISVYT DAHAVRLYFT PAGSTEKQDL
GLKTFTTKTT PTNGFTYQIY EGADKSTDEF RNLYLTWQVP YADGTITAEA YDEAGNVIDT
SSWDGRQSLT TAGQPKKLSV SANRSSMSAN GTDLTYLTVD VVDENGNRVP NANNKVTFDV
FGSGKLAGID NGSAPDHQSY RDANRDAFSG QVVGIVQAGT KAGEVTVRVS ADGLEPTEVT
IPVTPANTSD DTPQKTVGSL FYSRYYYVKT GSSLTLPKTI QARYTDGTAS DEPVVWDLYD
AEKLNSAGTF TVSGTVAGVR ATVTVTVLDN IAALMNYSTT TPVGQNPILP DARPAVQADG
TVLRANFPVT WNAVPEGSYN QEGTVTVTGT ANVFGRDMSV TATVRVQRET VTLGENVAPV
ASVSQDIPED KQSDTLSAIT DGSTSVAANQ GGGANPTCWT TYKNAQAGNQ TASITFRYAT
QQRIGQARVH FFVDSYSARL PKPGSTIIEV SENGEDWTRV DAQETVGQAQ GRVTPYTYNF
APVTATYVRF TITNSDEVLA GRKPCTGITE VELFSAVGSF TTNNTASFDS LSVNGKQVSE
DALAAGEYNT PALLTNVEAQ TKDNAALTVL PPYQNKVKML LESEDHTSRS TFTVNLGVDQ
PITGDSDAQD YPVDKMLVSA GSEVEDRYVS PNEGKVLLAF DGNPSTYWHS TWAPSSTDDH
WVQMELDEPT TIEALRYLPR PNSPANGTVT EALVEYSDDG VTWHEAGRAT WTSPTSPDYT
PDWKIVKFNQ PVTAKYFRLT GVHTYADGGR NDKFMSAAEI RLRTTKETTD ISHARIEAPS
TLTVDSVSES NPAMFNPSDV HVYVPSTKSG ESRRTRRSVT DETELRYGID YVLEYENNTS
EGTATVRARG IDSYAGTTAP TPFTVALTQV VVDSVSVAST PTKTAYTVGE KLDPSGLKLT
LAMSNGTSQE VTYSEDNKDD FTFDPSAEAA FDTAGTHEIT VTYQGKSATF EVTVTQATNP
ANPTDPTNPT NPSDPTNPTN PADPSNPANP TDPSNPDGNQ SADNGANNVN SNTDQGNSSK
KKRTSALPGM GDPVTLVAAC GLLTIGITCA GGGYWVRRRR H