Gene Apar_0078 details

Gene Information

Locus tag: Apar_0078
Symbol:
ID: 8412921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 87861
End bp: 89258
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 45%
IMG OID: 645021645
Product: glycoside hydrolase family 1
Protein accession: YP_003179105
Protein GI: 257783888
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
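
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
print(cds_length(87861, 89258))   # 1398
print(protein_length(1398))       # 465
```

Both results agree with the Gene Length and Protein Length fields listed for Apar_0078.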


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATATC AGCTTCCTAA AGACTTTTTC TTTGGCGGGG CTATGTCTGG CCCACAAACT 
GAAGGCAGAT GGCAAGATGA CGGAAGAATC CCTAGCATTT GGGATACTTG GTCTAACCTT
GACATCACCG CTTTTCACAA CCGCGTAGGG TCTTATGGTG GCAATGATTT TAGCAGCAGA
ATGGAAGAGG ACTTTGAGCT TCTTAAGTCA ATAGGAATGG ACTCAGTTCG TACTTCTATC
CAGTGGAGTC GCCTTTTAGA TATCGATGGA AACCTTAATC CAGAGGGTGA GAGGTACTAT
CATCAGCTCT TTGCTACAGC AAAGAAGGTT GGTATTGAGA TTTTTGTAAA TCTCTATCAC
TTTGATATGC CTGAATACCT CTTCAATCGC GGTGGTTGGG AGTCTCGCGA GGTAGTTGAG
GCATATGCGC ATTATGCACG TATTGCGTTT GAGACTTTTG GTAAAGAGAT TCGTTACTGG
TTTACTTTTA ATGAGCCAAT TGTTGAGCCT GAGATGCGCT ATACCGTTGG CGGATGGTTC
CCTTTTGTAA AGAATTATTC CCGCGCTCGT GCTGTTCAGT ACAATATTTC GCTTGCTCAT
GCGCTTGGTG TCCGCGAGTA TCGTCGCGCA AAAGCAGCAG GTTTTATGCT TGAGGATTCT
CGCATTGGTC TTATCAATTG CTTTGCACCA CCATATACCA AAGACAATCC ATCAGAAGCA
GACCTTGAGG CGCTGCGTAT GACCGATGGC GTTAACATTC GCTGGTGGCT TGACCTAGTT
ACTAAGGGAG AACTCCCACA GGATGTCATT GATACGCTGC AGTCTCGTGG TGTTGACCTG
CCTATTCGCC CTGAGGATAA GCTCATTCTT GCCGATGGAG TTGTGGATTG GTTGGGCTGC
AATTATTACC ATCCAGAGCG TATTCAGGCT CCTGCAAAAG ATACTGATGA AAATGGCATT
CCAAACTTTG CTGACCCGTA TGTTTGGCCA GAAGCAGAGA TGAATGTTTC TCGTGGTTGG
GAAATTTACC CACAAGGTCT TTACGACTTT GCTATGAAGG TTCGCGATGA ATATCCAGAG
CTTGAGTGGT TTGTTTCTGA GAATGGCATG GGTGTTGAGC GAGAAGATCT TAAAAAAGAT
GAAAACGGTG TAATTCAGGA CGACTACCGT GTTGATTTTG TTCGTCGCCA TCTTGAGTGG
ATTGCCCGTG CAATTCAGGA CGGCGCAAAA TGTCGTGGTT ACCACTACTG GGCCATCATT
GATAACTGGT CTTGGGCAAA TGCTTTCAAG AACCGTTATG GCTTTATTGA GGTAGATCTG
GAAGATAACT ACAACCGTCG TCTTAAGAAA TCAGCTAAGT GGCTTAAACA AATTGCCACT
ACACATATAG TTGACTAG
 
Protein sequence
MQYQLPKDFF FGGAMSGPQT EGRWQDDGRI PSIWDTWSNL DITAFHNRVG SYGGNDFSSR 
MEEDFELLKS IGMDSVRTSI QWSRLLDIDG NLNPEGERYY HQLFATAKKV GIEIFVNLYH
FDMPEYLFNR GGWESREVVE AYAHYARIAF ETFGKEIRYW FTFNEPIVEP EMRYTVGGWF
PFVKNYSRAR AVQYNISLAH ALGVREYRRA KAAGFMLEDS RIGLINCFAP PYTKDNPSEA
DLEALRMTDG VNIRWWLDLV TKGELPQDVI DTLQSRGVDL PIRPEDKLIL ADGVVDWLGC
NYYHPERIQA PAKDTDENGI PNFADPYVWP EAEMNVSRGW EIYPQGLYDF AMKVRDEYPE
LEWFVSENGM GVEREDLKKD ENGVIQDDYR VDFVRRHLEW IARAIQDGAK CRGYHYWAII
DNWSWANAFK NRYGFIEVDL EDNYNRRLKK SAKWLKQIAT THIVD
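
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code and differs only in the permitted start codons. The names `clean` and `translate` are hypothetical helpers, and alternative start codons are not handled specially, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# '*' marks a stop codon. Sense-codon assignments are those of table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAATATC AGCTTCCTAA AGACTTTTTC"))  # MQYQLPKDFF
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check between the two blocks.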