Gene Information |
Locus tag | Apar_0078 |
Symbol | |
ID | 8412921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 87861 |
End bp | 89258 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645021645 |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003179105 |
Protein GI | 257783888 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
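The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one stop codon at the end.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(87861, 89258)
print(gene_len, protein_len)  # 1398 465
```

Both values agree with the Gene Length and Protein Length fields of this record.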
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATATC AGCTTCCTAA AGACTTTTTC TTTGGCGGGG CTATGTCTGG CCCACAAACT GAAGGCAGAT GGCAAGATGA CGGAAGAATC CCTAGCATTT GGGATACTTG GTCTAACCTT GACATCACCG CTTTTCACAA CCGCGTAGGG TCTTATGGTG GCAATGATTT TAGCAGCAGA ATGGAAGAGG ACTTTGAGCT TCTTAAGTCA ATAGGAATGG ACTCAGTTCG TACTTCTATC CAGTGGAGTC GCCTTTTAGA TATCGATGGA AACCTTAATC CAGAGGGTGA GAGGTACTAT CATCAGCTCT TTGCTACAGC AAAGAAGGTT GGTATTGAGA TTTTTGTAAA TCTCTATCAC TTTGATATGC CTGAATACCT CTTCAATCGC GGTGGTTGGG AGTCTCGCGA GGTAGTTGAG GCATATGCGC ATTATGCACG TATTGCGTTT GAGACTTTTG GTAAAGAGAT TCGTTACTGG TTTACTTTTA ATGAGCCAAT TGTTGAGCCT GAGATGCGCT ATACCGTTGG CGGATGGTTC CCTTTTGTAA AGAATTATTC CCGCGCTCGT GCTGTTCAGT ACAATATTTC GCTTGCTCAT GCGCTTGGTG TCCGCGAGTA TCGTCGCGCA AAAGCAGCAG GTTTTATGCT TGAGGATTCT CGCATTGGTC TTATCAATTG CTTTGCACCA CCATATACCA AAGACAATCC ATCAGAAGCA GACCTTGAGG CGCTGCGTAT GACCGATGGC GTTAACATTC GCTGGTGGCT TGACCTAGTT ACTAAGGGAG AACTCCCACA GGATGTCATT GATACGCTGC AGTCTCGTGG TGTTGACCTG CCTATTCGCC CTGAGGATAA GCTCATTCTT GCCGATGGAG TTGTGGATTG GTTGGGCTGC AATTATTACC ATCCAGAGCG TATTCAGGCT CCTGCAAAAG ATACTGATGA AAATGGCATT CCAAACTTTG CTGACCCGTA TGTTTGGCCA GAAGCAGAGA TGAATGTTTC TCGTGGTTGG GAAATTTACC CACAAGGTCT TTACGACTTT GCTATGAAGG TTCGCGATGA ATATCCAGAG CTTGAGTGGT TTGTTTCTGA GAATGGCATG GGTGTTGAGC GAGAAGATCT TAAAAAAGAT GAAAACGGTG TAATTCAGGA CGACTACCGT GTTGATTTTG TTCGTCGCCA TCTTGAGTGG ATTGCCCGTG CAATTCAGGA CGGCGCAAAA TGTCGTGGTT ACCACTACTG GGCCATCATT GATAACTGGT CTTGGGCAAA TGCTTTCAAG AACCGTTATG GCTTTATTGA GGTAGATCTG GAAGATAACT ACAACCGTCG TCTTAAGAAA TCAGCTAAGT GGCTTAAACA AATTGCCACT ACACATATAG TTGACTAG
|
Protein sequence | MQYQLPKDFF FGGAMSGPQT EGRWQDDGRI PSIWDTWSNL DITAFHNRVG SYGGNDFSSR MEEDFELLKS IGMDSVRTSI QWSRLLDIDG NLNPEGERYY HQLFATAKKV GIEIFVNLYH FDMPEYLFNR GGWESREVVE AYAHYARIAF ETFGKEIRYW FTFNEPIVEP EMRYTVGGWF PFVKNYSRAR AVQYNISLAH ALGVREYRRA KAAGFMLEDS RIGLINCFAP PYTKDNPSEA DLEALRMTDG VNIRWWLDLV TKGELPQDVI DTLQSRGVDL PIRPEDKLIL ADGVVDWLGC NYYHPERIQA PAKDTDENGI PNFADPYVWP EAEMNVSRGW EIYPQGLYDF AMKVRDEYPE LEWFVSENGM GVEREDLKKD ENGVIQDDYR VDFVRRHLEW IARAIQDGAK CRGYHYWAII DNWSWANAFK NRYGFIEVDL EDNYNRRLKK SAKWLKQIAT THIVD
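The relationship between the gene and protein sequences above can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which for internal codons match the standard genetic code; the names `translate` and `gc_percent` are hypothetical helpers, not part of any database API. Whitespace between the 10-base blocks is ignored.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace (e.g. the 10-base block spacing) and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAATATC AGCTTCCTAA AGACTTTTTC"))  # MQYQLPKDFF
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the block spacing) is one way to check that the two fields agree.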