Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1088 |
Symbol | |
ID | 8413961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1231230 |
End bp | 1232270 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645022677 |
Product | glucan 1,3-beta-glucosidase |
Protein accession | YP_003180107 |
Protein GI | 257784890 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.218791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.837913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACG GAACATTGCG AGGTGTCAAT TTAACCGGTT GGCTTACGTT TGAACCGTGG GTAACTCCCG AGCCATTTGC GCGTACAGGC TGTGTTGATG AGGCTCAGCT CATTAAAGCA TTAGGTGTTG AAGCCTACCA TAATCTTGTT AAAGCGCATC GTTCTTCATT TATTCAGAGC TCTGACTTTG TGAGCATTGC CGCTCGCGGT TTTAATGCGG CGAGAATTTC TGTGCCTTGG TATGTTTTTG ACGAAGAAGC CGCTGATACA CCATATGTAA GCTGTATTGC TGAGCTTGAT AAGGCTCTTG AATGGGCAGA AGAGCTTGGT TTGCATGTTA TTTTTGTACT GGCAGTAAAT CCAGGATTAC CTGATGGTTT GGACGATCAG CCAGGTGGTG CTCCACGTAC CAGAATTTCT GGAGAAAAGT CGCTTTCTAT TCTTCATAAA CTTGCCCTTC ATTATGCCCA TCGTAGCGGT TTTTATGGCA TTGAAGTAGC TGATGAGGTT AAACCTCGCG TGCGCAAGGG TTTTAAGCTA ACTGATGGTA TTCCTGGTCA TTTACTTAGA AACTATTATC GCCGTGCTTA TGAGGCTATT AGATCTGTGG CAGGAGAAGA ACCTGTGGTT ATTCTCCCGG ATGGTGGTTG GCCTCAGGGA TTTAGGCGTT TTATGAGTCA GCAGTCGTAT CAAAATGTTT GGCTTGATGC TCATCTTGAT AAGCCTTGTG AGGGAATTGA TTGTTCAGGT CCTCGTGGGG TACAGCAGCT CATTGATAAA AACGAAGCGT ATTTGAAAAC TTCTGCATCA GGAGGTCTAC CCGTAATGGT AGGCAAGTGG TCTGCATCAC TACCAAACAT CAACGGCGCT ATGACTGCAG AAGGAAGAAT TGCTCTTGAG CGTATTTACA CCTCTGGCCA GCTTAAAGTG TACAATACGT GTCCTGCATG GTTCTTCCAG ACCTGGAAGA CCTCTGCATT TTTGGCAGCA TGGGACGCGC GCGTTGCACT GGCAACGTTT GAGAGGGGAA TGCTCGAGTA A
|
Protein sequence | MAHGTLRGVN LTGWLTFEPW VTPEPFARTG CVDEAQLIKA LGVEAYHNLV KAHRSSFIQS SDFVSIAARG FNAARISVPW YVFDEEAADT PYVSCIAELD KALEWAEELG LHVIFVLAVN PGLPDGLDDQ PGGAPRTRIS GEKSLSILHK LALHYAHRSG FYGIEVADEV KPRVRKGFKL TDGIPGHLLR NYYRRAYEAI RSVAGEEPVV ILPDGGWPQG FRRFMSQQSY QNVWLDAHLD KPCEGIDCSG PRGVQQLIDK NEAYLKTSAS GGLPVMVGKW SASLPNINGA MTAEGRIALE RIYTSGQLKV YNTCPAWFFQ TWKTSAFLAA WDARVALATF ERGMLE
|
| |