Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1336 |
Symbol | |
ID | 8414221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 1502713 |
End bp | 1504410 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645022933 |
Product | alpha amylase catalytic region |
Protein accession | YP_003180351 |
Protein GI | 257785134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02403] alpha,alpha-phosphotrehalase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00360472 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAGGGC AATCTTCACG AGTAACAAGC AATCCCAAGG GCGCTTATAC GCGCGCTGAG TTTCTCGGCA CCAAGGTTGT GTATCAAATC TACGTCCGCT CCTTTAACGA TTCAAACGGA GACGGCATCG GAGACCTTCC CGGCATCACG CAACGTCTTG ACTACCTGCA AAAACTCGGC GTTGACTATC TGTGGCTTAC TCCGTTTTTT GTTTCACCCC AACATGACAA TGGCTACGAC GTTGCAGATT ACCGCAACGT CGAGCCTCTC TTTGGCACCA TGGCTGACTT CAACGAACTT TCAGCCGAAG CAAAAAAGCA CGGTATCAAG CTGATGCTCG ACATGGTCTT TAACCACACA TCAACCGAGC ATCCATGGTT TCAGCGGGCA CTTAGCGGCG ATCCTCAATA CCTTGCCTAC TATACCTTTG TCGATGGCAA CCCCAACACG CCGCCAACAA ACTGGAAGTC CAAGTTTGGC GGAAGCGCAT GGGAGTGGGT TCCCACACTT CATAAGTGGT ATCTCCATCT CTTTGACGCC TCACAGGCGG ATCTCAACTG GGACAACCCA AGCGTCCGAG CCGAGCTTGC CGACATAGTG AGTTTCTGGC ACAACAAAGG CGTTGATGGT TTTCGCTTTG ACGTTGTTAA TCTCATTTCC AAGCCAGATG TCTTTGAGGA CGACGCCATA GGAGACGGCA GGCGCTTTTA CACCGACGGG CCACATGTCC ACGAATACCT GCAGGAACTC GTCAAGCGCG GTGGTATCGA CGGGCTGATG ACCGTCGGTG AAATGAGCTC CACCTCAATT GAAAACTGCA TACGCTATTC CAATCCGGCT GACCATGAGC TTGCCATGAC CTTCTCGTTT CATCACCTTA AGGTTGATTA CCTCAACGGA GACAAGTGGT CGCTCAAAGA GCCGGATATC GGCAAGCTTC GTGAGCTGTT GAAATCGTGG CAAGAACAGA TTACGGCAGG TGGAGGCTGG AATGCGCTGT TTTGGGCCAA TCACGATCAG CCTCGTCCGA ATTCTCGCTT TGGCGATACC GAGCACTACT GGGAAATGTC CAGCAAACTG CTTGCCGTCA CCGCACATCT TCTGCGCGGA ACTCCCTATA TTTACCAGGG AGAAGAGCTG GGTATGACCA ATGCGGGATT TACCAATATC ACACAGTATC GAGACGTGGA GTCCCTCAAT TACTTTAAGA TCCTTCAGGA TCGGGGCTGC TCCCCCAAAG AAGCGCTCCA TATCATCTCC GAGCGCTCCC GCGACAATGG GCGCACACCC GTTCAATGGG ATGCCTCAAA AACCGCAGGT TTCACTTCCG GTACACCGTG GATTGGCATT CCCGACAACC ACACCATCAT CAATGCTGCT GCAGAAGTTG GCGATCCTGA CTCGATATTC TCCTTCTATC AAAAGCTCAT TGTTCTTCGA AAAACTCATC CCGTTATCAG CGAGGGAGAT GTTTGCTTTA TCGACTCCGC TGGCGAGAAG GTCATTGCCT ACGAGCGTAC CTTGGATAGC TGTTGCGTGC GCGTCTTTGC AAACTTCTCT GACCAAAAGG TTCGCTGTGC GCCGAAAGCT GAAATCGATG GGTCTGACGT CCTTATTGGT AACTATCCCA ACACCGTAAC CGATGCAGAT GCGCTCATAC TTCGTCCTTT TGAGGCACGA GCCTTTATCT GGGAATGA
|
Protein sequence | MSGQSSRVTS NPKGAYTRAE FLGTKVVYQI YVRSFNDSNG DGIGDLPGIT QRLDYLQKLG VDYLWLTPFF VSPQHDNGYD VADYRNVEPL FGTMADFNEL SAEAKKHGIK LMLDMVFNHT STEHPWFQRA LSGDPQYLAY YTFVDGNPNT PPTNWKSKFG GSAWEWVPTL HKWYLHLFDA SQADLNWDNP SVRAELADIV SFWHNKGVDG FRFDVVNLIS KPDVFEDDAI GDGRRFYTDG PHVHEYLQEL VKRGGIDGLM TVGEMSSTSI ENCIRYSNPA DHELAMTFSF HHLKVDYLNG DKWSLKEPDI GKLRELLKSW QEQITAGGGW NALFWANHDQ PRPNSRFGDT EHYWEMSSKL LAVTAHLLRG TPYIYQGEEL GMTNAGFTNI TQYRDVESLN YFKILQDRGC SPKEALHIIS ERSRDNGRTP VQWDASKTAG FTSGTPWIGI PDNHTIINAA AEVGDPDSIF SFYQKLIVLR KTHPVISEGD VCFIDSAGEK VIAYERTLDS CCVRVFANFS DQKVRCAPKA EIDGSDVLIG NYPNTVTDAD ALILRPFEAR AFIWE
|
| |