Gene Information |
Locus tag | Apar_0935 |
Symbol | |
ID | 8413806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1049159 |
End bp | 1050760 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645022523 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003179955 |
Protein GI | 257784738 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
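
The coordinate and length fields above are consistent with each other, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(1049159, 1050760)
print(length)                  # 1602, matching "Gene Length"
print(protein_length(length))  # 533, matching "Protein Length"
```

The strand ("-") does not affect this arithmetic, since start and end are given in replicon coordinates regardless of orientation.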
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGT TTATTAACCG TCCACTTTCG CGTCGTAGCT TCCTTGGTGG AGCAACTGTA GCAGCTGGCC TTGGTCTCAC CGCTTGTGGT GGTGGACAGC AGAAGGCACC TGAAGCTCCA TCTGGAAAGG AAGGCGGCAG CACAATTACT GCTGGTACCG CTTATTCAAC CCAGAATTAT GATCCATCTA CCACTTCATC AGCACTTGCT CTTGGCGTTA ACTGGCAAGT AGTTGAGGGC CTTTATGGTC TTAACTTCCA CGACTACTCT ACGTTCAATG AGCTTGCTAC TGCTGATCCA AAGAAGGTTG ACGATAACAC CTTTGAGATT ACCATCCGTG ACGGCGCAAA GTTCTCTGAT GGCAACGAGG TAAAGGCAGA TGACGTTGTT GAGTCCTTCA ACCGTTCTTC TGCAAAGGGC AGCGTTTACG TCCCAATGCT TGCTCCAATT GCTTCCGTAG AGAAGAAGGA TGACAAGACT GTTACCGTTA AGACTACTAT TGCCAACTTC TCCCTCCTCA AGGAGCGTCT GTCCATCGTT CGCGTTGTTC CAGCAAGCAG CTCCCAGGAT GAGATGAAGA AGCAGCCAAT TGGCTCTGGT CCATGGAAGT TTGAGTCCAT CTCTGACAAC ACTCTTGACC TTGTTCCAAA CACTAACTAC AACGGTGCAA CTCCTGCTAA GGACGAGAAG CTCCATTATG ATGTTCTTAA GGATCCAACT GCTCGTCTGA CCGCACAGCA GGACGGAACC ACCCTTGCTA TGGAAATGGT TCCTGCAGAC GCTGTAGACC AGCTCAAGGG TGCAGGCTGC ACTGTCGACG CTGTTCAGGG CTTTGGTGTT CGTTTTGCAA TGTTTAACCT TGGCAAGGCT CCTTGGGACA ACGTTAAGGT TCGCCAGGCA TGCCTCTATG CACTTGATAC AGACAAGATG CTCTCTAATG CATTCTCTGG TCAGGCTGAG GTTGCTACCA GCTATCTTCC TTCTTCTTTT GCTAACTATC ACAAGGCATC CACTGTTTAT GCACATGATG TTGATAAGGC TAAGAAGCTT ATTTCTGAGT CTGGTATCAC TCCAGGTGAT GTTGTTCTTC GTACCACTGA TAACGAGCAG GTTAAGTCCA TGGCAACTCA GGTCAAGAAC GACCTCGACG CACTTGGATT TAACGTAAAC ATCGTTTCCG ATACTTCTCC AGCAACGTAT GCTGCAATCG ATGCAGCCGA CGGCTCTTGG GACATCTTGC TTGCCCCTGG TGATCCTTCC TGCTTCGGCG CAGACGTTGA TTTGCTGCTC AACTGGTGGT ATGGCGATAA CGTTTGGATG ACCAAGCGTT GCCCATGGAA AGAGTCTGCT GAGTGGAACA AGCTTCACGA GCTCATGAAC AAGGCTCTTG GTCAGTCTGG TGCTGATCAG CAGTCCACTT GGAATGAGTG CTTTGATATC ATCGCTGAGA ACGTGCCTCT GTACCCAGTT CTCTTTGCTA AGACCTTCAC CGCTTCTTGG AGCGAGAAGC CAAATGCAAA TGGTGTTGCA CTTAAGGGCT TTAAGGGCAT TGGTACTACA GGTATGTCCT TCATTGATGT CAACACAGTT ACTGCTAGCT AA
|
Protein sequence | MSEFINRPLS RRSFLGGATV AAGLGLTACG GGQQKAPEAP SGKEGGSTIT AGTAYSTQNY DPSTTSSALA LGVNWQVVEG LYGLNFHDYS TFNELATADP KKVDDNTFEI TIRDGAKFSD GNEVKADDVV ESFNRSSAKG SVYVPMLAPI ASVEKKDDKT VTVKTTIANF SLLKERLSIV RVVPASSSQD EMKKQPIGSG PWKFESISDN TLDLVPNTNY NGATPAKDEK LHYDVLKDPT ARLTAQQDGT TLAMEMVPAD AVDQLKGAGC TVDAVQGFGV RFAMFNLGKA PWDNVKVRQA CLYALDTDKM LSNAFSGQAE VATSYLPSSF ANYHKASTVY AHDVDKAKKL ISESGITPGD VVLRTTDNEQ VKSMATQVKN DLDALGFNVN IVSDTSPATY AAIDAADGSW DILLAPGDPS CFGADVDLLL NWWYGDNVWM TKRCPWKESA EWNKLHELMN KALGQSGADQ QSTWNECFDI IAENVPLYPV LFAKTFTASW SEKPNANGVA LKGFKGIGTT GMSFIDVNTV TAS
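
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names (`clean_sequence`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, and the stop codon set is the standard TAA/TAG/TGA set, assumed here to apply under translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    seq = clean_sequence(raw)
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence from the record, for illustration only.
prefix = "ATGAGCGAGT TTATTAACCG"
print(len(clean_sequence(prefix)), gc_percent(prefix))
```

Applied to the full "Gene sequence" value, `clean_sequence` should yield a string whose length matches the "Gene Length" field, and `gc_percent` can be compared against the rounded "GC content" value.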