Gene Information |
Locus tag | Apar_0917 |
Symbol | |
ID | 8413788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1028299 |
End bp | 1030077 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645022505 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003179937 |
Protein GI | 257784720 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.340529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGATC ATATGGCTGA CAGTTTTTCC CGTCGTTCGT TCTTTGCTCT GACTGGTATC ACCGCTGCAG GCCTTGGCCT TGCTGGCTGT GCTCCATCAA ACTCTCAGGG TGTCGGCTCT TCCAACACTG GAACTGAGCC TCAAGACGGA TCTCCTGCAA ACACTCCTCT GGATCAGCTT CCTCTTCCAG AGAAGGGCAA GAAGTACAAC AATCCTAAAT CTCGTGATGA GGTTCAGGAT GGCGGTACTT TAACTCAACC AATTACTGAA GTTGGACCTC AGTGGAATTA CTACAATCTT GCTGGTAATA CCACTTACAT GAATATTCTT CATGGTCTTA TGAATCCACG TGATTTGTTT TATGGCAATA TTGACGGTAG TAAGTTTGAG CCTAATAAAG ATTATATTAA GGATTATAAG GTAGAGGAGA AGGACGGCAA GCAGGTTGCT ACCCTTACTT TTACTGATCA GGCTAAGTTT AATGATGGTA CCGATATTGA CTGGACCGCA ATTCAAACAG CATATGTTTG TCTGAGCGGT ACAAATGAAA AATTTGAGGT TTCAAATACT GACGGCTATG ACAAGATTGA GTCTGTTGCT CAAGGTGATA CTGCTAAAAC CGCTGTTGTT ACTATGAAAG AAGCAGTCTA TCCAATTGAA ATGGTTTTGT CTTGGGCACT TCATCCAAAG CTTCAAGATC CAGATTTCTT TAATAATGGC TACAACAATG AGCCAAATAA TGATCTTGGT GCTGGCCCTT ATATTGTTGA CTCACACGAT GATTCTTCTG CTACCTTTAA GCCCAATCCT AAGTGGTGGG GAGACGCTCC TAAGCTTGAT ACGCTTGTGT ACAAGCAGAT GGATACTCAA GCTACTATTA ATGCCTTCAA GAATGAAGAA GTAGACACTG CTGGTGGCTC TACCTCAGGT TCTGCAGAGC TTCTCTCTAA CTTCTCTGGC CTTGGTGATA AAGCACAGAT TCGTCGTGGC CTAGGTTTAT CTATTGCTGT TATTGAGGTT AACTCTACTC GTGGTGTACT CCAGGATGTA GCGGTTCGTA AGGCATTCTG TCAAGTTGTT GATCCAGCAA CTCTTGTATC CATTGTGTTC CAGGGCGTTA ACTGGAAAGA AGATACTCCT GGTTCTATGC TTTGGCCGTA TTGGGCTGCG GGCTATGAGA ATAATCTTCC TGATGATGTC AAGAACCTTA AGTCTGCTGA AGAGCGCACC AAGGCCGCTA AGAAGACCCT TGAGGATGCA GGCTATTCTC TCAATGGTGA TTTCTATGAG AAGGATGGTC AGCAGGTTAC CTTTGCTTAC ACGATGTTTG GCGATGGTAC TAATGTAAAG AACCGTGCTG CAGCAATTCA GAAGATGTGT AAAGATGCTG GTATTAATAT GACGCTTGAC TCTCACCCAT CTTCAGAGTT CTCTAAGGTT CTTACCTCTG GTTCTTGGGA TGTCTGCCTC TTTGGTTGGA ATGGAACCTC AACTTCTTAC AACAATGGCG TTCAGCTTTA TGGTTCCACC TCTGCATCCA ACTTTGGTAA GCAGGGTACT GCAGACACCG ATGCAATGTT TGCAAAGGTT GTCTCTACCA AAGACTTTGA TGAGCGTATA AAACTTATGA ATCAGGCAGA GAAAAAGATG ATGGAAACTT ACTGTTATCT TCCAGTTTAC ACAGGCTCTG ATTGCTATGT TGTTAAGAAG GGTCTTGCCA ACTTTGGTCC TAAGCTTTTT GAGGATGTTC CATATACCAT TGTTGGTTGG CAGAAGTAA
|
Protein sequence | MGDHMADSFS RRSFFALTGI TAAGLGLAGC APSNSQGVGS SNTGTEPQDG SPANTPLDQL PLPEKGKKYN NPKSRDEVQD GGTLTQPITE VGPQWNYYNL AGNTTYMNIL HGLMNPRDLF YGNIDGSKFE PNKDYIKDYK VEEKDGKQVA TLTFTDQAKF NDGTDIDWTA IQTAYVCLSG TNEKFEVSNT DGYDKIESVA QGDTAKTAVV TMKEAVYPIE MVLSWALHPK LQDPDFFNNG YNNEPNNDLG AGPYIVDSHD DSSATFKPNP KWWGDAPKLD TLVYKQMDTQ ATINAFKNEE VDTAGGSTSG SAELLSNFSG LGDKAQIRRG LGLSIAVIEV NSTRGVLQDV AVRKAFCQVV DPATLVSIVF QGVNWKEDTP GSMLWPYWAA GYENNLPDDV KNLKSAEERT KAAKKTLEDA GYSLNGDFYE KDGQQVTFAY TMFGDGTNVK NRAAAIQKMC KDAGINMTLD SHPSSEFSKV LTSGSWDVCL FGWNGTSTSY NNGVQLYGST SASNFGKQGT ADTDAMFAKV VSTKDFDERI KLMNQAEKKM METYCYLPVY TGSDCYVVKK GLANFGPKLF EDVPYTIVGW QK
|
| |
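The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the whitespace that separates the ten-letter blocks in the record."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Values copied from the Gene Information table of this record.
    length = gene_length(1028299, 1030077)
    print(length)                  # 1779, matching "Gene Length"
    print(protein_length(length))  # 592, matching "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content; the function strips the block spacing itself, so the field can be pasted in unchanged.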