Gene Information |
Locus tag | Apar_0823 |
Symbol | |
ID | 8413688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 907438 |
End bp | 908739 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022405 |
Product | ABC-type sugar transport system periplasmic component-like protein |
Protein accession | YP_003179843 |
Protein GI | 257784626 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
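As a consistency check on the coordinates listed above, the following minimal Python sketch recomputes the gene and protein lengths. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 907438, End bp 908739.
length = gene_length(907438, 908739)
print(length, protein_length(length))  # 1302 433
```

Under these assumptions the computed values agree with the listed gene length of 1302 bp and protein length of 433 aa.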
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.596738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000464353 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAAGG TCACTACTAT GGATGTGTCT CGTCGCAGTT TTCTAAAGTT CTGTGGTCTG GGCGTTCTTA GCGTTGGTGG ATCTAGCGTT CTTGCAGCTT GCGCACCTAG CGAGAAAAAA GATGATACCG CTAAAAACAC AAAGGTAGAG AACAAGGATA AGCCTGTTGT CTTCTTTAAC CGTCAGCCTT CCAACAGCTC AACTGGTGAA CTTGATACCA ACACCCTTAA CTTCAACAAG GACACCTATT ACGTTGGCTT TGATGCAGTT CAGGGTGCAG AGCTCCAGGG CCAGATGGTT CTTGATTACA TCAAGGCTCA TGCAGCTGAG CTTGATCGCA ACAAGGACGG CATCATTGGT TACGTTCTTG CAATTGGTGA CATTGGTCAC AATGACTCTA TCGCTCGTAC TCGTGGCGTT CGTAAGGCAC TTGGTACTGG CATTGAGAAG GATGGCAAGA TTATTTCTGA TCCAGTAGGT ACTAATACTG ATGGCTCCGC TTCTGTTGTC CAGGACGGTA AGCTTGAGGT TGGTGGCAAG TCCTACACCA TTCGTGAGCT TGCTTCTCAG GAGATGAAGA ATACCGCTGG TGCAACATGG GATGCTGCAA CCGCTGGTAA CGCAATTGCT GCTTGGTCTT CTTCCTTCGG TGATCAGATT GATGTTGTTG TTTCCAATAA CGACGGTATG GGTATGTCTA TCTTCAACGC ATGGTCCAAG GCTCAGAAGG TCCCAACCTT TGGTTACGAT GCTAACTCTG ACGCTGTTGC AGCTATCGCT GATGGCTATG CAGGCACCAT TTCTCAGCAC CCAGACGTTC AGGCTTACCT GACACTTCGT CTGCTCCGTA ACGCTCTTGA CGGCGCTGAT ATCAATACTG GTATTGAGTC TGCAGACGAT GCTGGCAATA AGATTGACTC CAAGGATTAC AAGTATGTTG CAGAGCAGCG TTCTTACTAT GCTCTGAACC TTGCAGTTAC TGCTGAGAAC TACAAGGACA ATCTTGATGC AACTACTACT TACAAGGATG CTTCTGCTCA GCTTAGCGCA GACAAGCACC CAGAGAAGAA GGTTTGGCTC AACACCTATA ACTCTGGTGA CAACTTCCTT GGTTCAACCT ATGTTCCACT GCTCAAGAAG TATGCTCCAC TTCTCAACCT CAACGTTGAG TTTATCGCAG GCGATGGCCA GACTGAGTCC AACATTACCA ACCGTCTTGG CAACCCTGAT GAGTACGATG CATTTGCATT CAACATGGTT AAGACCGACA ACGGTTCTTC CTATACCCAG CTGCTTAAGT AA
|
Protein sequence | MRKVTTMDVS RRSFLKFCGL GVLSVGGSSV LAACAPSEKK DDTAKNTKVE NKDKPVVFFN RQPSNSSTGE LDTNTLNFNK DTYYVGFDAV QGAELQGQMV LDYIKAHAAE LDRNKDGIIG YVLAIGDIGH NDSIARTRGV RKALGTGIEK DGKIISDPVG TNTDGSASVV QDGKLEVGGK SYTIRELASQ EMKNTAGATW DAATAGNAIA AWSSSFGDQI DVVVSNNDGM GMSIFNAWSK AQKVPTFGYD ANSDAVAAIA DGYAGTISQH PDVQAYLTLR LLRNALDGAD INTGIESADD AGNKIDSKDY KYVAEQRSYY ALNLAVTAEN YKDNLDATTT YKDASAQLSA DKHPEKKVWL NTYNSGDNFL GSTYVPLLKK YAPLLNLNVE FIAGDGQTES NITNRLGNPD EYDAFAFNMV KTDNGSSYTQ LLK
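The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal illustration, not the database's own method: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for elongation (table 11 differs in which start codons it permits), and translates the first 30 bases of the gene sequence. The helper name `translate` is hypothetical, and the spaces that group the sequence into blocks of ten are stripped before translation.

```python
from itertools import product

# Standard codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Assumes the sequence is given 5'->3' on the coding strand and starts in frame.
    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGAGAAAGG TCACTACTAT GGATGTGTCT"))  # MRKVTTMDVS
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would allow the complete 433-residue protein to be compared in the same way.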
|
| |