Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0604 |
Symbol | |
ID | 8413461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 673442 |
End bp | 674752 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645022179 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003179625 |
Protein GI | 257784408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.293813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGTA TCAATCGTAG GCAGTTTGTC ACCTTAAGTG CATCAGCTCT TTGCTCACTG GGTCTTGTTG CTTGTGGAGG AAATAACAAC CAAAAGCAGG GAGGATCTGA TTCAGCAAAA GGTTCAGTTT ATTTTCTAAA CTTCAAGCCT GAGGCTGATC AGCAGTGGAA AGATCTTGCA GCTAAGTATA CTGAGGAAAC TAAGGTTCCA GTAAAGGTAG TAACTGCTGC TTCTGATACC TATGCACAGA CATTCCAGTC TGAGATTAAC AAGGACGCTT CTGTAGCACC AACACTGTTC CAGACAAATG GACCTTCCGG TCTTATTGGT GTAAAGGATT ACTGTATTGA TCTTAAGGGC GCCAAGATTC TAGATGAGCT CACAAATGAT GCTCTTAAGC TTCAGGAGGA TGGCGTTGTC TATGGCGTTG ACTATGTAGA GGAAGACTAC GGCATTATCT ACAACAAGAA GTTGCTTGAG AAGGCAGGCT ATAAGGGAGA TGACATCAAG GACTTTGCTT CTCTGAAGAA GGTTGTTGAG GATATTCAGT CTCGCAAAGC AGAACTTGGT GTAAAGGGAG CATTTACCAG TGCTGGTCTT GATTCAAGTT CCGACTGGCG CTTTACCACT CACTTGGCTA ACCTCCCTCT TTACTATGAG TTCAAAGACA CTGGAAAGCC AGATGCTAAA GAGATTAAAG GAACCTATTT GCCTAACTAC AAGCAAATTT TTGATCTCTA TATTAACAAT GCTACTTGTA GTCCTACTGA GCTTGCTGGT AAAACGGGCG ATGACGCTGT TGCAGAGTTT GTTAACGGCG AGGCTGTCTT CTATCAGAAT GGTACTTGGG CATACAATGA CATCAAGGGA CTTGGTGACG ATGCTCTTGG CATGATTCCA ATTTATATTG GTGTTAAGGG TGAAGAGAAG CAGGGTATGT GCTCTGGCGG TGAGAATTAT TGGTGCGTTA GCTCAAAGGC GGACGACGCT TCCCAGAAGG CAACACTTGA TTTTATGTAC TGGTGTGTAA CTTCTGATAC TGCTACTACA GCAATTGCAG AAGATATGGG CCTTACTATT CCATTCAAGA ATGCAAAAGC AACTAAGAAT GCTCTTGCAA ACATTGCAGC TGAGTACGCA AAGAAGGGCA ATGAGTCCGT AGCTTGGGAC TTCATTTACA TTCCTTCACA GGAGTGGAAG AACAACGTTT CCAGCGCACT TAAGGGTTAT GCAGCAGGAA CAGAGGACTG GGATGCCGTA AAGTCTGCAT TCGTTGATGG TTGGAAGACT GAAAAAGAGG CCAACGCATA G
|
Protein sequence | MSGINRRQFV TLSASALCSL GLVACGGNNN QKQGGSDSAK GSVYFLNFKP EADQQWKDLA AKYTEETKVP VKVVTAASDT YAQTFQSEIN KDASVAPTLF QTNGPSGLIG VKDYCIDLKG AKILDELTND ALKLQEDGVV YGVDYVEEDY GIIYNKKLLE KAGYKGDDIK DFASLKKVVE DIQSRKAELG VKGAFTSAGL DSSSDWRFTT HLANLPLYYE FKDTGKPDAK EIKGTYLPNY KQIFDLYINN ATCSPTELAG KTGDDAVAEF VNGEAVFYQN GTWAYNDIKG LGDDALGMIP IYIGVKGEEK QGMCSGGENY WCVSSKADDA SQKATLDFMY WCVTSDTATT AIAEDMGLTI PFKNAKATKN ALANIAAEYA KKGNESVAWD FIYIPSQEWK NNVSSALKGY AAGTEDWDAV KSAFVDGWKT EKEANA
|
| |