Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1058 |
Symbol | |
ID | 8413931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1202256 |
End bp | 1203257 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022647 |
Product | periplasmic solute binding protein |
Protein accession | YP_003180077 |
Protein GI | 257784860 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.985578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGA ATCCAACCGT TTCTCGTCGT AGCGCACTTG TAGCATCCGC TTTTGGCGTG AGTGCTATTC TTGCTGGCTG CAAGCCTGAG GCTCAAAAGC AAGAAGGAGC TTCTGATAAC AAAGACCAGA AGAAGCTCAG TATTGTGGCA AGTTTTTATC CTATGTATGA TTTTGCTAAG CGCATCGCTG GCGATCATGC TGAGGTAACA TGCCTAGTTC CAGCAGGTAC TGAGCCACAT GATTGGGAGC CATCCAGCAA GGATATGAAG ACTATCCAGG AAGCAGACTT CTTGATCTAC AACGGTGCTG GTATGGAGCA CTGGGTCAAG GACGTACTTG ATGGACTTGG TTCTGGCACA AAACTTACAT CAGTTGAGAC CAGCAAAGAC GTCAAACTTC TTGAGCTTGA AGAGGACGAT GATCACGATC ATGAGGAGAA GAAAGACGAC CACGATCATG ACCACGATCA CGACCACGGT GGAACCGATC CTCACGTTTG GCTTAGCCCT TTGAACGCAA AGATTCAGAT GAAGAATATT TGTGATGCTC TCTCCGAGAA GGATTCTGAG CACAAGACAG AGTATGCTGC CAATCTTGAT AAGGCAAATG CTGATTTTGA TACGCTTGAT AGTGAGTTCC ATAAGGGTCT TGACCCTCTA CCTAACAAGA CCATCGTTGT TTCTCACCAG GCATTTGGTT ATTTGTGTGA GGCTTACGGT CTTACTCAGA TGCCAATCGA GGGTGTTGAG GCAGATGCCG AGCCAAATGC TCAGGAAATG AAAGAGATTA CTGAGTTTGT AAAAGAGCAT AATGTAAAGG TCATTTTTAC TGAGGAATTG GTAAGCCCTA AGGTTGCTCA GGCAATTGCT GAGGCAACTG GAGCTCGCGT TGAAGAGCTT AACCCACTTG AGGGTTTGAC TGATGAAGAG CTTAAGGCGG GCGAGGATTA CCTGTCTGTT ATGCGTGACA ACCTCAAAGC CCTTGAGGGT GCTCTCGCAT AG
|
Protein sequence | MNLNPTVSRR SALVASAFGV SAILAGCKPE AQKQEGASDN KDQKKLSIVA SFYPMYDFAK RIAGDHAEVT CLVPAGTEPH DWEPSSKDMK TIQEADFLIY NGAGMEHWVK DVLDGLGSGT KLTSVETSKD VKLLELEEDD DHDHEEKKDD HDHDHDHDHG GTDPHVWLSP LNAKIQMKNI CDALSEKDSE HKTEYAANLD KANADFDTLD SEFHKGLDPL PNKTIVVSHQ AFGYLCEAYG LTQMPIEGVE ADAEPNAQEM KEITEFVKEH NVKVIFTEEL VSPKVAQAIA EATGARVEEL NPLEGLTDEE LKAGEDYLSV MRDNLKALEG ALA
|
| |