Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0198 |
Symbol | |
ID | 8413046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 237003 |
End bp | 238163 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645021767 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003179222 |
Protein GI | 257784005 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.719886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000218103 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACTAGCA AACCGCAATT TGTCGGCGAG CGAGTGAGAC TTACATCTTT TGTTAGCGCT GCGGGCAAAA GAAATAAGGG TGCAAAGTGC ACGGATGTAT ATAGTGTTAC GAACTCACAT GGATTCGTAC CATCAACTGA ATACTTTTCC AAAGAGGTCT TTAGCAAGGA ACTGGAAGCA TATCGCCTTG TCGAAAGAGG AATGCTTGCC TATAACCCGT CTCGTATTAA CGTAGGTTCA ATTGCACTAC AAGAGTCTGC CGATAGAGTG GTTGTAAGCC CGCTCTATGT AGTCTTTTCG GTTGATACGA GACATCTGGC GCCGGGTTAC CTGCTGCGAT TTCTCAAGAG TAAGCCTGGA CTCAACCAGA TAGCGTTTAG GTCGTCTGGT ACCGTGCGCA GCAATCTCAA ATTCGATGCA CTTAGCTTGC TTGAGATGCC CTTGCCTAGT ATTGATGTAC AGGAAAAAAG ACTTGTTGTC TTATCGCGAC TTGAAAAGCA GATTGAGGCA AGAGGAGAAT TTATTGCTTC GCTCGACACC CTCGTCAAAT CCCGGTTTAT CGAGATGTTC GGGGACCCAA TCGCTCTAAA TTCGAATAAG AAGTCGCGCC TAGATAGCTT TGCCAAAATA ATCACAGGGA ATACCCCATC AAGAAAAAAA CCTGAGTACT ACGGCGACTA TATTGAATGG ATTAAAACAG ACAACATCAC ATCAACCCCA GTGCTTACAA AGGCAGCTGA ATCGCTTTCT GAGGACGGGG CATCTGCGGG CAGGATTGCT CCATCAGGAA GCGTGCTGAT GTCATGCATT GCAGGAAGCG TTAAATCCAT AGGCAAAGTG GCGATTGCCG ATAGACCCGT AGCATTCAAC CAACAAATAA ATGCGATTAT TCCTGCAGAT GGAATCTTGA CTGAGTATCT TTACTGGATG CTGTCGCTTT CAAAAGACTA TCTTTGCTCT GATATCAACA TGCAGCTCAA AGGCATCCTT AACAAAACAG CTCTATCAAG AAAAATGTTC TGCGTGCCCC CTCCATCTCT TCAGCAAGAG TTCGCGACCT TTGTCCGGCA GGTCGACAAA CTGCGAGTTG TAGCAGAAGA GCAGAAGAAA AAACTCCAGA CGTTGTACGA CAGTCTCGCT CAGGAATACT TCGCGATATA G
|
Protein sequence | MTSKPQFVGE RVRLTSFVSA AGKRNKGAKC TDVYSVTNSH GFVPSTEYFS KEVFSKELEA YRLVERGMLA YNPSRINVGS IALQESADRV VVSPLYVVFS VDTRHLAPGY LLRFLKSKPG LNQIAFRSSG TVRSNLKFDA LSLLEMPLPS IDVQEKRLVV LSRLEKQIEA RGEFIASLDT LVKSRFIEMF GDPIALNSNK KSRLDSFAKI ITGNTPSRKK PEYYGDYIEW IKTDNITSTP VLTKAAESLS EDGASAGRIA PSGSVLMSCI AGSVKSIGKV AIADRPVAFN QQINAIIPAD GILTEYLYWM LSLSKDYLCS DINMQLKGIL NKTALSRKMF CVPPPSLQQE FATFVRQVDK LRVVAEEQKK KLQTLYDSLA QEYFAI
|
| |