Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1291 |
Symbol | |
ID | 8414171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1450181 |
End bp | 1453216 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 645022883 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003180306 |
Protein GI | 257785089 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.475725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCAG AAAACAGAGA AGTTGTTGGA GCCGAGTCGC TATTAGGTGG TTCTTCTCGC CATGTAATGG GGGACTATAG CAGGATTGTT CCACCTCAGC ATTTCTTTAG AACAATGGAA AAGACAAAGC GGCCACTTGC GCTTTGTGCA GAGCCCTGTA TGGGGAAGAC CATGTTTCTG CGAGAGCTAG CAGATTATGC GCAGTCAAAT GGATGGAATG TGTATGAGAT TTCGCTTTCA AGTCTTTCGG CTAAAGAGGC CTCTCAGATT CTCTCTAAGA AAAGTACTTC TATCTGCAAT GTCAAGAATA CAAAAGCTCT TAAAAGGCTA GTTATTATTG ATGATTTTCC TCCGTCTGAT GAGTATTTTG TTGCTCGTCA GGTTAAATCG ATTGCGCGTC TTCGCATGGC TGGATGTCTT GTTGCGTTTT CTTTGCCTCC TGAAGCACGT CAATTAATTG ATGAAGTTCC TAGTGTGTAT GTTTTGGGAA AGAACGAGCT TCTCACGTTT ATGCCTGGAA TTGATAATTC TGAACAATCA ATTCTGAATA ATATGAGATT GACACGCGGT ATTCCAACGC TCGTGTATTC TCTGCCGGTT ACATTTTGTG AGCACGGAGA TACTAACGTG CCCATTACTT ATCAGACTAG TCTTGCATGC GTTGCATCGT ATATGCTCAG AAGCTCACTT GGTATTGAGG AACTTAGACT TCGACTTGGC ATGATGTTGA TTGGTGTTGG TTCGTTTGAT GATTTGAGCC GTATATGTGG TCAAGCTGAT CTTGAGTATC TGGCTAAGAT AGAACAAGAT GCACCATTCT TTGGTGTACA CGTAGAAACC AAGCGTTTTT CGTGTCTTCA TGTTACATGC TTTGATGTGT TGAATTTTAA TAAACAGGAG CTCGTGGCTT TGGCGAGCAA GCATGAAAAA CTTATCTTGA AAGCTATAGC TTTGCTGATT GATAGAGAAG ATTTTTCTAA GGCTGCATTT GTAAGTTCTC TGGTTAGAGA AGAGATTACC TGGGAAATTG TGCTTTCTCA TGCAGCAGAG TTTGTAGATG CTGGATATAT TGAGTTGGTA GATAATGCGC TTACTGCTAC ACATTCTGAT TGCACGCTAG AAAATTCCAG TAAGAAAGCC GCAAAAAGAA TGGTGGACGC ACTGTCTAAT ACAAAAAATC CAATAATTGC TAAAGATGCT GAAACAACAT TTGAAAACCT TACATCTTTC AATGGCTTTC TCAAGCAAAC AGCTTATATG ACTTTATTGA AGTTACTTTT GCAAAAGCCT ATGTCGCCTT TAAAGGAAGA TCCTGAATTA AGCCAGCTGG AGAAGAAAAT TGCACTACAT AAACGTGCAG TTGATTTAGG TATGCAAGGA AATTTTAAGT ATGCCCTTCA ACTACTGCTT CTTGAGCAAC AGTACGAGAA AACTTCTTCT ATAACCTCTT CAATTCAAAC AGCTGATATT GAATTGCTTT ACGTATTACT AGGTGTATAC CAAAAAGAAT TTGACTCTCG TAGCTTAAGT GCACTTTCCT TTTTACAAGA AGGCGAAGCA GGCGCACTTA AGGGTTCTGT TGGGCTACTT AAATGCGCTC GCTATCTTTT CGAAAAGAGC TCTTCTGTAG GTAATTTATA TGATACTGAA CAACTTATAA GTCAGTCAGA GCTGCAGGGT AATCGTGTGA TTCAAGTACC TGCGCTTCTT ATTGGAGCTT TTCTCAGCCT TAGAAGCAGA GCGTATCCCA AAGCTCAGCT CCAGGCAAGA AGAGCGGTAA TGCTGAGCAG GGAATGGAAC TCAATATATG TGGCGCAGGT GGGAAAGATA ATCGAAGATA TTGCGGGATT CTTTTTGGGA GTTAAGCCCA CAGAGAAGAG CCTTCAAGCA ATTACCCATC CATCGCTAAA AGCAGTATGT AGAACAATAT ACAAGGCTCT CTTTAAGAGT GTGAAAGGAC ACTCCCCTGT CTGGCTTGAT GTTGTTGAGT ATGGAGTTCC TGAAAATGCC ATGTGGCTTA TAAGAGCACT TTTGTCAGAT GAATCTGAGT TTCAGCAGTG TTTAGAACAG GAAGTCCCAG AAGAATGGCT CCATTATTTA CGTTCAAATG AGGGTAAGCG AGACGTAACA AAATGGAGAA ATTCTCAACA GGGAGCAACG GTTTCCATAA CTGGAAACCC TGAGGTAAAG AATTTGCATG TGGAGCGAAC AAAGAACGCT CACCCGGGGG TGTATATCGC TCTTCTCGGC AGATTTAGTT TGTCGGTTCA AGGAGAGGAG ATTGCCGGTA GAAAAATTGC CTATCGTTCG GCAAAGGCAC TGCTTGTGTA TTTGGCGCTT GCTCACAATC ATATGAGTTT TAGGTCACAA ATTGCACAGC AGATTTGGCC AGAGGCTGAT CAGGGCCACT GGCAAGAGCG TCTGTATCAA GCAACTCGAG TTATTCGCAA AGAAGTGCAG GAGATTCAAA AAGACTGTGA ACCCCTAGAG GCATCTCGGA TTGAAAAGAC ACTTGGATTT AATTCCCAGC AAGTAACGGT AGATATTGAT ATTTTTACGC AGTTAGCAAA GAGTGTGGCG TCATCAAATA GTGATGAAGA CATAGTGCAT CTGGCCAAGC AAGTAGAGAA GTTTTATCAA GGTGATTTAT ATCTGCCCGA GGATGAATGC TTTAGATTTG CAGATCCTAT TCGTATTGCT TTGAGAGATC AATACATAGA TACCATGGTG ATAGCTTCAG CAGCCGCTTT GAGGATTACT CATTATACGC TTGCAGTGCA TTTTGCAGAG CTTGCCTACC TTGTTGACGA TATGCGAGAA GACACGCTCA TGGCACTTAT TCAAGCGCTT AGAAAATGTG GTCGAGCGCA AGATGCACAA CATTATTACG ATTTGTATGT ACAGAAATAC GTTATGAAGC GTAGAAAAAT GCCGTCTAAA CAGCTTAGGA TGATTGCGGG CGCAGAAAAG GGAAAAGAAT CAATAGAAAC TAGTGGTGGA GAAATAACGA AATTAGGATT TTACGACGCC ATGTAG
|
Protein sequence | MESENREVVG AESLLGGSSR HVMGDYSRIV PPQHFFRTME KTKRPLALCA EPCMGKTMFL RELADYAQSN GWNVYEISLS SLSAKEASQI LSKKSTSICN VKNTKALKRL VIIDDFPPSD EYFVARQVKS IARLRMAGCL VAFSLPPEAR QLIDEVPSVY VLGKNELLTF MPGIDNSEQS ILNNMRLTRG IPTLVYSLPV TFCEHGDTNV PITYQTSLAC VASYMLRSSL GIEELRLRLG MMLIGVGSFD DLSRICGQAD LEYLAKIEQD APFFGVHVET KRFSCLHVTC FDVLNFNKQE LVALASKHEK LILKAIALLI DREDFSKAAF VSSLVREEIT WEIVLSHAAE FVDAGYIELV DNALTATHSD CTLENSSKKA AKRMVDALSN TKNPIIAKDA ETTFENLTSF NGFLKQTAYM TLLKLLLQKP MSPLKEDPEL SQLEKKIALH KRAVDLGMQG NFKYALQLLL LEQQYEKTSS ITSSIQTADI ELLYVLLGVY QKEFDSRSLS ALSFLQEGEA GALKGSVGLL KCARYLFEKS SSVGNLYDTE QLISQSELQG NRVIQVPALL IGAFLSLRSR AYPKAQLQAR RAVMLSREWN SIYVAQVGKI IEDIAGFFLG VKPTEKSLQA ITHPSLKAVC RTIYKALFKS VKGHSPVWLD VVEYGVPENA MWLIRALLSD ESEFQQCLEQ EVPEEWLHYL RSNEGKRDVT KWRNSQQGAT VSITGNPEVK NLHVERTKNA HPGVYIALLG RFSLSVQGEE IAGRKIAYRS AKALLVYLAL AHNHMSFRSQ IAQQIWPEAD QGHWQERLYQ ATRVIRKEVQ EIQKDCEPLE ASRIEKTLGF NSQQVTVDID IFTQLAKSVA SSNSDEDIVH LAKQVEKFYQ GDLYLPEDEC FRFADPIRIA LRDQYIDTMV IASAAALRIT HYTLAVHFAE LAYLVDDMRE DTLMALIQAL RKCGRAQDAQ HYYDLYVQKY VMKRRKMPSK QLRMIAGAEK GKESIETSGG EITKLGFYDA M
|
| |