Gene Apar_1291 details


Gene Information

Locus tag: Apar_1291
Symbol:
ID: 8414171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1450181
End bp: 1453216
Gene Length: 3036 bp
Protein Length: 1011 aa
Translation table: 11
GC content: 42%
IMG OID: 645022883
Product: transcriptional regulator, SARP family
Protein accession: YP_003180306
Protein GI: 257785089
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
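
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1450181, 1453216)
print(length)                     # 3036
print(protein_length_aa(length))  # 1011
```

Both results agree with the Gene Length (3036 bp) and Protein Length (1011 aa) fields listed for this record.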


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.475725
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCAG AAAACAGAGA AGTTGTTGGA GCCGAGTCGC TATTAGGTGG TTCTTCTCGC 
CATGTAATGG GGGACTATAG CAGGATTGTT CCACCTCAGC ATTTCTTTAG AACAATGGAA
AAGACAAAGC GGCCACTTGC GCTTTGTGCA GAGCCCTGTA TGGGGAAGAC CATGTTTCTG
CGAGAGCTAG CAGATTATGC GCAGTCAAAT GGATGGAATG TGTATGAGAT TTCGCTTTCA
AGTCTTTCGG CTAAAGAGGC CTCTCAGATT CTCTCTAAGA AAAGTACTTC TATCTGCAAT
GTCAAGAATA CAAAAGCTCT TAAAAGGCTA GTTATTATTG ATGATTTTCC TCCGTCTGAT
GAGTATTTTG TTGCTCGTCA GGTTAAATCG ATTGCGCGTC TTCGCATGGC TGGATGTCTT
GTTGCGTTTT CTTTGCCTCC TGAAGCACGT CAATTAATTG ATGAAGTTCC TAGTGTGTAT
GTTTTGGGAA AGAACGAGCT TCTCACGTTT ATGCCTGGAA TTGATAATTC TGAACAATCA
ATTCTGAATA ATATGAGATT GACACGCGGT ATTCCAACGC TCGTGTATTC TCTGCCGGTT
ACATTTTGTG AGCACGGAGA TACTAACGTG CCCATTACTT ATCAGACTAG TCTTGCATGC
GTTGCATCGT ATATGCTCAG AAGCTCACTT GGTATTGAGG AACTTAGACT TCGACTTGGC
ATGATGTTGA TTGGTGTTGG TTCGTTTGAT GATTTGAGCC GTATATGTGG TCAAGCTGAT
CTTGAGTATC TGGCTAAGAT AGAACAAGAT GCACCATTCT TTGGTGTACA CGTAGAAACC
AAGCGTTTTT CGTGTCTTCA TGTTACATGC TTTGATGTGT TGAATTTTAA TAAACAGGAG
CTCGTGGCTT TGGCGAGCAA GCATGAAAAA CTTATCTTGA AAGCTATAGC TTTGCTGATT
GATAGAGAAG ATTTTTCTAA GGCTGCATTT GTAAGTTCTC TGGTTAGAGA AGAGATTACC
TGGGAAATTG TGCTTTCTCA TGCAGCAGAG TTTGTAGATG CTGGATATAT TGAGTTGGTA
GATAATGCGC TTACTGCTAC ACATTCTGAT TGCACGCTAG AAAATTCCAG TAAGAAAGCC
GCAAAAAGAA TGGTGGACGC ACTGTCTAAT ACAAAAAATC CAATAATTGC TAAAGATGCT
GAAACAACAT TTGAAAACCT TACATCTTTC AATGGCTTTC TCAAGCAAAC AGCTTATATG
ACTTTATTGA AGTTACTTTT GCAAAAGCCT ATGTCGCCTT TAAAGGAAGA TCCTGAATTA
AGCCAGCTGG AGAAGAAAAT TGCACTACAT AAACGTGCAG TTGATTTAGG TATGCAAGGA
AATTTTAAGT ATGCCCTTCA ACTACTGCTT CTTGAGCAAC AGTACGAGAA AACTTCTTCT
ATAACCTCTT CAATTCAAAC AGCTGATATT GAATTGCTTT ACGTATTACT AGGTGTATAC
CAAAAAGAAT TTGACTCTCG TAGCTTAAGT GCACTTTCCT TTTTACAAGA AGGCGAAGCA
GGCGCACTTA AGGGTTCTGT TGGGCTACTT AAATGCGCTC GCTATCTTTT CGAAAAGAGC
TCTTCTGTAG GTAATTTATA TGATACTGAA CAACTTATAA GTCAGTCAGA GCTGCAGGGT
AATCGTGTGA TTCAAGTACC TGCGCTTCTT ATTGGAGCTT TTCTCAGCCT TAGAAGCAGA
GCGTATCCCA AAGCTCAGCT CCAGGCAAGA AGAGCGGTAA TGCTGAGCAG GGAATGGAAC
TCAATATATG TGGCGCAGGT GGGAAAGATA ATCGAAGATA TTGCGGGATT CTTTTTGGGA
GTTAAGCCCA CAGAGAAGAG CCTTCAAGCA ATTACCCATC CATCGCTAAA AGCAGTATGT
AGAACAATAT ACAAGGCTCT CTTTAAGAGT GTGAAAGGAC ACTCCCCTGT CTGGCTTGAT
GTTGTTGAGT ATGGAGTTCC TGAAAATGCC ATGTGGCTTA TAAGAGCACT TTTGTCAGAT
GAATCTGAGT TTCAGCAGTG TTTAGAACAG GAAGTCCCAG AAGAATGGCT CCATTATTTA
CGTTCAAATG AGGGTAAGCG AGACGTAACA AAATGGAGAA ATTCTCAACA GGGAGCAACG
GTTTCCATAA CTGGAAACCC TGAGGTAAAG AATTTGCATG TGGAGCGAAC AAAGAACGCT
CACCCGGGGG TGTATATCGC TCTTCTCGGC AGATTTAGTT TGTCGGTTCA AGGAGAGGAG
ATTGCCGGTA GAAAAATTGC CTATCGTTCG GCAAAGGCAC TGCTTGTGTA TTTGGCGCTT
GCTCACAATC ATATGAGTTT TAGGTCACAA ATTGCACAGC AGATTTGGCC AGAGGCTGAT
CAGGGCCACT GGCAAGAGCG TCTGTATCAA GCAACTCGAG TTATTCGCAA AGAAGTGCAG
GAGATTCAAA AAGACTGTGA ACCCCTAGAG GCATCTCGGA TTGAAAAGAC ACTTGGATTT
AATTCCCAGC AAGTAACGGT AGATATTGAT ATTTTTACGC AGTTAGCAAA GAGTGTGGCG
TCATCAAATA GTGATGAAGA CATAGTGCAT CTGGCCAAGC AAGTAGAGAA GTTTTATCAA
GGTGATTTAT ATCTGCCCGA GGATGAATGC TTTAGATTTG CAGATCCTAT TCGTATTGCT
TTGAGAGATC AATACATAGA TACCATGGTG ATAGCTTCAG CAGCCGCTTT GAGGATTACT
CATTATACGC TTGCAGTGCA TTTTGCAGAG CTTGCCTACC TTGTTGACGA TATGCGAGAA
GACACGCTCA TGGCACTTAT TCAAGCGCTT AGAAAATGTG GTCGAGCGCA AGATGCACAA
CATTATTACG ATTTGTATGT ACAGAAATAC GTTATGAAGC GTAGAAAAAT GCCGTCTAAA
CAGCTTAGGA TGATTGCGGG CGCAGAAAAG GGAAAAGAAT CAATAGAAAC TAGTGGTGGA
GAAATAACGA AATTAGGATT TTACGACGCC ATGTAG
 
Protein sequence
MESENREVVG AESLLGGSSR HVMGDYSRIV PPQHFFRTME KTKRPLALCA EPCMGKTMFL 
RELADYAQSN GWNVYEISLS SLSAKEASQI LSKKSTSICN VKNTKALKRL VIIDDFPPSD
EYFVARQVKS IARLRMAGCL VAFSLPPEAR QLIDEVPSVY VLGKNELLTF MPGIDNSEQS
ILNNMRLTRG IPTLVYSLPV TFCEHGDTNV PITYQTSLAC VASYMLRSSL GIEELRLRLG
MMLIGVGSFD DLSRICGQAD LEYLAKIEQD APFFGVHVET KRFSCLHVTC FDVLNFNKQE
LVALASKHEK LILKAIALLI DREDFSKAAF VSSLVREEIT WEIVLSHAAE FVDAGYIELV
DNALTATHSD CTLENSSKKA AKRMVDALSN TKNPIIAKDA ETTFENLTSF NGFLKQTAYM
TLLKLLLQKP MSPLKEDPEL SQLEKKIALH KRAVDLGMQG NFKYALQLLL LEQQYEKTSS
ITSSIQTADI ELLYVLLGVY QKEFDSRSLS ALSFLQEGEA GALKGSVGLL KCARYLFEKS
SSVGNLYDTE QLISQSELQG NRVIQVPALL IGAFLSLRSR AYPKAQLQAR RAVMLSREWN
SIYVAQVGKI IEDIAGFFLG VKPTEKSLQA ITHPSLKAVC RTIYKALFKS VKGHSPVWLD
VVEYGVPENA MWLIRALLSD ESEFQQCLEQ EVPEEWLHYL RSNEGKRDVT KWRNSQQGAT
VSITGNPEVK NLHVERTKNA HPGVYIALLG RFSLSVQGEE IAGRKIAYRS AKALLVYLAL
AHNHMSFRSQ IAQQIWPEAD QGHWQERLYQ ATRVIRKEVQ EIQKDCEPLE ASRIEKTLGF
NSQQVTVDID IFTQLAKSVA SSNSDEDIVH LAKQVEKFYQ GDLYLPEDEC FRFADPIRIA
LRDQYIDTMV IASAAALRIT HYTLAVHFAE LAYLVDDMRE DTLMALIQAL RKCGRAQDAQ
HYYDLYVQKY VMKRRKMPSK QLRMIAGAEK GKESIETSGG EITKLGFYDA M
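
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below is one possible way to do that, not the method used by the source database: it strips the whitespace, computes a GC percentage, and translates the DNA. It assumes the gene sequence is shown in coding orientation (it begins with ATG) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses these same amino-acid assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record (60 nucleotides).
first_row = "ATGGAGTCAG AAAACAGAGA AGTTGTTGGA GCCGAGTCGC TATTAGGTGG TTCTTCTCGC"
dna = clean(first_row)
print(translate(dna))  # MESENREVVGAESLLGGSSR
```

The translated output matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared against the GC content field.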