Gene Apar_0916 details


Gene Information

Locus tag: Apar_0916
Symbol:
ID: 8413786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1025666
End bp: 1028011
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 50%
IMG OID: 645022503
Product: putative PAS/PAC sensor protein
Protein accession: YP_003179936
Protein GI: 257784719
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
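
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1025666
end_bp = 1028011
gene_length_bp = 2346
protein_length_aa = 781


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp))            # 2346
print(expected_protein_length(gene_length_bp))  # 781
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.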


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.557683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGATA ATAAGCTGAG TGTGAGACGT GTTGAGGGAG TTGTTGACGG AACCGTTGCT 
GCAACTCTGT CTCAGTTGCT ACGACTGACC AGTGATGCAG TCATCGTTTT TAATATGGAG
GGCATAGTTC TTCTCGCCAA TGAGGAAGCC GAGACCTTAT TTGCTAAGGC TGACGGTACA
ATTGCTGGCC TGGATGTGCG CTTTTTGTTT GCCCCTGCAA ATCAAGAAAA TTTGCAAGAG
GCGTTTTCTG TAGAGTCGTT GCCGTTTGCG ATTGATGGTT CTACCTCTAT GGTGACTGCT
CCCTCGCAAG ATGGTACGTC GCAGGTGCTG AGTGTGCGCG CAGATTATGT TGGCGCACCT
ACGCAAGCCA TTGTTTTGTC AGCATCTAAG CTTCGTGATC TGTCTAGTTC TATGCACGAT
GATGAACGAA TGGTGTTGGA CCTTCGACGC GCCAATAAGA GGCTCTCAGG AGCGCTCCAG
ATTGTTTTGG ATACGCTTGA CTCGGAAGAT ATGAGTCAGC TTATTGAACG TGTGTTAGAA
GAGCTCTCAG AGACTGTTGA GGCCGATGGC GCGCTTATTT ATCTTGCCGA GCAAGACGGG
TATCACCTTC ATGGTGTTAC GGAGTCACTG AGAAGTGCAT TACTGAGCGG TATTCCTCGC
TATTTTGCAT TTAACAGCTC TTTGGAGCGT TTGTTGTTTT ATTCTGAGCA CGCGCTCCGC
TTGCACACTG TGCCCCTGAA CTCGGATGCT CTTAAGCAGG GAAGGGTTAA AAAGCGCAAT
TTAGTTAACG AGGAAACACG CGAGGTCATA ACCGTTGACG CAAGTCATCT TCCTCCGTTT
ACCAGTTTTC TTGTGGTGCC TGTGTGGTTT GGTGGTCACA TTGTTGCCTT GATTGAGGTG
GGTTGGGAGC GCAAGCGTGC GCTTCTGGTC GAGGATGCAA GACTTCTTGA CTCTATTGCA
AATTACCTCT CAGTGCAGCT TGTAGGTGCG TTGTCAGCCA TGAGGACGCA GCGTAGAGAT
ATGCTTCGTG AGGCATTAGC TCGCGTACGC CAAGGATTAC TCCATAGCAC TGCAGAGGGT
GAGAAGATCT CTGGTGAACG TCTGCAGCAG GTTATGAGGT CGGTGGGAAC TGATCTTAAT
ACACAGGTTT TCTCGATAGA TTGCTGCGAG GTAACGGGCA ACATTACACT TCATATGCTT
GACGCTGAGC AAGGATTTGC TGATGCTCAG GACGGTGGCA CGCAAAGTGT GGCTGAGGCG
GATAATGACG CACAAGTCAC AGCTGAGCAG CAAGATAGTG GCAAGCAGAT TGCGCTTCCA
TCGACGGTTG AAGGACTCAA GACAGGGGAG GGCGAGGCAT CAGTTAAAGA GGTTGAGCTT
GATTCTGAGC TGAGCCGAGC GCTTGCCGCA CAAGGTTTGC CTTGCAAGGG AGCTGTGCTT
TTCTTGGGCA GGTTTGCAGG AGAGCAGCAT ACCTGGCTTT TCTTGCGAGA AGAAAATGCT
GAGCCCTTGA GCGACATTGA GTTTGATTTC TTAGATCGTG TGATGCTTCT TGTTCACTCG
CTGGTAGTTG GCGCCGAAGA GAGTCAGCAG AACAAACACA TTTCGCAGGC GCTGCAGTCA
GGCATGAAGA ATGAATTGCA ACAGGTCGAG GGAATCTCTG CCGAGGGAAT TTATTCATCT
GCAACCGCCG ATGCGTTTGT TGGTGGCGAT TTTTACAGCA TGATTAAGCT GCCAGGCCGT
CGCGCGTGTA TCATTATGGG CGATGTTTCT GGTAAGGGTA TTGAGGCGGC ATCCATTTCA
TCCGCAGTTA AAACGGCACT TTCTGCTTAT GCATGGGAAG GAAGAACCCC TGCTCGCATG
GTGGCAACCC TGAACGAATT CTTGTTAGGT TTCTCAAGAG TAGAGACGTT TGCAACGCTT
TTTGTTGGCA TTGTTGACCT TACTACCTCA TCACTTATGT ACTGTTCTGC TGGCCATCCA
CCTGCAATTC TGGTGTCTGC TCAATCGGGT GACGCTGAGC TTTTGGATGT ACAGTCGGGC
GTTGTGGGCG CGTTTCATGA CATGGAGTAC AAAAACGGTA CCGTTTGCTT GCATGAGGGC
GATATCCTGC TACTTTACAC TGATGGAACC ACAGAGGCAC GTAGTCCTGA AGGCGCTTTT
TTTGGTGAGA CTGGCTTGCG CGACATGATT ATGAATGAGG TTCCTCGTGG GTTTGATGGC
TTGCTAAATA GGTTTTTGAA TACGCTTGAC CGCTATACTG GCAGAAGACT TGACGATGAT
GTTGCAATGG TTGCTTTGCG CTTTGACGAG CTTGGTAATG CCGATTCTGG CAAGAAGAGC
AACTAG
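
The gene sequence is printed in space-separated blocks of 10 nucleotides, so it needs cleaning before any computation. The sketch below strips the whitespace and computes the GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input; the full sequence can be pasted in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as sample input.
sample = "GTGGCAGATA ATAAGCTGAG TGTGAGACGT GTTGAGGGAG TTGTTGACGG AACCGTTGCT "
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```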
 
Protein sequence
MADNKLSVRR VEGVVDGTVA ATLSQLLRLT SDAVIVFNME GIVLLANEEA ETLFAKADGT 
IAGLDVRFLF APANQENLQE AFSVESLPFA IDGSTSMVTA PSQDGTSQVL SVRADYVGAP
TQAIVLSASK LRDLSSSMHD DERMVLDLRR ANKRLSGALQ IVLDTLDSED MSQLIERVLE
ELSETVEADG ALIYLAEQDG YHLHGVTESL RSALLSGIPR YFAFNSSLER LLFYSEHALR
LHTVPLNSDA LKQGRVKKRN LVNEETREVI TVDASHLPPF TSFLVVPVWF GGHIVALIEV
GWERKRALLV EDARLLDSIA NYLSVQLVGA LSAMRTQRRD MLREALARVR QGLLHSTAEG
EKISGERLQQ VMRSVGTDLN TQVFSIDCCE VTGNITLHML DAEQGFADAQ DGGTQSVAEA
DNDAQVTAEQ QDSGKQIALP STVEGLKTGE GEASVKEVEL DSELSRALAA QGLPCKGAVL
FLGRFAGEQH TWLFLREENA EPLSDIEFDF LDRVMLLVHS LVVGAEESQQ NKHISQALQS
GMKNELQQVE GISAEGIYSS ATADAFVGGD FYSMIKLPGR RACIIMGDVS GKGIEAASIS
SAVKTALSAY AWEGRTPARM VATLNEFLLG FSRVETFATL FVGIVDLTTS SLMYCSAGHP
PAILVSAQSG DAELLDVQSG VVGAFHDMEY KNGTVCLHEG DILLLYTDGT TEARSPEGAF
FGETGLRDMI MNEVPRGFDG LLNRFLNTLD RYTGRRLDDD VAMVALRFDE LGNADSGKKS
N
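
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the sequence is given 5' to 3' on the coding strand. Table 11 uses the standard amino acid assignments but allows alternative start codons such as GTG, which are read as methionine (M) at the first position. The function name `translate_table11` is illustrative, and only the first 60 nucleotides of the gene are used as input.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon assignments, enumerated in TCAG order.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


first_60_nt = "GTGGCAGATA ATAAGCTGAG TGTGAGACGT GTTGAGGGAG TTGTTGACGG AACCGTTGCT"
print(translate_table11(first_60_nt))  # MADNKLSVRRVEGVVDGTVA
```

The output matches the first 20 residues of the protein sequence listed above.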