Gene Apar_0510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0510 
Symbol 
ID8413361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp588153 
End bp589751 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content51% 
IMG OID645022080 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003179532 
Protein GI257784315 
COG category[S] Function unknown 
COG ID[COG2461] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0555672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG TTCTTGATTT AAGCAAATCC GTTTATGAGA TTTGCACTAA GTTTCCCGTC 
ATTAAAGGCC TTATGGCAGA AAACGGCTTT GCCGAGATTA CCGAGCCAGG CCGCCTTCAG
ACCATGGGCC GTTTTATGAC CATTCCTAAA GGCTGTGACC ACAAAGGCGT AGACCTTGAG
GAGCTCAAGG CAATTTTCCG CTCTCACGGC TTTACTATTC TTGGTGATGA GGAGCCTATA
GCTGCAGACA CTCCATCTGA CGAGAAACCC GCAGACACAG AGGCAACTGA CGCACCTATT
GCACAAACCC CTAAGGAGCG CAAAGCTCTT ATCGCCCAGT ATCTTGAGCG CCTCAACGAC
GGTGAGGACA TCGAGGCTGT TCGTGAGGAC TTTGCTCGTA ACTTCAAGGA TGTCTCTGGA
TCTGAGATCT CTACCGCTGA GCAGGAGCTT ATCGCTGGCG GTGTTCCTAT GGAGAAGGTC
TTGAAGCTCT GCGACGTTCA CGCTGCTCTC TTTGAAGGAA AGGTCTCTTG TGCACCAACA
GGCGCTGCTG AAGAGACTCC TGGACATCCT GTTTGGACCA TGCGCCAAGA AAACGACCGC
ATTCTTGCCT TTATTCACGA CCGCGTTGCT CCAGATGTAA GACGTGCTCG CACGCTTAAC
GAGTCTTCCA GCAATGAAGA GTGCGCTGGT GTTGCAGCAA TTCTAAAGGC TGATATGGAC
GCCTTTGACG AGGTCTTTGT TCACTACAAA CGCAAGGAAG AGCTGCTCTT CCCTCACCTG
GAGCGTCACG ATATTACCGG TCCTTCAAAG GTTATGTGGG GCAAAGACGA CGAAGTAAGA
AACGCCGTTA ACGGCGCAGA GGCTCTGCTT GAGACTGCTT ATGGTCAGTA TGATGCTGGT
CTTATGGCAG GCGTTGCTGA CTATCTGGAT GAAGCTATTG AAGGCGCTGA GTCCATGGCT
TCCAAGGAAG AGAACGTCCT TATTCCACTT TCCCTTGAGC ACCTAACTGC CACCCAGTGG
ACCCAAATTG CTGCTGAAGA GAACGAGTTT GGCCACGCAT TTGGCGTCAA TCCACCTTCA
TGGCATGCAG ATCCTCTTGA GCTTGCAAAT GACAAACTCA AAGAAATGGA AGCTGCAGGT
GCTATGGGCG AGAATAACGC TGAGGCAGAA GAGGAAGCAA TTTCTGCTGA TGGAAAGGTT
AAACTCTCTA CCGGTGAGTT CACCATTCCT CAGCTTGAGG CTGTTTTTGC AACCATTCCG
CTTGACATTA CCTTTGTTGA TGCAGACGAT AAAACCCGTT ACTTCAGCCA CGGCGACACC
CGCGCCTTCC CTCGTCCAAA GAGTTGCCTT GGTCGCGACG TCTACGATTG TCATCCACCA
AAGAGCCAAG AAGCTGTTCG CCGTATCCTC ACCGAGTTCA AGAGCGGCAA GCGTGATTGC
TCTGAGTTCT GGTTTGAAGT CAAAGACAAG TTCCTCTATG TCCGTTACTT TGCTGTCCGC
GATGAAAAGG GCAACTACCT GGGCGCTCTT GAGACCACTC AAGACATCGG ACCAATCCGT
GCCCTTGAGG GAGAAAACCG CAGAGGCTCT GACAGTTAA
 
Protein sequence
MAQVLDLSKS VYEICTKFPV IKGLMAENGF AEITEPGRLQ TMGRFMTIPK GCDHKGVDLE 
ELKAIFRSHG FTILGDEEPI AADTPSDEKP ADTEATDAPI AQTPKERKAL IAQYLERLND
GEDIEAVRED FARNFKDVSG SEISTAEQEL IAGGVPMEKV LKLCDVHAAL FEGKVSCAPT
GAAEETPGHP VWTMRQENDR ILAFIHDRVA PDVRRARTLN ESSSNEECAG VAAILKADMD
AFDEVFVHYK RKEELLFPHL ERHDITGPSK VMWGKDDEVR NAVNGAEALL ETAYGQYDAG
LMAGVADYLD EAIEGAESMA SKEENVLIPL SLEHLTATQW TQIAAEENEF GHAFGVNPPS
WHADPLELAN DKLKEMEAAG AMGENNAEAE EEAISADGKV KLSTGEFTIP QLEAVFATIP
LDITFVDADD KTRYFSHGDT RAFPRPKSCL GRDVYDCHPP KSQEAVRRIL TEFKSGKRDC
SEFWFEVKDK FLYVRYFAVR DEKGNYLGAL ETTQDIGPIR ALEGENRRGS DS