Gene Apar_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0839 
Symbol 
ID8413705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp930551 
End bp931561 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content46% 
IMG OID645022422 
Productpseudouridine synthase, RluA family 
Protein accessionYP_003179859 
Protein GI257784642 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00962928 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGAGC GTATTGTTGA AATATTAGTT GGGCCAGAAG GAGACAGCGT CAGATTAGAT 
GCGTTTCTTT CCGCTCAAGA TATGCTTCCT TCAAGAAGCG CCTGCGTCAA GCTGGTAGAA
GAGGGAAGAG TAACCATCAA CGCCACACTC GCTACTTCTA AGTCAGAAAA ACTTATGTTG
GGCGATAGAC TTTTGGTGTC TCTCCCAAAT GCAGAGTCTC AAACAGGTCT TTTGCGTCCA
AATCCAGACA TTCCACTTGA TATTCGTTTT GAGGATCAGT ACCTCATAGT ACTTTCAAAG
CAGATTGGCC TTGTCTGCCA TCCATCTCCG GGCCATGTTG ATGATACTTT GGCAAATGCT
CTGGTTGCCC ATTGTGGCTA CGAGCATCTA GGAATGCTTC AGGGAGAAGA TCGTCCTGGT
ATTGTGCATC GTCTTGACAT GGATACGTCC GGTCTTATGT TGGCTGTAAA GTCAGATGAG
GCTCAGAAGG CCCTTCAAGA TCTCATCAGG CTGCGCGTAC TTGATCGACG CTACATTGTG
TTGGTGCATG GCTATGTTGC CCATGATTCT GGCACTATTG AAACGGGCAT TGCTCGTTCA
ACGCGAGACC GTCTAAAAAT GACTGTATCT GATGCGCCAG GTGCACGAGA AGCCATTACC
ACCTTTAGGA CGCTTGAGCG CTTTGAGGCA GGTAGAAAAG GCGATGGTTA CTCGCTTCTT
GAGTGTCATC TCTATACAGG TCGCACGCAC CAGATTAGAG TTCACATGCG CCACATCGGA
CATCCTGTTG TTGGCGATCA ACTCTACGGT AAAAAAGACA CAAGTCTTAA TCTTGGTCTT
AATAGACAAT TTCTACACTC TTGGCGTGTT CAGTTTGAGC ACCCTTTTAC AGGCGAGAAT
ATCATAGTGG CAGATACGCT GCCAAAAGAC CTACAAGAAG CACTTATTTC TCAGCAGGAT
ATGTCTATGG GAAGAACAAT AGCAGGAAAA GAAATCTGCC CACAGTTGTA A
 
Protein sequence
MNERIVEILV GPEGDSVRLD AFLSAQDMLP SRSACVKLVE EGRVTINATL ATSKSEKLML 
GDRLLVSLPN AESQTGLLRP NPDIPLDIRF EDQYLIVLSK QIGLVCHPSP GHVDDTLANA
LVAHCGYEHL GMLQGEDRPG IVHRLDMDTS GLMLAVKSDE AQKALQDLIR LRVLDRRYIV
LVHGYVAHDS GTIETGIARS TRDRLKMTVS DAPGAREAIT TFRTLERFEA GRKGDGYSLL
ECHLYTGRTH QIRVHMRHIG HPVVGDQLYG KKDTSLNLGL NRQFLHSWRV QFEHPFTGEN
IIVADTLPKD LQEALISQQD MSMGRTIAGK EICPQL