Gene Apar_0244 details

Gene Information

Locus tag: Apar_0244
Symbol:
ID: 8413092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 286697
End bp: 288052
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 50%
IMG OID: 645021812
Product: dihydropyrimidinase
Protein accession: YP_003179267
Protein GI: 257784050
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR02033] D-hydantoinase
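
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based start and end positions (the record does not state its convention, but this one reproduces the listed gene length) and assumes the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(286697, 288052)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```

Both values agree with the Gene Length and Protein Length fields of this record.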


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.20857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.276303
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGTAT TAAAGAATGG TTTTCTTGTA TTGCCTGAAG GCGTCTTCTG TGGAGACTTA 
GTGCTCGATG GTACAAAAAT CATCCAGGTG GGAGGTACGT ACGAGGCACG TATAGATGAC
AACGTTATCG ATGTTACGGG TAAATACGTA TTCCCTGGTT TTATTGACGC TCATACGCAT
ATGCAATGCT GGACAGGCAT GGACTGGACA GCAGATAGCT TTGAGACGGG AACCCGTGCT
GCTGCTTGCG GCGGTACCAC CACTATTGTG GATTATGCCA CACAAGACAA AGGCATGACG
CTACCCGAAG CGCTTGATGA GTGGCATAAA CGTGCAGATG GTACCTGTAC TGCTAACTAT
GCATTTCATA TGGCCATTGC TGATTGGAAT GAACAAACTA AAGCAGATAT GCAGGCTATG
CGCGATGCAG GCGTTATGTC ATTTAAGACC TATTTTGCCT ACGATCATCT GCGTTTAGAT
GACGCTCAGA CGCTTGAGGT ACTTGAATAT ATCCGTGATA TTGACGGTGT CCTGTGCGTT
CACTGCGAGA ATGGCACGCT TGTGAATGAG CTACAAAGAC GTATGCTCGC AGCTGGTATT
ACTGGTCCTG AGGGTCATCC TATGAGTCGT CCTGCAGCTT GTGAAGCAGA GGCTATTTCT
CGTCTGTGTT ATCTTGCGGA GCTTGCCGAT GCTCGTATCA ACATTGTTCA CCTGTCCAGC
GCTCTTGGTC TTGAGGCGGT TCGTGCGGCT AAGGCACGCG GCAAGGTAAA GATGGATGTT
GAGACCTGTC CTCAGTATCT GTTGCTGGAT GATTCCCGCT ATCTTGAGAG TGACTTTGAG
GGTGCCAAGT ACGTTATGAG TCCACCTTTG CGCAAGCCGC ACGATATTGA GGTATTGCGT
CGGGCCGTAT GTGACGGCGA GATTGATACC ATTGCTACCG ATCACTGCAG TTTCAATCTT
CACGGTCAAA AAGATCGCGG CATCGACGAT TTCACTCACA TTCCCAATGG CGGTCCTGGT
GTAGAACATC GACCCGGTCT TATCATGACC TCGTTTGAGA ATCGCTTGGG CGTCCAAGAC
TTTGCTCGTC TCATGAGTGA GGGACCCGCT CGTGTCTTTG GTATGTATCC GCGCAAGGGT
GTCCTGCGTG TTGGCTCCGA TGCTGACGTG ACGGTATGGG ATCCAAGTGT GACGTGGACC
ATCAGCGAGA AGAACCAGCA TCAAAACGTT GACTATACGC CATATGAGGG CTTTGAGGTC
CACGGACGTC CGGCGTATGT TTTTGTCAAC GGAGAGCTGG CAGTGGTTGA TGGTGAGCCA
ACCGGCGTGA AGCCAGGCGC ATACGTCAAA CGGTAA
 
Protein sequence
MNVLKNGFLV LPEGVFCGDL VLDGTKIIQV GGTYEARIDD NVIDVTGKYV FPGFIDAHTH 
MQCWTGMDWT ADSFETGTRA AACGGTTTIV DYATQDKGMT LPEALDEWHK RADGTCTANY
AFHMAIADWN EQTKADMQAM RDAGVMSFKT YFAYDHLRLD DAQTLEVLEY IRDIDGVLCV
HCENGTLVNE LQRRMLAAGI TGPEGHPMSR PAACEAEAIS RLCYLAELAD ARINIVHLSS
ALGLEAVRAA KARGKVKMDV ETCPQYLLLD DSRYLESDFE GAKYVMSPPL RKPHDIEVLR
RAVCDGEIDT IATDHCSFNL HGQKDRGIDD FTHIPNGGPG VEHRPGLIMT SFENRLGVQD
FARLMSEGPA RVFGMYPRKG VLRVGSDADV TVWDPSVTWT ISEKNQHQNV DYTPYEGFEV
HGRPAYVFVN GELAVVDGEP TGVKPGAYVK R
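
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. The following is a minimal sketch, not the pipeline used to build this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the NCBI amino-acid string shared by translation tables 1 and 11. Table 11 differs from the standard code in its set of permitted start codons; since this gene starts with ATG, that difference does not matter here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AAS)}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_60 = "ATGAATGTAT TAAAGAATGG TTTTCTTGTA TTGCCTGAAG GCGTCTTCTG TGGAGACTTA"
print(translate(first_60))   # MNVLKNGFLVLPEGVFCGDL
print(gc_fraction("ATGC"))   # 0.5
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the GC content field.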