Gene Apre_1430 details

Gene Information

Locus tag: Apre_1430
Symbol:
ID: 8398240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1548944
End bp: 1550314
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 32%
IMG OID: 644995795
Product: dihydropyrimidinase
Protein accession: YP_003153174
Protein GI: 257066918
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
            [TIGR02033] D-hydantoinase
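
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1548944
END_BP = 1550314


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1371 456
```

Under these assumptions the computed values agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields.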


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTAA TTATAAAAAA TGCGAATTTA TTAGGTAGTA ATATTACAGA TATAAAGATT 
AACAATGAGA AAATTGAAGC TATTGGTAAT AATTTAGATG AAAAGGACAG TGAAATCTAT
GATGCTGAGA ATAAGTTAGT TTTGCCTGGT GGAATTGATG TTCATACTCA TATGTCTTTA
GATTTAGGAG AATATGTAGC TGTTGATGAT TTCCATACAG GCACTAGAGC AGCAGCACAT
GGCGGAACAA CTTGTATTGT AGATCACATT GCTTTCGGAC CAAAGGAAAG CTATGTTTCT
GAAATGATTG AACATTATCA TGAATTAGCG AATAACAAAG CTATAATAGA TTATTCTTTT
CATGGTGCAG TACAAGATGC GAGAGATGAA ACTCTAAAGC AAATTGAGCA TCTTCATATT
AATGATGGGA TTGTATCTGA AAAGATTTAC ACTACCTATG GTGGAAAATT AAATGATGCA
GAAATCTTAA AAATTCTAAA AAAAGCTAAA GAAACAGGAA CCATAATATG TATTCATTGT
GAAAATGATG GTTCAATTCA GGAATTGAGA GAGGAAGCGG AAGAACAAGG AAATTTAGAT
CCTATTTATC ATGCAAAAAC TAGACCAGCA GAAACTGAAG CGGAAGCAAT TAATAGACTA
ATATACCTAT CAGAAATTGC TGGTTTTCCG AAATTGTATA TAGTTCATAC TTCAACTGAA
TTAGGACTAA GGGAAATAAT AAAAGCTAGG AAAAGAGGAG TTAAAAACCT TTTCTGTGAA
ACATGTACCC AATACTTAAT GCTTAATGAA AGCAAATATA TAGATGGTGG AAATGAGGAA
GCTATTAAAT ATATAATGGC ACCACCATTA AGAAAGCAAT CTGACCAAGA TTTTTTATGG
GAAGGCATTA GAAACGGTGA TGTAGATGTA GTTGCTACTG ACCATTGTCC ATTCTTCTAC
GAGAAAGAAA AGTTGCCTCA TAAAGATAAC TTTATGACAT GTCCAGGAGG AGTTCCAGGA
GTTGAAGAAA GAGTAGAGCT AATTATAACT GAAGGTTTAA GAAGAGGAAT AGAACTAGAA
AGGCTTGTAG AAGTCTTAAT GATAAATCCA TCAAAAATAT TCGGATTGTA CCCTAGGAAG
GGGAATATAA TACCAGGAGC AGATGCCGAT ATAATAGTGC TAGATGAAAA AGAATATACT
ATAAAACAAG ATAATAGACA TTCAATTGTA GATTATACAA CTTACGAAGG TATGAACTCA
GATTATGAAG TTTCTACAGT TTTATGTAGG GGTAACTTTA TACTTAAAGA TGGAGAATAC
TTAGGAAAGC AAGGTTACGG TAAATTTATT AAAAGGAAAT TTGATGAGTA A
 
Protein sequence
MSLIIKNANL LGSNITDIKI NNEKIEAIGN NLDEKDSEIY DAENKLVLPG GIDVHTHMSL 
DLGEYVAVDD FHTGTRAAAH GGTTCIVDHI AFGPKESYVS EMIEHYHELA NNKAIIDYSF
HGAVQDARDE TLKQIEHLHI NDGIVSEKIY TTYGGKLNDA EILKILKKAK ETGTIICIHC
ENDGSIQELR EEAEEQGNLD PIYHAKTRPA ETEAEAINRL IYLSEIAGFP KLYIVHTSTE
LGLREIIKAR KRGVKNLFCE TCTQYLMLNE SKYIDGGNEE AIKYIMAPPL RKQSDQDFLW
EGIRNGDVDV VATDHCPFFY EKEKLPHKDN FMTCPGGVPG VEERVELIIT EGLRRGIELE
RLVEVLMINP SKIFGLYPRK GNIIPGADAD IIVLDEKEYT IKQDNRHSIV DYTTYEGMNS
DYEVSTVLCR GNFILKDGEY LGKQGYGKFI KRKFDE
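
The gene and protein sequences above can be cross-checked with a short script. The sketch below is a minimal illustration, not the database's own method: it assumes the gene sequence is written 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) raise KeyError.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = (
        "ATGTCTTTAA TTATAAAAAA TGCGAATTTA "
        "TTAGGTAGTA ATATTACAGA TATAAAGATT"
    )
    print(translate(first_line))  # prints: MSLIIKNANLLGSNITDIKI
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content field of this record.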