Gene Apar_0811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0811 
Symbol 
ID8413676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp892494 
End bp893420 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content50% 
IMG OID645022393 
Productdihydroorotate dehydrogenase family protein 
Protein accessionYP_003179831 
Protein GI257784614 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0183468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.536442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGACC AGGTAAAAAT GGCCGTCAAT CTGGGTGGAA TCCAGATGAA AAATCCTATT 
AATACTGCTG CAGGTACCTT TGGTTATGGC TGGCAGTTTG AAGGCTTTTA CGACGTCTCC
CTGCTTGGCG CCATTACTAT GAAGGGTGTT GCCCGCGTTC CTTGGGAAGG AAATCCTGCT
CCTCGTATGT GTGAGCTTAA CGGCGGCATG ATGAACAGCG TTGGCCTTGC TAATCCGGGA
GTGGATGATT TTATCGCTCA CACTGATGAC TACATGAAAG ACCTTGAAGA CCGTGGTACG
CGCGTTATCA TGCAGATGGC AGCACATTCC GTGCAAGAAA TGATTGACGT TGTCGAGCGT
CTTGAGGAGC TTAATCCGCA CATTTCAGCT ATTGAGCTTA ACGTGAGCTG TCCAAATCTC
GAGAAGGGCG GCAGACCTCT TGGCGGCACT CCTGAGCAGG CAACAGAGAT TATGAAAGCG
GTTCGTCCTC TAACGAAGCT GCCTATCTTG GTTAAGATGG CTCCCGTCAA TGTTGCTGAG
ATTGGCAAGG CTTTTGAGGC TGAGGGTGCT GATGGCCTCA CATTGATTAA CTCTATTCCA
GGCATGTCTA TCAATGTTCA TACTAGAAAG AGCAGGCTTT CTAAGCCAAC AGGCGGCCTC
AGTGGTCCTT TATGTCATAA CGCTGCTGTC CGTATGGTTT GGGAGTGCGC TCAGGCAGTC
TCTATCCCTA TCTGTGGTGT AGGTGGTGTG GAAACAGGCG AAGATGCTGC GGAATTTATT
CTGGCAGGCG CTACGGCCGT CTCGGTTGGT TCTGCAAACC TTTACGACCC TATGTGTGCT
CCACGTATTC TGAACGAGCT TACTGATTGG GCAAAGTCTC AGGGCGTATC TGACATCCAC
GAACTGATTG GAGCTGTTGA ATGTTAA
 
Protein sequence
MVDQVKMAVN LGGIQMKNPI NTAAGTFGYG WQFEGFYDVS LLGAITMKGV ARVPWEGNPA 
PRMCELNGGM MNSVGLANPG VDDFIAHTDD YMKDLEDRGT RVIMQMAAHS VQEMIDVVER
LEELNPHISA IELNVSCPNL EKGGRPLGGT PEQATEIMKA VRPLTKLPIL VKMAPVNVAE
IGKAFEAEGA DGLTLINSIP GMSINVHTRK SRLSKPTGGL SGPLCHNAAV RMVWECAQAV
SIPICGVGGV ETGEDAAEFI LAGATAVSVG SANLYDPMCA PRILNELTDW AKSQGVSDIH
ELIGAVEC