Gene Apar_0813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0813 
Symbol 
ID8413678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp894208 
End bp895515 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content47% 
IMG OID645022395 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_003179833 
Protein GI257784616 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0402861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.237167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTC TGCTTAAGGG TGCACGCGTT GTTGACCCAC AGCTTAATCT TGATGCGGTA 
ATGGATGTTC GTATTGACGG AGAAACCATT GATGAAGTTG CAGCAACGGT TAAACCTCAA
GTGGGCGATA TTGTTATTGA TGCAAAGGGT AAGGTGCTGA CACCTGGCCT TGTTGACATG
CACGTCCACT TTAGAGATCC TGGTTTTGAG TACAAGGAGA CCATCGAAAC AGGTTCTAGA
GCTGCGGTTC ATGGTGGTTT CACTGACGTT GCTACTATGC CAAATACCAA TCCTGTCACC
GATAATGGAG CTGCTGTTCG TTTTCAGATT GACCGTGGCA ATGAGGTTGG TCTCATTCAT
GTGCGTCCCA TGGGTGCTCT GACTAAGGGT TCTAAGGGAC AAGAGCTTGC TGAGATTGGC
AACATGGTTG ACGAGGGTGC TTCTGCATTC TCTGACGATG GGCACGGCGT TCAAAGTGCC
GGTATGATGC GTACAGTTAT GGAGTACGTT TCTCAGTTTG ATCGCGTTGT CGCTGCTCAC
TGTGAGATTG AGTCCATCAG TGCTGGCGGC CTTGTTAACG AGGGCCGCGC AAGCACACGA
CTTGGTATGT TTGGTTGGCC GGCACTTGGC GAGGAGCTGG AGATTTCCAG AGATATTGAT
CTTTGCCGAC TCACTGGCTG TCCTCTACAT ATCTGCCACA TTTCAACAGG CAGGGGAGTA
GAGCTGGTAC GCGCAGCTAA GAAGGAAGGC CTGCCCGTTA CCGCAGAGGT TTGTCCTCAT
CACCTTTTCT TGTCTGAAGA CGACATCACC GATGCTTATA ACACTAACCT TAAGATGAAC
CCACCACTCA GAACTGCTGA GGATACGCTT GCACTTCAGA CAGCAGTTGC AGACGGTACA
GTTGATTGTA TTGTTACCGA TCATGCACCT CATGCAGCTC ACGAGAAGGA CTGTGAGTGG
GAAATTTCTT ACTTTGGTTG CATTGGTCTT GAGACCTCGC TGCCTTTAAT GATTACTAAT
ATGGTACGAA CCAATAAGCT TTCTTATACT GGCCTTGTTC GTGCAATGGC TGTTAATCCT
CGTCGTATTC TGCGTCTTGA GCCTATTAAG ATTGCTGCAG GTTATAAGGC TGATTTAACA
CTTTTTGATC CAAATAAAAA GGTCCAGATT ACACCAGAGT ATTTTGAGAG TAAGTCTAAG
AACTCCGCCT TTATTGGCTC TGAGCTTTAT GGCGTTGCAA CTGATGTGTT TGTTGATGGT
AAGCGTGTCC TTGCCAATGA AGTAGTTGTT CCTGGAGAGG AAAAGTAA
 
Protein sequence
MAFLLKGARV VDPQLNLDAV MDVRIDGETI DEVAATVKPQ VGDIVIDAKG KVLTPGLVDM 
HVHFRDPGFE YKETIETGSR AAVHGGFTDV ATMPNTNPVT DNGAAVRFQI DRGNEVGLIH
VRPMGALTKG SKGQELAEIG NMVDEGASAF SDDGHGVQSA GMMRTVMEYV SQFDRVVAAH
CEIESISAGG LVNEGRASTR LGMFGWPALG EELEISRDID LCRLTGCPLH ICHISTGRGV
ELVRAAKKEG LPVTAEVCPH HLFLSEDDIT DAYNTNLKMN PPLRTAEDTL ALQTAVADGT
VDCIVTDHAP HAAHEKDCEW EISYFGCIGL ETSLPLMITN MVRTNKLSYT GLVRAMAVNP
RRILRLEPIK IAAGYKADLT LFDPNKKVQI TPEYFESKSK NSAFIGSELY GVATDVFVDG
KRVLANEVVV PGEEK