Gene Amir_5244 details

Gene Information

Locus tag: Amir_5244
Symbol: pyrC
ID: 8329446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 6241225
End bp: 6242526
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 75%
IMG OID: 644945683
Product: dihydroorotase
Protein accession: YP_003102911
Protein GI: 256379251
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
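
The coordinate and length fields above agree with each other, which a few lines of arithmetic can confirm. The Python sketch below is illustrative only: the variable names are invented for this example, and it assumes that the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 6241225
end_bp = 6242526

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1302, matching the Gene Length field
print(protein_length)   # 433, matching the Protein Length field
```

If the gene length were not a multiple of three, the integer division would silently drop the remainder, so a stricter check could also test `gene_length % 3 == 0`.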


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.937093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACCCGC TGCTGCTCAG GGGCGTGCGC CCCTACGGCG AGGGCGAGCC GGTCGACGTG 
CTCGTCCGCG ACGGCGTGAT CGCCGAGCTG GCCGCGACGA TCGACGCCGC GACGATCGAC
GCCGACGACG TCCAGGTCGT CGACGGCAAT GGCGCGGTCC TGCTGCCCGG CTTCGTCGAC
CTGCACACCC ACCTGCGCGA GCCCGGCCGC GAGGACACCG AGACCATCGC GACCGGGTCC
GCCGCCGCCG CGCTCGGCGG CTACACCGCC GTGTTCGCCA TGGCCAACAC CGACCCGGTG
GCCGACAACG CCGTCGTCGT GGAGCACGTG GCCCGGCGTG GCCGCGAGGT CGGCCTGGCC
GACGTGCACC CGGTCGGCGC GGTCACCGTC GGCCTCAAGG GCGAGAAGCT CGCCGAGCTC
GGCACGATGG CCAAGGTCGG CGTGCGCGTC TTCTCCGACG ACGGCCACTG CGTGCACGAC
CCGCTGCTGA TGCGCCGCGC GCTGGAGTAC AGCCGCGCGC TCGACGCGGT CATCGCCCAG
CACGCCGAGG AGCCCCGGCT CACCGTCGGC GCGCAGGCCC ACGAGGGCGA GAACGCCGCC
CGGCTCGGGC TCCAGGGCTG GCCGGCCTCG GCCGAGGAGT CGATCGTGGC GCGCGACTGC
CTGCTCGCGC TGCACGCCGA GGCCCGCCTG CACGTGTGCC ACGTGTCCAC CTCGGGCACC
GCCGACGTGC TGCGCTGGGC CAAGGCGCGG GGCACGCGGG TGTCCGCCGA GGTCACCCCG
CACCACCTGC TGCTCGACGA CAGCAGGCTC GCCACCTACG ACCCGGTCAA CAAGGTCAAC
CCGCCGCTGC GCGCCGAGTC CGACGTCCTC GCGCTGCGCG CCGCGCTCGC GGACGGCTCG
ATCGACTGCG TCGCCACCGA CCACGCCCCG CACGCCGTGC AGGACAAGGA CTGCGAGTGG
TCCGCCGCGC GGCCGGGGAT GCTCGGCCTG CAGACCGCGC TGTCCGTGGT CGCCGAGACC
ATGGTCGCCA CCGGCCTGCT CGACTGGCGC GGCGTCGCCC GCGTCATGTC CGAGCGCCCG
GCGGAGATCG GCGGCCTCGC CGACCAGGGC CGCCCGATCG CGGTCGGCGA GCCCGCGAAC
CTGGCGCTGG TCGACCCGGA CGCCCGCTGG ACCGTGCGCG GGGCCGACTT CGCCAGCATC
GCGGCGAACA CCCCGTTCGA GGGGATGGAG CTCCCCGCCG CCGTCGTGGC GACGGTCCTG
CGCGGGCGAG TCACCGCGCT CAGCGGAAGG ATCCAGCCAT GA
 
Protein sequence
MNPLLLRGVR PYGEGEPVDV LVRDGVIAEL AATIDAATID ADDVQVVDGN GAVLLPGFVD 
LHTHLREPGR EDTETIATGS AAAALGGYTA VFAMANTDPV ADNAVVVEHV ARRGREVGLA
DVHPVGAVTV GLKGEKLAEL GTMAKVGVRV FSDDGHCVHD PLLMRRALEY SRALDAVIAQ
HAEEPRLTVG AQAHEGENAA RLGLQGWPAS AEESIVARDC LLALHAEARL HVCHVSTSGT
ADVLRWAKAR GTRVSAEVTP HHLLLDDSRL ATYDPVNKVN PPLRAESDVL ALRAALADGS
IDCVATDHAP HAVQDKDCEW SAARPGMLGL QTALSVVAET MVATGLLDWR GVARVMSERP
AEIGGLADQG RPIAVGEPAN LALVDPDARW TVRGADFASI AANTPFEGME LPAAVVATVL
RGRVTALSGR IQP
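
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11 (the value given in the Translation table field), GTG can serve as a start codon and is then read as methionine, while inside the reading frame it codes for valine. The Python sketch below illustrates this on the first 30 bases and the last 12 bases of the gene sequence. The function name `translate_table11`, the `is_cds_start` parameter, and the reduced start-codon set are assumptions made for this example and are not part of the record.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are listed here;
# table 11 permits a few more.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_table11(dna, is_cds_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # A start codon at the first position is read as methionine.
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
head = translate_table11("GTGAACCCGC TGCTGCTCAG GGGCGTGCGC")
tail = translate_table11("ATCCAGCCAT GA", is_cds_start=False)
print(head)  # MNPLLLRGVR
print(tail)  # IQP
```

The two outputs match the first ten and the last three residues of the protein sequence above. The second GTG in the head fragment is translated as V because it is not at the start position.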