Gene Amir_5937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5937 
Symbol 
ID8330144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6982131 
End bp6983840 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content77% 
IMG OID644946368 
Productallantoinase 
Protein accessionYP_003103591 
Protein GI256379931 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCACT TCGGCCGCGC CGCCCTCGGC CCGCGCGCGA ACCACACCAC CGGCTCGCCC 
GACACCCCCG GAAGGTCGTA CCGGCCGCGT TCGCCCGACG CCGTTCCGGA AGCCCGAACC
CCGCCTGCCG CCAACCGGGA GACCGCCGCC CCGCAGCCCT CGCCCGCGCT TCGCGCGGCC
GGCACCCTGA CCCGCATGGA CCTGGTCTTC CGCGCGAAGC GCGTGATCAC GCCGGACGGC
GAGATCGGCG CCGACGTCGG CGTGGCGGAC GGCCGCATCA CCGTCGTCGT CCCGCACCCG
GACCCGACCG GCCCCGGCCG CGCAGGCACC GACCCGCTGC CCACCGGACC GTTCCGCACC
AGCCAGTTCG GCGCCGGACC GTTCGGCGCC GGACCGTTCG ACGCCGGCGC AGCCCCCACC
GGACCGCTTC CGGGGCTCGC CGCCACCGGC CCGTTCCCCG CCGACCCGGC CGCCACCCCG
GCCGCCGACC TCATCCGCGA AGCCCTCCGC GCAGGCGCCG AGCTGGTCGA GCTGCCCGAC
GACGAGGTCC TCATCCCCGG CCTGGTCGAC ACCCACGTCC ACGTCAACGA CCCCGGTCGC
GCCGACTGGG AGGGCTTCCC CACCGCCACG CTCGCCGCCG CCGCGGGCGG GGTCACCTCG
ATCGTCGACA TGCCCCTCAA CAGCCTCCCC CCGACCACCA CCCCCGCCGC GCTCGACGCC
AAGCTCGACG CCGCGCGCGG GCGCGTGCAC GTCGACGTCG GCTTCTGGGG CGGTCTCCTG
CCCGGCAACG GCGACCAGCT CGCCGCGCTC GTCGACCGGG GCGTCTTCGG GTTCAAGTGC
TTCCTCGCGC ACTCCGGCGT CGACGAGTTC CCGCACGTCG ACGTCCCCCG GCTGCGCGCC
GCCCTCACCC GCCTCCCACC CGACCTGCCG GTGATCGTCC ACGCCGAGGA CCCCGCCCAC
CTCGCCGAGC CCGCGAGCGG CGACTACCCG GGTTTCCTCG CCTCCCGCCC GCACGCCGCC
GAGCAGCGCG CCGTCGCCGA CGTCATCGCC GCCGCCCGCG ACACCGGCCA CCGCCTGCAC
GTCCTGCACG TCTCCAGCGC CCGCGCCGCC GCCGACCTCG CGGCGGCCAA GCGCGACGGC
GTCCCCGTCA CCGCCGAGAC CTGCCCGCAC TACCTCACCT TCACCGCCGA GGAGATCCCC
GAGGGCGCCA CCGCGTTCAA GTGCTGCCCC CCGATCCGCG AGGCCGCCAA CCGCGAGCTG
CTCTGGGCGG CCCTGCGCGA CGGCGCGCTC GACCTCGTCG TCAGCGACCA CTCGCCGTGC
ACCCCCGACC TCAAGCGCGG CGACTTCGCC ACCGCCTGGG GCGGCGTCGC GAGCCTCCAG
CTGGGCCTCC CGGCCGTGTG GACGCAGGCC CGCCGCCGGG GCTTCGCGCT CACCGACGTG
GTCCGCTGGA TGTCCACGGC CCCCGCCGAC CTCACCGGTC TGCGGCACAA GGGCCGCATC
GCGCCCGGCG CGGACGCCGA CCTGTGCGCG TTCGCCCCCG ACGCCGCCTT CGTCGTGGAC
CGCGCCCACC TGCGCCACCG CAACCCGGTC ACCGCCTACC ACGGCCTGCC GCTGGCGGGC
GAGGTGCGGC GGACCTGGTT GCGCGGACGC CGGATCACCG GGGACGCGCC GTCCGGGCGG
TTCCTGACCC GAGGCGGAGG AGCGGCATGA
 
Protein sequence
MRHFGRAALG PRANHTTGSP DTPGRSYRPR SPDAVPEART PPAANRETAA PQPSPALRAA 
GTLTRMDLVF RAKRVITPDG EIGADVGVAD GRITVVVPHP DPTGPGRAGT DPLPTGPFRT
SQFGAGPFGA GPFDAGAAPT GPLPGLAATG PFPADPAATP AADLIREALR AGAELVELPD
DEVLIPGLVD THVHVNDPGR ADWEGFPTAT LAAAAGGVTS IVDMPLNSLP PTTTPAALDA
KLDAARGRVH VDVGFWGGLL PGNGDQLAAL VDRGVFGFKC FLAHSGVDEF PHVDVPRLRA
ALTRLPPDLP VIVHAEDPAH LAEPASGDYP GFLASRPHAA EQRAVADVIA AARDTGHRLH
VLHVSSARAA ADLAAAKRDG VPVTAETCPH YLTFTAEEIP EGATAFKCCP PIREAANREL
LWAALRDGAL DLVVSDHSPC TPDLKRGDFA TAWGGVASLQ LGLPAVWTQA RRRGFALTDV
VRWMSTAPAD LTGLRHKGRI APGADADLCA FAPDAAFVVD RAHLRHRNPV TAYHGLPLAG
EVRRTWLRGR RITGDAPSGR FLTRGGGAA