Gene Amir_2099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2099 
Symbol 
ID8326288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2320465 
End bp2321856 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content71% 
IMG OID644942649 
Productphenylhydantoinase 
Protein accessionYP_003099890 
Protein GI256376230 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0317322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGACGT TGATCAGCGG CGGTCTGGTG GTCAACGCGG CCGGGTCGGC GCGCGCGGAC 
GTGCTGGTGG AGGGCGAGAA GGTCGCGGCG CTGCTGGCTC CGGGGCTGGA ACCGGCGCTG
GACGTGGACG AGCGGGTCGA CGCGACCGGG AAGTACGTGA TCCCCGGCGG GATCGACGCG
CACACCCACA TGGAGATGCC GTTCGGCGGG ACGCAGTCCA GCGACGACTT CACCAGCGGC
ACGATCGCGG CGGCGTGGGG CGGGACGACC ACGATCATCG ACTTCGCCGT GCAGGCCAAG
GGGACCTCGC TGCTGGCCAC CCTGGACAGG TGGCACGCCA AGGCGGACGG CAAGTGCGCG
GTGGACTACG GGTTCCACAT GATCGTGTCC GATGTGGACG ACTCCTCGCT CAAGGAGATG
GGCGCCTGCC TCGACGCGGG CGTGAACTCG TTCAAGATGT TCATGGCCTA CCCCGGCGTC
TTCTACGCCA CCGACGGGGA GATCCTGCGC GCGATGCAGC GGGCGCGCGA GATCGGCGGC
ACGGTCATGA TGCACGCCGA GAACGGCATC GCGATCGACG AGCTGGTCGC GCAGGCGCTC
GCGGAGGGCC GCACCGACCC GGTGCAGCAC GGGCTCACCC GGCCGCCGGA GCTGGAGGGC
GAGGCGACCT CGCGGGCCAT CGCGCTGGCC AGGGTCACCG GGGCGCCGCT GTACGTCGTG
CACCTGTCGG CGGCGCAGGC GCTCGACGCG GTCACCGAGG CGCGGGACAC CGGGCAGAAC
GTGTTCGCCG AGACCTGCCC GCAGTACCTC TACCTGTCGC TGGAGGACAT GGCGCGCCCC
GGTTTCGAGG GCGCGAAGTA CGTGGCCTCC CCACCGCTGC GGCCGGTCGA GCACCAGGCG
CGGTTGTGGC GGGGGCTGCG CACCAACGAC CTGTCGGTGG TGTCGACCGA CCACTGCCCG
TTCTGCTTCG CCGACCAGAA GGTGCTGGGG CAGGGCGACT TCTCCAAGAT CCCCAACGGG
ATGCCGGGCG TGGAGCACCG GATCGACCTG CTGCACCAGG GGGTCGTGCG CGGCGAGATC
GGGCTGGAGC GGTGGGTGGA GATCTGCTCG ACCACCCCGG CCCGGATGTT CGGGCTGCAC
CCGCGCAAGG GCGTCGTCGC GCCGGGGGCC GACGCGGACC TCGTCGTGTA CGACCCCGCC
GCGCGGCAGA CCATCTCGGC GGCCACGCAC CACATGAACG TGGACTACTC GGCGTTCGAG
GGGTTCGAGC TGACCGGGCG GGTCGAGGTG GTGCTCTCGC GCGGGCGGGT CGTGGTGGAC
CGGAGCGGGT TCCGGGGGTC GGCCGGGCAC GGGCGGTTCC TGGCCCGCGA GCTGAACCAG
TACCTGGTGT GA
 
Protein sequence
MRTLISGGLV VNAAGSARAD VLVEGEKVAA LLAPGLEPAL DVDERVDATG KYVIPGGIDA 
HTHMEMPFGG TQSSDDFTSG TIAAAWGGTT TIIDFAVQAK GTSLLATLDR WHAKADGKCA
VDYGFHMIVS DVDDSSLKEM GACLDAGVNS FKMFMAYPGV FYATDGEILR AMQRAREIGG
TVMMHAENGI AIDELVAQAL AEGRTDPVQH GLTRPPELEG EATSRAIALA RVTGAPLYVV
HLSAAQALDA VTEARDTGQN VFAETCPQYL YLSLEDMARP GFEGAKYVAS PPLRPVEHQA
RLWRGLRTND LSVVSTDHCP FCFADQKVLG QGDFSKIPNG MPGVEHRIDL LHQGVVRGEI
GLERWVEICS TTPARMFGLH PRKGVVAPGA DADLVVYDPA ARQTISAATH HMNVDYSAFE
GFELTGRVEV VLSRGRVVVD RSGFRGSAGH GRFLARELNQ YLV