Gene Amir_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3049 
Symbol 
ID8327239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp3525753 
End bp3526997 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content77% 
IMG OID644943574 
Productamidohydrolase 
Protein accessionYP_003100814 
Protein GI256377154 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCCCACC GCCTGGCGAT CGCCAACACG ACCCTGCTGA CCATGAACGA GCGGCTCGGC 
GACCTGGTCG GCGCGGACCT GGTCGTGGAC GGGACCGGCC GGATCGTGGT CCCCGGTTTC
GTGGACTCGC ACCGGCACAT GTGGCTGGCG GTGCTGCGCG GCCGGGGCGC GGACCAGACG
CTGCCCGAGT ACTTCGCCGA CGTCCTCGGC CACTGCGGGT CCCGGCTCAC CGAGGAGGAC
GCGCACGTCG GCACCGCTCT CAGCGCGCTG ACCGCGCTCG ACGCGGGCAT CACGACGGCG
CAGGACGTGG CCAACATCAA CGACCGCCCC GGCCGCACCG AGGCCGCGGT CGCCGCGCTG
CGCGAGTCGG GGCTGCGGGC GGTGTTCGCG TTCGGCCACA GCGCGATGGG CGAGGACCGG
CCCGACCACG GCGGCCTGTC CGAGGCGGGG GTGCGGGTGC TGGCCGAGCT GCTGCCGGAC
CGGGACGCGC GGGTGCGCAT GGGGTTGTGC GTGGACGCGT TCACCCCGGA GGCGGCGCGG
TGGAACTGGG CGCTGGCGAG CGAGCTGGAC GTGCCGGTCG TGCTGCACTG CCTGGGCGGG
CGGGGCGGGC TGGAGCCGAG CGACCTGCGG GACCTGGGCG TGTTCGGGCC GAGGGCGGTG
TTCGTCCACG GCACGGGACT GGCCGCGGCG GAGCTGGCGG TGCTCGCCGA GAGCGGCGCG
GCGCTGTCCG TGGCGCCGGT GGCGGAGATG CTGATGGGCC ACGGGACGCC GCCGCTGGTG
GACGCGCTGG CGGCGGGGCT GCGGCCGACG CTGAGCACGG ACGTGGAGTC CACGGGCGCG
GGCGACGTGT TCGCCCAGAT GCGGTCGGGG CTCCAGGTGG CCAGGCTCAT GGCCCTGCAC
GGCCCCGGCG CGCCGGGCCG GGACGAGCCG CCACCCCCGC TGATGACCTC GCGGCAGGCG
CTGGAGGCCG CGACGATCAA CGGCGCGCGG GCGCTGGGCG TGGGGGACGA GACCGGCTCG
CTGACGCCGG GCAAGCAGGC CGACCTGGTC GTGCTGCGCG CGGACCGGCC CGGCCTCGCC
CCGGTGCACG CCCCGGTGGG CGCGGTGGTG CAGAGCGCGG AGCGGGGCGA CGTGGAGACC
GTGCTGGTGG CGGGCAGGGC GGTCAAGCGC GACGGCCGCC TGCTGCGCGA CACCACCGAC
CTGCCGGCGC GGGCCGACCG CGTGCGGGAC CGGCTGTCCC GCTGA
 
Protein sequence
MPHRLAIANT TLLTMNERLG DLVGADLVVD GTGRIVVPGF VDSHRHMWLA VLRGRGADQT 
LPEYFADVLG HCGSRLTEED AHVGTALSAL TALDAGITTA QDVANINDRP GRTEAAVAAL
RESGLRAVFA FGHSAMGEDR PDHGGLSEAG VRVLAELLPD RDARVRMGLC VDAFTPEAAR
WNWALASELD VPVVLHCLGG RGGLEPSDLR DLGVFGPRAV FVHGTGLAAA ELAVLAESGA
ALSVAPVAEM LMGHGTPPLV DALAAGLRPT LSTDVESTGA GDVFAQMRSG LQVARLMALH
GPGAPGRDEP PPPLMTSRQA LEAATINGAR ALGVGDETGS LTPGKQADLV VLRADRPGLA
PVHAPVGAVV QSAERGDVET VLVAGRAVKR DGRLLRDTTD LPARADRVRD RLSR