Gene Mmcs_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3749 
Symbol 
ID4112580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4001593 
End bp4002819 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID638032888 
Productamidohydrolase 
Protein accessionYP_640911 
Protein GI108800714 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.468026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACAC TGAAGGCGGC CGGGTACGTC GACGTCGATG CCGGGGAGAT CATCCGCCCC 
GGCATCGTCC GTGTCGACGG TGACCGGATC GTCTCCGTCG GCGGATCGCC GGTCGACGGT
GACGAGGTGA TCGATCTCGG CGACTCGATC CTGTTGCCCG GCCTGATGGA CATGGAGGTC
AACCTCCTGA TGGGCGGCCG GGGCGAGAAC CCCGGCCTGT CCCAGGTGCA GGACGACCCC
CCGACCCGGG TGTTGCGCGC GGTGGGCAAC GCCAGGCGCA CCCTGCGCGC CGGGTTCACC
ACAGTGCGCA ACCTCGGTCT GTTCGTCAAG ACCGGCGGAT ACCTGCTCGA CGTCGCGCTC
GGTAAGGCGA TCGACGCCGG CTGGATCGAC GGGCCGCGTG TCATCCCGGC GGGACACGCG
ATCACGCCGA CCGGCGGCCA TCTCGACCCC ACGATGTTCG CGGCGTTCAT GCCGGGCGCA
CTGGAGTTGA CGGTCGAGGA GGGCATCGCC AACGGCATCG ACGAGATCCG CAAGGCCGTG
CGCTACCAGA TCAAACACGG CGCCCAGCTG ATCAAGGTGT GCGTATCCGG CGGCGTCATG
TCGTTGACGG GTGAGGCTGG CGCACAACAC TATTCGGACG AGGAACTGCG CGCCATCGTC
GACGAGGCGC ACCGGCGCGG GCTGCGGGTG GCTGCCCACA CCCACGGCGC CGAGGCGGTC
AAACACGCAG TGGCCTGCGG TATCGACTGC ATCGAGCACG GATTCCTGAT GGACGACGAG
GCCATCCAGA TGCTGGTCGA CAACGACCGA TTCCTGGTGA CGACGCGGCG GCTGGCGGAG
TACATGGACG TGTCCAAGGC GCCGCCGGAG TTGCAGGCCA AGGCCGCTGA GATGTTCCCC
AAGGCGCGCA CGTCGATCAA GGCCGCCTAC GAGGCGGGCG TGAAGATCGC CGTCGGCACC
GACGCCCCGG CGATCCCGCA CGGCCGCAAC GCCGACGAAC TCGTCACCCT CGTCGAATGG
GGTATGCCGC CGGCCGCGGT GCTGCGGGCC GCGACCGTCG TGGCCGCCGA TCTGATCAAC
GTCAGCGACC GCGGCCGCCT GGCCGAGGGA CTGCTCGCCG ACATCATCGC CGTACCGGGA
GATCCGTTGT CCGACATCAC CGTCACCCGG CACGTGAACT TCGTCATGAA AGGCGGAAAG
GTCTTCAAGA ATGACAGCGC CAATTAG
 
Protein sequence
MLTLKAAGYV DVDAGEIIRP GIVRVDGDRI VSVGGSPVDG DEVIDLGDSI LLPGLMDMEV 
NLLMGGRGEN PGLSQVQDDP PTRVLRAVGN ARRTLRAGFT TVRNLGLFVK TGGYLLDVAL
GKAIDAGWID GPRVIPAGHA ITPTGGHLDP TMFAAFMPGA LELTVEEGIA NGIDEIRKAV
RYQIKHGAQL IKVCVSGGVM SLTGEAGAQH YSDEELRAIV DEAHRRGLRV AAHTHGAEAV
KHAVACGIDC IEHGFLMDDE AIQMLVDNDR FLVTTRRLAE YMDVSKAPPE LQAKAAEMFP
KARTSIKAAY EAGVKIAVGT DAPAIPHGRN ADELVTLVEW GMPPAAVLRA ATVVAADLIN
VSDRGRLAEG LLADIIAVPG DPLSDITVTR HVNFVMKGGK VFKNDSAN