Gene Mkms_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3822 
Symbol 
ID4611757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4036075 
End bp4037301 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID639793502 
Productamidohydrolase 
Protein accessionYP_939805 
Protein GI119869853 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.28333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACAC TGAAGGCGGC CGGGTACGTC GACGTCGATG CCGGGGAGAT CATCCGCCCC 
GGCATCGTCC GTGTCGACGG TGACCGGATC GTCTCCGTCG GCGGATCGCC GGTCGACGGT
GACGAGGTGA TCGATCTCGG CGACTCGATC CTGTTGCCCG GCCTGATGGA CATGGAGGTC
AACCTCCTGA TGGGCGGCCG GGGCGAGAAC CCCGGCCTGT CCCAGGTGCA GGACGACCCC
CCGACCCGGG TGTTGCGCGC GGTGGGCAAC GCCAGGCGCA CCCTGCGCGC CGGGTTCACC
ACAGTGCGCA ACCTCGGTCT GTTCGTCAAG ACCGGCGGAT ACCTGCTCGA CGTCGCGCTC
GGTAAGGCGA TCGACGCCGG CTGGATCGAC GGGCCGCGTG TCATCCCGGC GGGACACGCG
ATCACGCCGA CCGGCGGCCA TCTCGACCCC ACGATGTTCG CGGCGTTCAT GCCGGGCGCA
CTGGAGTTGA CGGTCGAGGA GGGCATCGCC AACGGCATCG ACGAGATCCG CAAGGCCGTG
CGCTACCAGA TCAAACACGG CGCCCAGCTG ATCAAGGTGT GCGTATCCGG CGGCGTCATG
TCGTTGACGG GTGAGGCTGG CGCACAACAC TATTCGGACG AGGAACTGCG CGCCATCGTC
GACGAGGCGC ACCGGCGCGG GCTGCGGGTG GCTGCCCACA CCCACGGCGC CGAGGCGGTC
AAACACGCAG TGGCCTGCGG TATCGACTGC ATCGAGCACG GATTCCTGAT GGACGACGAG
GCCATCCAGA TGCTGGTCGA CAACGACCGA TTCCTGGTGA CGACGCGGCG GCTGGCGGAG
TACATGGACG TGTCCAAGGC GCCGCCGGAG TTGCAGGCCA AGGCCGCTGA GATGTTCCCC
AAGGCGCGCA CGTCGATCAA GGCCGCCTAC GAGGCGGGCG TGAAGATCGC CGTCGGCACC
GACGCCCCGG CGATCCCGCA CGGCCGCAAC GCCGACGAAC TCGTCACCCT CGTCGAATGG
GGTATGCCGC CGGCCGCGGT GCTGCGGGCC GCGACCGTCG TGGCCGCCGA TCTGATCAAC
GTCAGCGACC GCGGCCGCCT GGCCGAGGGA CTGCTCGCCG ACATCATCGC CGTACCGGGA
GATCCGTTGT CCGACATCAC CGTCACCCGG CACGTGAACT TCGTCATGAA AGGCGGAAAG
GTCTTCAAGA ATGACAGCGC CAATTAG
 
Protein sequence
MLTLKAAGYV DVDAGEIIRP GIVRVDGDRI VSVGGSPVDG DEVIDLGDSI LLPGLMDMEV 
NLLMGGRGEN PGLSQVQDDP PTRVLRAVGN ARRTLRAGFT TVRNLGLFVK TGGYLLDVAL
GKAIDAGWID GPRVIPAGHA ITPTGGHLDP TMFAAFMPGA LELTVEEGIA NGIDEIRKAV
RYQIKHGAQL IKVCVSGGVM SLTGEAGAQH YSDEELRAIV DEAHRRGLRV AAHTHGAEAV
KHAVACGIDC IEHGFLMDDE AIQMLVDNDR FLVTTRRLAE YMDVSKAPPE LQAKAAEMFP
KARTSIKAAY EAGVKIAVGT DAPAIPHGRN ADELVTLVEW GMPPAAVLRA ATVVAADLIN
VSDRGRLAEG LLADIIAVPG DPLSDITVTR HVNFVMKGGK VFKNDSAN