Gene Mkms_3534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3534 
Symbol 
ID4611464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3721273 
End bp3722922 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID639793210 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_939518 
Protein GI119869566 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.820035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG GATTCCAGCA TGTGCACAGC GACCTGACCG ATGGCTTCGT CCCATTCCCC 
GAAGACCGCG CCGAGCAGTA CCGGCGGGCC GGGTACTGGA CCGGGCGGCC CCTCGAGTCC
CTTCTCCTTG ACGCGGCACA CCGACGGCCC GATCACCCGG CCGTCGTCGA CGTCGACGGA
TCCCTCACGT TCGCGGAGCT GACCGCGCGC GCCGACACCG TCGCCGCGGC GCTGGCCGGA
CTCGGGATCC GTCCTGGTGA CCGGGTGCTG CTGCAACTAC CCAACTCCGT GCGTTTCGCC
GTCGCGTTCT TCGGTCTGCT GCGGGCGGGC GCGGTGCCGG TGATGTGCCT GCCCGGCCAC
CGCACCGCCG AGCTGGGACA CTTCGCCGAC GTCAGCGGCG CGGTGGCGCT GATCGTGCCC
GACGAGGTCG GCGGCTTCGA CTACCGCGAG ATGGCCGCCC AGTTGGTGGC GGACCGGCCC
ACGCTGCGGC ACGTGCTCGT CGACGGTGAC CCGGGACCGT TCCTATCGTG GGCGGCACTG
ATCGACAGCG GCGGCGTGGC ACCGGAGATC GGACCCGTCG ACACCTCGCT GCCTGCCCTG
CTGCTGGTCT CCGGCGGAAC GACCGGACTG CCCAAGCTGA TCCCCCGCAC CCACGACGAC
TACGTCTACA CCGCGGTGTC GAGCGCGCAG GCGTGCCACT TCACCCCCGA CGACGTCTAC
CTGGTGGCGT TGCCGGCCGG CCACAACTTC CCGTTGGCGT GCCCCGGCAT GCTGGGGGCA
ATGACCGTGG GGGCGACGAC GGTGTTCACC GCCGATCCCA GCCCGGAGGA GGCGTTCGCG
CTCATCGACA AACACCAGGT GACCGTCACC GGGCTGGTCA ACGCGCTGGG CAAGCTGTGG
GCGCAGGCGT GCGACTGGGA ACCGGTGCTG CCGACGTCGC TGCGCCTGGT GCAGGTGGGC
GGTTCGCGGA TGAGCCCGGA GGAGGCGCGG TTCATCCTCG ATCGCCTGAC CCCCGGGCTG
TCCCAGATCT TCGGAATGGC CGAGGGCATG CTGAATTTCA CGCGGCCGGG AGATCCGCTC
GACGTCGTGG TGCACACCCA GGGCAGGCCG GTCTCCCCGC ACGACGAGAT GCGGGTGGTC
GACGAGTCCG GTGTCGAGGT CGCACCGGGT GAAGAGGGCG AACTGCTGGT GCGCGGTCCC
AACACACTCA ACGGGTACTA CCGGGCCGAC GAGGCCAACG CCCGGTGCTT CAGCCCGGAC
GGCTTCTACC GCACCGGGGA CCGGGTGCGG ATCTTCGCCG ACGGCCCGCT GGCCGGCAAC
GTCGAGGTGA CCGGCCGGAT CAAGGACGTC ATCCACCGCG GCGGTGAGAC GGTTTCGGCG
ACCGACCTCG AGGACCATCT GCTGACCCAC CCCGCGATCT ATGCGGCGGC CGCCGTCGCG
CTACCCGACG ACTATCTCGG TGAGAAGATC TGCGCCGCAG TGGTGTTCCG CGGCAAGCAG
CTCACGCTCG CCGAACTCAA CGCATTCCTC GACGAACGCG GGGCGTCCAC TCACGCCCGG
CCCGACGTGC TGGCTGCGAT GCCGTCGCTG CCGCTGACCG CGGTCGGCAA GGTCGACAAG
AAGAAGCTCG TCGCCCAGCT GACGGGGTGA
 
Protein sequence
MSTGFQHVHS DLTDGFVPFP EDRAEQYRRA GYWTGRPLES LLLDAAHRRP DHPAVVDVDG 
SLTFAELTAR ADTVAAALAG LGIRPGDRVL LQLPNSVRFA VAFFGLLRAG AVPVMCLPGH
RTAELGHFAD VSGAVALIVP DEVGGFDYRE MAAQLVADRP TLRHVLVDGD PGPFLSWAAL
IDSGGVAPEI GPVDTSLPAL LLVSGGTTGL PKLIPRTHDD YVYTAVSSAQ ACHFTPDDVY
LVALPAGHNF PLACPGMLGA MTVGATTVFT ADPSPEEAFA LIDKHQVTVT GLVNALGKLW
AQACDWEPVL PTSLRLVQVG GSRMSPEEAR FILDRLTPGL SQIFGMAEGM LNFTRPGDPL
DVVVHTQGRP VSPHDEMRVV DESGVEVAPG EEGELLVRGP NTLNGYYRAD EANARCFSPD
GFYRTGDRVR IFADGPLAGN VEVTGRIKDV IHRGGETVSA TDLEDHLLTH PAIYAAAAVA
LPDDYLGEKI CAAVVFRGKQ LTLAELNAFL DERGASTHAR PDVLAAMPSL PLTAVGKVDK
KKLVAQLTG