Gene Mkms_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4020 
Symbol 
ID4611960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4240972 
End bp4242543 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID639793704 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_940002 
Protein GI119870050 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.165206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACC TTCGACCCCT CTGGGACACC GAGGCGTTCA GCGGTGTCTA CGCCGGCCAC 
CCCGGCCGCC TCTACCGCCG CGGCCCCGCC ACGATCACCG ACCTCATCGC GGGCACGCGC
TGCTGGACCG ACCGTGAGTT CCTGGTTCAC GGCGACCGCC GGATCTCCTA TGCCGCATTC
CGCGGCGCGC TCGGCCGGGT CGGCGCGCAT CTGGCCGATC TCGGGGTGCG GCCGCGCGAC
CGGGTGATGG TGTTCGGCTA CAACAGCCCG GAGTGGATCG TGGCCCTGTT CGCCCTCCTT
CTGCAGGGGG CGGTTCCGGT GCTCGGCAAC CGGTGGTGGA GCCCCGCCGA GGTGGCCCAT
GCCGCCGAAC TGCTCGACCT GCGGCACATC CTCACCGACA CCGCGCTCGA CACCGATCGC
CCGGCCAGCC CGCTCGCCGA CCTGGCGTAC GCATTCGACG CACCGGCCGG TCCGGCGTCG
CACGCCGATG AAGACGTCGA CATCGACGAG GTCGCGATCG TCCTGTTCAC CTCCGGCAGC
TCCGGGCTGC CGAAAGCCGT CGAGCTGTCC CGCCGTTCGG TGATCGCCAA CCAGCAGAAC
ATCCTGACGC GCAACGGCAG GCTGCCCCAC CTGCTGAACG CCGACTCCCC GCAGGCCGTC
AGCCTCGCGA GCACGCCGAT GTTCCACATC GGCGGCCTGT CGAGCCTGCT CACGCATTTC
CTGACCGGCG GCCGAATCGT GTTGGCGCAG GGCCGGTTCG ACCCCGGCCA GGTGATGGCG
CTCGTCGAAC GCGAACGGGT GCAGGTCTGG GGTGCGGTGC CCACCATGGC CGTGCGCGTC
CTCGAACACC CCGAGTTCGG GTCCCGCGAC CTGAGCAGCC TGCGGTCGTG GCCGCTGGGG
GGCGCTCCGG TGAGCCCCGA ACTCCTCGAA CGCATCCGCA CGCAGCTGCC CACCCTGCGT
GAGCGTGGGC TCTCCAACAC CTGGGGCATG ACCGAGGCCG GCGGCTTCCT CACCGTCGCC
GACAGCCGGG ACCTGCGCGC GAGGCCGGGC ACGGTGGGCA GGCCCTATCC CGTCGTCGAG
TTGCGCATCG ACCGGCCCGA CGACGACGGC GTCGGCGAAG TGCTGGCCCG CTCCCCCACC
GTGATGCTGG GTTACGCGGG TCGCGCGGAC GACGACACCG TCGACGCCGA CGGCTGGTTG
CACACCGGCG ACCTCGGCCA CCTCGACGAC GACGGCTATC TCTACATCGA CGGCCGCAGC
AAGGACGTCG TGATCCGCGG CGGTGAGAAC ATCGCCTGCC CGCACGTCGA GGCGGCGCTC
GCCAGCCATC CGGCCGTCGT CGAGGCCGCG GCCCTGGGCC TGCCCCATCC CGACCTGGGC
GAAGAGCTCG CCGCGGTGGT CGTCTACCGC AGCGGTGCAC TGCCGCCGAC CGACGACGAG
CTGCGGCGCC ACCTCGCGGG CATCGTGTCG TCGTTCGCCG TCCCGACCCG CTGGCTCATC
CGGACCGAAC CGCTCCCCAC GCTCGCCGGG GAGAAGATCG ACAAGAAAAC CCTGGCGACC
GCATTCGACT GA
 
Protein sequence
MTDLRPLWDT EAFSGVYAGH PGRLYRRGPA TITDLIAGTR CWTDREFLVH GDRRISYAAF 
RGALGRVGAH LADLGVRPRD RVMVFGYNSP EWIVALFALL LQGAVPVLGN RWWSPAEVAH
AAELLDLRHI LTDTALDTDR PASPLADLAY AFDAPAGPAS HADEDVDIDE VAIVLFTSGS
SGLPKAVELS RRSVIANQQN ILTRNGRLPH LLNADSPQAV SLASTPMFHI GGLSSLLTHF
LTGGRIVLAQ GRFDPGQVMA LVERERVQVW GAVPTMAVRV LEHPEFGSRD LSSLRSWPLG
GAPVSPELLE RIRTQLPTLR ERGLSNTWGM TEAGGFLTVA DSRDLRARPG TVGRPYPVVE
LRIDRPDDDG VGEVLARSPT VMLGYAGRAD DDTVDADGWL HTGDLGHLDD DGYLYIDGRS
KDVVIRGGEN IACPHVEAAL ASHPAVVEAA ALGLPHPDLG EELAAVVVYR SGALPPTDDE
LRRHLAGIVS SFAVPTRWLI RTEPLPTLAG EKIDKKTLAT AFD