Gene Mkms_5421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5421 
Symbol 
ID4613105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5653432 
End bp5655072 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content67% 
IMG OID639795116 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_941397 
Protein GI119871445 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAGCA CCATGCAGGA CGTCCCGCTG ACAGTCGCCG CGATTCTGCG GTACGCCGCC 
ACAGTCCACG GCGACCGAAC GGTGACGACC GCGACGGGAA ACGGTGGTTA CCGGCACGCC
ACCTACCGTG AGGTCGGACA GCAGGCGGCC CGACTGGCCC ATGCGCTGCG CCGGTTCGGG
ATCGAGGGTG ACGACCGCGT CGGGACGTTC ATGTGGAACA ACCAGGAGCA TCTGGAGGCC
TACGTCGCGG TGCCCTCGAT GGGCGCGGTG CTGCACACGC TGAACATCCG GCTGTTCCCG
GAGCAGATCG AGTTCGTCGC CTACGAGGCC GAGGACCGCG TGCTGATCGC CGATCTGTCG
CTGGCGCCGG TGCTTGCTCC GGTGCTGCGC TCGCTGGAGA CCGTGCACAC GGTGATCGCC
GTCGGTGAGG GTGACCTGGC GCCGTTCGAG GAGTCGGGCA AACGAGTGGT GCGCTACCAC
GAGGTGACCG CCGCCGAATC CGACGAGTAC GACTGGCCCG ACATCGACGA GAACTCCGCT
GCCGCAATGT GTTACACCAG CGGAACCACC GGCCACCCGA AGGGCGTCGT ATACGGGCAC
CGGTCGAGCT ACCTGCACTC GATGGCGGTC TGCGGGGGCA ACGGACTCGG GATGAGCTTC
TCCGACAAGG CGCTGCCGAT CGTGCCGATG TTCCACGCCA ATGCCTGGGG TCTGCCCTAC
GCCGCCCTGA TGGCCGGAGC TGACCTGGTG CTTCCCGACC GCTTCATGGA CGCCACGTCC
CTCGTCGACC TGATCGAGAC GCAGCGGCCG ACTGTCGCCG GTGCGGTACC CACAATCTGG
AACGACGTCA TGCACCACCT CGACCAGAAC CCCGGCCACG ACATCTCCTC GCTGCGGCTC
GTCGGCTGCG GCGGGTCGGC GGTGCCGGTG TCGCTGATGA AGGCGTTCGA GGAGAAGTTC
GGCGTGCAGA TCCGGCAGTT GTGGGGGATG ACGGAGACGT CCCCGGTTGC GACGATCGCG
TGGCCGCCGC CGGACACCCC CGCGGAGAAG CACTGGCAGA TCCGCAGCAC CCAGGGCCGT
CCGCTCTGCG GTGTGGAGGC CCGCATCGTC GACGACGACG GTGCGGTGCT GCCCAACGAC
GGTGAGTCAG TCGGTGAACT CGAGGTCCGC GGGCCCTGGA TCACCGGGTC CTATTACCGC
AACACCGACG ACTCGAAGTT CCAGTCGGGT TGGCTGCGCA CCGGCGACGT GGGCCGTATC
GATCCGCAGG GTTACATCAC CCTGACCGAC CGCGCCAAGG ACGTCATCAA GTCCGGCGGT
GAGTGGATCT CGTCGGTGGA GTTGGAGAAC CACCTCATCG CGCATCCGGC GGTGCGAGAA
GCCGCCGTGG TCGGGGTGCC CGACGAGCGT TGGCAGGAGC GGCCATTGGC GGCGGTCGTC
GTCCAGGAGG GTGCCCAGGT GGACGCTGAC GAACTGCGGA ACTTCCTGGC CGACAAGGTT
GTCCGGTGGT GGCTTCCCGA GCGCTGGACC TTCGTCGACG AAATCCCACG CACCAGCGTC
GGTAAGTACG ACAAGAAGGT GATCCGGGCG CGCTACGCGG ACAACGCATA CCAGGTCGCC
GACCTGCGCG AGCACACGTA G
 
Protein sequence
MYSTMQDVPL TVAAILRYAA TVHGDRTVTT ATGNGGYRHA TYREVGQQAA RLAHALRRFG 
IEGDDRVGTF MWNNQEHLEA YVAVPSMGAV LHTLNIRLFP EQIEFVAYEA EDRVLIADLS
LAPVLAPVLR SLETVHTVIA VGEGDLAPFE ESGKRVVRYH EVTAAESDEY DWPDIDENSA
AAMCYTSGTT GHPKGVVYGH RSSYLHSMAV CGGNGLGMSF SDKALPIVPM FHANAWGLPY
AALMAGADLV LPDRFMDATS LVDLIETQRP TVAGAVPTIW NDVMHHLDQN PGHDISSLRL
VGCGGSAVPV SLMKAFEEKF GVQIRQLWGM TETSPVATIA WPPPDTPAEK HWQIRSTQGR
PLCGVEARIV DDDGAVLPND GESVGELEVR GPWITGSYYR NTDDSKFQSG WLRTGDVGRI
DPQGYITLTD RAKDVIKSGG EWISSVELEN HLIAHPAVRE AAVVGVPDER WQERPLAAVV
VQEGAQVDAD ELRNFLADKV VRWWLPERWT FVDEIPRTSV GKYDKKVIRA RYADNAYQVA
DLREHT