Gene Mkms_4780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4780 
Symbol 
ID4616195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5010802 
End bp5012322 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content69% 
IMG OID639794471 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_940760 
Protein GI119870808 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CCCCGAGGAC GATCCCTGCC GTTCTCGACC GGATTGCAGA GCAGTTCTCC 
GACCACGAGG CCGTGGTCAC GGACGATCGC CGCCTGACCT ACGCGCAGCT GCGTGACGAG
GTGCGCCGGG CCGCCGCGGC GATGATCGAC CTCGGCATCG CCGCGGGTGA CCGCGTCGCG
ATCTGGTCAC CGAACACCTG GCACTGGGTC GTCGCAGCCC TGGCCACGAC CTACGCCGGC
GGCGTCGTCG TCCCGCTCAA CACCCGCTAC ACCGCAAGCG AGGCGAGCGA CATCCTCGCC
CGCACCGCCG CGCCCCTGTT GATCACGGCA GGGAAGTTCC TCGGCGCGGA CCGGTCGGCC
GACCTCGACC GCTCGGCGCT GCCGGCACTT CGTCACATCG TGCGGGTGCC GATCGAGACA
GCCGACGGTA CATGGGACGA CTTCGTCTCG CGCGGAACGG ATCTCGCTGC GGCCGACGCG
CGGGCCGCGG CCGTCCGCCC CGACGACGTG GCCGACATCC TGTTCACCTC GGGGACCACG
GGACGCAGCA AGGGTGTGCT GTGCGCGCAC CGTCAGTCCC TGGACGCGCC CGCGGCGTGG
GCGGAGTGCG GACAGCTCAC CAGCTCCGAC CGGTATCTGT GCATCAACCC GTTCTTCCAC
AACTTCGGAT ACAAGGCCGG GATTCTGACC TGCCTGCAGA CCGGGGCCAC GCTGATCCCG
CAGCTGACGT TCGATCCCGA GAAGGCGATG GCCGCCGTCG CCGAACAGCG GATCACCGTG
CTTCCCGGCC CCCCGACGAT CTACCAGACC ATCCTCGACC ACCCGAAACG CGCCGAGTAC
GACCTGACGT CGCTGCGATT CGCGGTCACC GGCGCCGCCG TCGTCCCCGT CGTGCTGATC
GAGCGGATGC AGTCCGAACT CGACATCGAC ATCGTGCTGA CCGCCTACGG GCTGACCGAG
GCGAGTGGCT TCGGCACGAT GTGCCGGGCC GACGACGACG CGGTCACCGT CGCCACCACC
TGCGGACGGC CGATCGCCGG CTTCGAACTG CGCATCGGCG ATTCGGGCGA GGTGCTGCTG
CGCGGGCCGA ACGTGATGCT CGGCTATCTC GACGACCCGG AGGCCACCGC GGCCGCGATC
GACCCCGACG GCTGGCTGCA CACCGGCGAC GTCGGCACCG TCGACGAACG CGGCAACCTG
CGGATCACCG ACCGGCTCAA GGACATGTAC ATCTGCGGCG GCTTCAACGT CTATCCCGCG
GAGATCGAAC AGGTCCTCGC CCGCCTCGAC GGGGTCGCCG AATCGGCCGT GATCGGGGTG
CCCGACGAGC GGCTCGGTGA GGTCGGCAAG GCCTTCGTCG TCGCCAAACC GGGTGCGAAC
CTCGACGAAC AGGCCGTGAT CGCCTACGCG CGTGACCATC TCGCGAATTT CAAGACGCCG
CGGTCGGTGG AATTCCTCGA CGTGCTGCCC CGCAACCCGG GCGGCAAGGT CGTCAAACCG
CTCCTGAGGA AGAGAGCCTG A
 
Protein sequence
MTTAPRTIPA VLDRIAEQFS DHEAVVTDDR RLTYAQLRDE VRRAAAAMID LGIAAGDRVA 
IWSPNTWHWV VAALATTYAG GVVVPLNTRY TASEASDILA RTAAPLLITA GKFLGADRSA
DLDRSALPAL RHIVRVPIET ADGTWDDFVS RGTDLAAADA RAAAVRPDDV ADILFTSGTT
GRSKGVLCAH RQSLDAPAAW AECGQLTSSD RYLCINPFFH NFGYKAGILT CLQTGATLIP
QLTFDPEKAM AAVAEQRITV LPGPPTIYQT ILDHPKRAEY DLTSLRFAVT GAAVVPVVLI
ERMQSELDID IVLTAYGLTE ASGFGTMCRA DDDAVTVATT CGRPIAGFEL RIGDSGEVLL
RGPNVMLGYL DDPEATAAAI DPDGWLHTGD VGTVDERGNL RITDRLKDMY ICGGFNVYPA
EIEQVLARLD GVAESAVIGV PDERLGEVGK AFVVAKPGAN LDEQAVIAYA RDHLANFKTP
RSVEFLDVLP RNPGGKVVKP LLRKRA