Gene Mkms_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3902 
Symbol 
ID4611837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4108465 
End bp4109979 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content70% 
IMG OID639793581 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_939884 
Protein GI119869932 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCT CGCTGCTGCT CGAGATGTCG GTCTCGGCCG ACCCGGAGCG GACCGCCGTC 
GTCTCCGACG GCCTGCGGTT GTCCGTCGAA GAGCTGTCCG CACTGGCCGA CGGGGGCGCC
GGCGTCATCG CGAAGTCCGG CGCGGCGCAC GTCGCCTACG TCGGCACCGG CGGAGTCCTG
CTGCCGCTGC TGTTGTTCGC CTCGGCCAGG GCCGCGATCC CGTTCACCCC ACTCAACTAC
CGGCTCAGCC GCGAGGGCCT GCACGAACTC ATCGCTCGGC TCCCCGATGC CCTGGTGATC
GCCGACCCGG ACTACCGGGA GGTCGTCGCG GGTGCGGGCA AGCAGCTGAT GGACTCCGAG
GAGTTCCTCG CCGCCGCGCG GACGGCCGAA CCCACCGCGG AGTTCGCCGA CCCCGACTCG
GTGGCCGTCG TGCTGTTCAC CTCGGGTACC ACCTCACGCC CCAAAGCCGT TGAGCTGACC
CACAACAACC TGACCAGCTA TGTGACCGGC ACCGTCGAGT TCGCCTCTGC CGACCCGGAG
GACGCCGCGC TGATCTGTGT ACCGCCGTAC CACATCGCCG GCGTCGGCGC CGCGATGTCG
AATCTGTACG CGGGCCGGAA GATGGTGTAT CTGCGGCATT TCGACGCGCG CGAGTGGATC
CGGCTGGTCA GCACCGAGGG CGTCACCACC GCCACGGTCG TGCCCACCAT GCTCGACCGC
ATCGTGTCCG CGCTGTCCGA GGAGCCGGTG GCGCTGCCCA CGCTGCGCAA CCTGGCCTAC
GGGGGTTCCA AGGTGGCCCT GCCGCTGGTG CGCCGAGCCC TCGAACTGCT GCCCGGCGTC
GGATTCGTCA ACGCCTACGG GCTCACCGAG ACCAGCTCCA CGATCGCAGT GCTCGGCCCC
GACGACCACC GCGCCGCACT CGCCTCCGAC GACGCGGCGG TGGCGCGCCG ACTCGGCTCG
GTGGGTCAGA TCGTGCCCGG TATCGAGGTA CAGATCCGCG CCGACGACGG CACCGTGCTC
GGCCCCGGGG AGACCGGCGA GCTGTTCGTG CGGGGCGATC AGGTCTCGGG TCGCTACACC
GACATCGGCT CGGTGCTCGA CGCGGACGGC TGGTTCCCCA CCAAAGACGT TGCCTCCCTT
GATGAGGACG GCTACCTGTT CATCGGCGGC CGCTCCGACG ACACCATCAT CCGCGGCGGG
GAGAACATCG CACCCGCCGA GATCGAGGAC GTGCTCGTCG AGCACCCCGA CGTGCGTGAC
GTCGCCGTCG TCGGCCCCGA GGATCCGCAG TGGGGGCAGA TCATCGTCGC GGTGGTGGTG
CCGGCTCCCG GCGCGGACCC CGACGCCGAC GATCTGCGCG CGCACGTGCG CAAGCACCTA
CGCGGCTCCC GCACCCCCGA CCGGGTGGTG TTCCGCGCCG AACTACCCAC CAACGCCACC
GGCAAGGTGC TGCGCCGCGA ACTGATCGAG GAATACGCTG CCGCAGCAAG AGACCCCGAG
AAGGAGCCCG CATGA
 
Protein sequence
MSISLLLEMS VSADPERTAV VSDGLRLSVE ELSALADGGA GVIAKSGAAH VAYVGTGGVL 
LPLLLFASAR AAIPFTPLNY RLSREGLHEL IARLPDALVI ADPDYREVVA GAGKQLMDSE
EFLAAARTAE PTAEFADPDS VAVVLFTSGT TSRPKAVELT HNNLTSYVTG TVEFASADPE
DAALICVPPY HIAGVGAAMS NLYAGRKMVY LRHFDAREWI RLVSTEGVTT ATVVPTMLDR
IVSALSEEPV ALPTLRNLAY GGSKVALPLV RRALELLPGV GFVNAYGLTE TSSTIAVLGP
DDHRAALASD DAAVARRLGS VGQIVPGIEV QIRADDGTVL GPGETGELFV RGDQVSGRYT
DIGSVLDADG WFPTKDVASL DEDGYLFIGG RSDDTIIRGG ENIAPAEIED VLVEHPDVRD
VAVVGPEDPQ WGQIIVAVVV PAPGADPDAD DLRAHVRKHL RGSRTPDRVV FRAELPTNAT
GKVLRRELIE EYAAAARDPE KEPA