Gene Mkms_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0585 
Symbol 
ID4614968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp641142 
End bp642272 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content66% 
IMG OID639790260 
Productcarboxylate-amine ligase 
Protein accessionYP_936591 
Protein GI119866639 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.670394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00946283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTATCCG TTCCGGCCAG TAGTCGTATC GACTTCGCCG GGTCACCCCG GCCCACCGTC 
GGCGTCGAGT GGGAGTTCGC GCTCGTCGAC GCCCACACCC GCGATCTGAG CAACGAGGCG
GCCACCGTCA TCGCCGAGAT CGGCGAGACC CCGCACGTGC ACAAGGAATT GCTGCGCAAC
ACCGTCGAGG TCGTCACCGG GATCTGCGAG AACACCGGTG AGGCGATGGC GGATTTGCAC
GACACCCTGA AGGTCGTGCG CCGGATCGTG CGGGACCGCG GCATGGAACT GTTCTGTGCC
GGGACGCACC CGTTCGCCAA CTGGTCGACC CAGCAGCTGA CCGACGCGCC GCGTTATGCC
GAGCTGATCA AGCGCACGCA GTGGTGGGGC AGGCAGATGC TGATCTGGGG TGTGCACGTG
CACGTCGGGA TCTCGTCGGC GCACAAGGTC ATGCCCATCA TCTCGTCGCT GCTCAACCAG
TACCCGCACC TGTTGGCGTT GTCCGCGTCC TCGCCGTACT GGGACGGTTC GGACACCGGA
TACGCCAGCA ACCGGGCGAT GATGTTCCAG CAGTTGCCGA CCGCGGGGCT GCCGTTCCAG
TTCCAGTCGT GGCCGGAGTT CGAACGGTTC GTCCACGATC AGAAGAAGAC CGGGATCATC
GACCACATGA ACGAGATCCG GTGGGACATC CGGCCCTCGC CTCATCTGGG CACCGTGGAG
ATCCGGGTCT TCGACGGGGT TTCCAACATC GCCGAACTCG GTTCACTGGT CGCGCTCACC
CACTGCCTGG TCGTCGACCT CGACCGCCGG CTCGACGCGG GGGAGCAGTT GCCGGTCATG
CCGCCCTGGC ATGTGCAGGA GAACAAGTGG CGCGCCGCGC GCTATGGACT CGACGCCGAG
ATCATCCTCG ACGCCGACAG CAACGAGCGG CTGGTCACCG AGGACCTCGA CGACCTGCTC
ACCCGGTTGC AGCCGGTGGC CCGATCCCTG GACTGTGCCG ACGAACTCGC CGGGGTCGCC
GAGATCTACC GGCACGGTGC CAGCTACCAG CGGCAACGCC GGGTCGCCGA GGAACACGAC
GGTGATCTGC TTGCGGTCGT TGACGCCCTG GTCGCCGAAC TGGAGCTATA G
 
Protein sequence
MLSVPASSRI DFAGSPRPTV GVEWEFALVD AHTRDLSNEA ATVIAEIGET PHVHKELLRN 
TVEVVTGICE NTGEAMADLH DTLKVVRRIV RDRGMELFCA GTHPFANWST QQLTDAPRYA
ELIKRTQWWG RQMLIWGVHV HVGISSAHKV MPIISSLLNQ YPHLLALSAS SPYWDGSDTG
YASNRAMMFQ QLPTAGLPFQ FQSWPEFERF VHDQKKTGII DHMNEIRWDI RPSPHLGTVE
IRVFDGVSNI AELGSLVALT HCLVVDLDRR LDAGEQLPVM PPWHVQENKW RAARYGLDAE
IILDADSNER LVTEDLDDLL TRLQPVARSL DCADELAGVA EIYRHGASYQ RQRRVAEEHD
GDLLAVVDAL VAELEL