Gene Mkms_4004 details


Gene Information

Locus tag: Mkms_4004
Symbol:
ID: 4611944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4220480
End bp: 4222315
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 72%
IMG OID: 639793688
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_939986
Protein GI: 119870034
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
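
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function and constant names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4220480
END_BP = 4222315
REPORTED_GENE_LENGTH = 1836      # bp
REPORTED_PROTEIN_LENGTH = 611    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one uncounted stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 4222315 - 4220480 + 1 gives 1836 bp, and 1836 / 3 - 1 gives 611 aa, which agrees with the reported values.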


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTGC TCGAGGTCTC CGGGTTGACG GTGACGTTCG CGACCGACAC CGAGCGGGTG 
GCCGCGGTGC GCGGCCTGGA CTACCGCCTC GACGCCGGTG AGGTGGTGGC GCTGGTCGGC
GAGTCCGGGG CGGGCAAGTC GGCGGGCGCG ATGGCCGTCG CGGGGCTGCT GCCCGAACAT
GCCGAGGTGG CCGGGTCGGT GCGGCTCGAC GGCACCGAAC TGCTCGGGCT CTCCGACGCC
GAGATGTCGA AGATCCGGGG ACGCCGGATC GGCACGGTGT TCCAGGACCC GATGTCGGCG
CTCACGCCGG TGTACACCGT CGGCGACCAG ATCGCGGAGG CCCTGCGCGT GCACAACCGC
GACCTCGACC GCCGCGCCGC GCGCACCCGC GCGGTCGAAC TGCTCGAACT GGTCGGCATC
GCCCAGCCGG AGCGCCGCGC CCGGGCGTTC CCGCACGAAC TGTCCGGCGG CGAACGGCAG
CGGGTGGTCA TCGCCATCGC CATCGCCAAC GACCCCGACC TGCTGATCTG CGACGAGCCG
ACCACCGCGC TGGACGTGAC CGTGCAGGCG CAGATCCTCG AGGTGCTGCG CACCGCGCGC
GACGTCACCG GGGCAGGCGT GCTGATCATC ACCCACGACC TCGGCGTTGT CGCCGAGTTC
GCCGACCGGG CGCTGGTGAT GTACGCCGGC CGGGCCGTCG AGATCGCCTC GGTGTCGGAG
CTGTACACCG AGCGGCGGAT GCCCTACACG GTCGGGCTGC TCGGGTCGGT GCCACGGCTC
GACGCGCGCC AGGGCGAGCG CCTGGTGCCG ATTCCCGGTG CACCGCCGTC GCTGGCGGCC
CTGCCACCCG GCTGCCCGTT CGCACCGCGC TGCCCGCTGG CGATCGACGA GTGCCGCGCC
GCCGAACCGG AGCTCATCGA GGTGGCCCCG GGTCACCTCG CGGCGTGTAT CCGCACCGAA
CACGTCGCGG GCCGCAGCGC CGCCGAGGTG TACGGCGTGT CGACCGCAGC GGCCGCGTCG
CCGCCGTCGG CCGACGATCC CGTGGTGCTC AAGGTGAGCG ATCTGGTGAA GACCTACACG
CTGACCAAGG GCACCGTGTT CCGGCGCCGG ATCGGCGAGG TGCGCGCCGT CGACGGCATC
AGCTTCGAAC TGCAGCAGGG CCGCACGCTG GGGATCGTCG GCGAATCCGG ATCCGGCAAG
TCGACGACGC TGCACCAGAT CCTGGAACTC GAACCGCCGC AGGGCGGTTC GATAGAGGTG
CTCGGCGAGG ACGTGGCCGC ACTCGACACG CGGGCGCGGC GTGCGCTGCG CGGTGACCTG
CAGGTGGTGT TCCAGGATCC GGTGGCCTCG CTCGACCCCC GCCTACCGGT GTTCGAGGTG
TTGGCAGAAC CGTTGCAGGC CAACAACTTC GACAAGTCCC GCATCGACGA GCGGGTGGCC
GAACTGCTCG GCATCGTCGG GCTGCGCCGT GAGGATGCCA GCCGCTACCC CGCCGAGTTC
TCGGGCGGGC AGAAGCAGCG CATCGGCATC GCGCGGGCAC TGGCCACCCA GCCGAAGATC
CTCGCCCTCG ACGAACCCGT CTCCGCGCTG GACGTGTCCA TCCAGGCGGG CATCATCAAC
CTGCTGCTCG ACCTGCAGGA GCGGTTCGGG CTGTCGTATC TGTTCGTCTC GCACGACCTG
TCGGTGGTGC GGCACCTCGC CCACCGGGTG GCCGTCATGC ACAAGGGCGC CATCGTCGAA
CAGGGCGACG GGGACCGGGT CTTCACCGCG CCGCAACACG ACTACACGCG CCGCCTGTTG
GCGGCGGTAC CCCAGCCGAG CGTCCCGCAA CGTTAG
 
Protein sequence
MSLLEVSGLT VTFATDTERV AAVRGLDYRL DAGEVVALVG ESGAGKSAGA MAVAGLLPEH 
AEVAGSVRLD GTELLGLSDA EMSKIRGRRI GTVFQDPMSA LTPVYTVGDQ IAEALRVHNR
DLDRRAARTR AVELLELVGI AQPERRARAF PHELSGGERQ RVVIAIAIAN DPDLLICDEP
TTALDVTVQA QILEVLRTAR DVTGAGVLII THDLGVVAEF ADRALVMYAG RAVEIASVSE
LYTERRMPYT VGLLGSVPRL DARQGERLVP IPGAPPSLAA LPPGCPFAPR CPLAIDECRA
AEPELIEVAP GHLAACIRTE HVAGRSAAEV YGVSTAAAAS PPSADDPVVL KVSDLVKTYT
LTKGTVFRRR IGEVRAVDGI SFELQQGRTL GIVGESGSGK STTLHQILEL EPPQGGSIEV
LGEDVAALDT RARRALRGDL QVVFQDPVAS LDPRLPVFEV LAEPLQANNF DKSRIDERVA
ELLGIVGLRR EDASRYPAEF SGGQKQRIGI ARALATQPKI LALDEPVSAL DVSIQAGIIN
LLLDLQERFG LSYLFVSHDL SVVRHLAHRV AVMHKGAIVE QGDGDRVFTA PQHDYTRRLL
AAVPQPSVPQ R
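
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline: it assumes the blocks of ten characters are purely cosmetic, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first line of the gene sequence is used here to keep the example short.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = clean("ATGAGCCTGC TCGAGGTCTC CGGGTTGACG GTGACGTTCG CGACCGACAC CGAGCGGGTG")
print(translate(first_line))             # MSLLEVSGLTVTFATDTERV
print(round(gc_fraction(first_line), 2))
```

The printed peptide corresponds to the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` in the same way should give a comparable check of the whole record.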