Gene Mkms_5054 details


Gene Information

Locus tag: Mkms_5054
Symbol:
ID: 4612734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5293547
End bp: 5294656
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 70%
IMG OID: 639794748
Product: ABC transporter related
Protein accession: YP_941033
Protein GI: 119871081
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components
TIGRFAM ID:
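
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the reported 1110 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5293547, 5294656)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both values agree with the record: 1110 bp and 369 aa.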


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.763811
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCT TCGAGCACAT CACCAAGCGC TACCCGGACG GCACCGTGGC CGTCGACGAT 
CTCAGTCTCG AGGTGCCCGA GGGCACGCTG ACGGTGTTCG TCGGTCCCTC GGGCTGCGGT
AAGACCACGT CGATGCGGAT GATCAACCGG ATGATCGAAC CCACCTCGGG CACGCTGACC
GTCGACGGGA ACGACGTCAC CAAGGTCGAC GCCGTCAAGC TCCGGCTGGG TATCGGCTAC
GTCATCCAGA GCGCCGGGCT GATGCCGCAC CTGCGGGTGG TCGACAACGT CGCGACCGTG
CCGGTGTTGC GCGGCGAATC CCGCCGCAGC GCCCGCAAGG CGGCACTGGC GGTGATGGAA
CGCGTCGGCC TCGACCCGAA GCTGGGCGAC CGCTACCCCG CGCAGTTGTC CGGTGGTCAG
CAACAGCGCG TCGGGGTGGC CCGTGCGCTG GCCGCCGATC CGCCGATCCT GTTGATGGAC
GAACCGTTCA GCGCCGTCGA TCCGGTGGTC CGCGAGGAAC TGCAGACCGA GATCGTGCGG
CTGCAGAACG AACTGCGCAA GACGATCGTG TTCGTCACCC ACGACATCGA CGAGGCGATC
AAACTCGGCG ACAAGGTCGC GGTGTTCGGC CGCGGCGGGG TGCTGCTGCA GTACGACGCG
CCTGCGCGGC TGTTGTCCAA CCCGGCCGAC GATGCGGTGG CCGGTTTCGT CGGGGCCGAC
CGCGGTTACC GCGGCCTGCA GTTCTACCCG GCCACCGGCA CCGCAGGCCT TCCGCTGCAC
GACCTCCGTC ATGTGCGCGA ATCCGAGATC GACGCGCTGC AACTGGCGCC GGGGGACTGG
GTGCTGGTGA CCCGGCCCGA CGGAACGCCC TACGCGTGGA TCAACGCCGA CGGTGTCGCG
CTGCACCGCA ACGGGAGTTC GCTCTACGAC AGCACGATCG CGGGCGGTTC GCTGTTCCGG
CCGGACGGCA CGCTGCGGCT GGCCCTCGAC GCGGCGCTGT CCTCGCCGTC GGGTCTCGGC
GTCGCCGTCG ACGAGCACGG TCAGGTGATC GGCGGGGTGC GCGCCGACGA CGTGCTCGCC
GCGCTCGACC GGCAGCGGCA GGAGGCCTGA
 
Protein sequence
MITFEHITKR YPDGTVAVDD LSLEVPEGTL TVFVGPSGCG KTTSMRMINR MIEPTSGTLT 
VDGNDVTKVD AVKLRLGIGY VIQSAGLMPH LRVVDNVATV PVLRGESRRS ARKAALAVME
RVGLDPKLGD RYPAQLSGGQ QQRVGVARAL AADPPILLMD EPFSAVDPVV REELQTEIVR
LQNELRKTIV FVTHDIDEAI KLGDKVAVFG RGGVLLQYDA PARLLSNPAD DAVAGFVGAD
RGYRGLQFYP ATGTAGLPLH DLRHVRESEI DALQLAPGDW VLVTRPDGTP YAWINADGVA
LHRNGSSLYD STIAGGSLFR PDGTLRLALD AALSSPSGLG VAVDEHGQVI GGVRADDVLA
ALDRQRQEA
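
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such blocks might be cleaned, translated, and checked for GC content. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The helper names (`clean`, `gc_content`, `translate`) are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATCACCT TCGAGCACAT CACCAAGCGC"
print(translate(first_blocks))   # MITFEHITKR
print(gc_content("ATGATCACCT"))  # 0.4
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the reported GC content of 70%.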