Gene Mkms_5591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5591 
Symbol 
ID4610342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp100414 
End bp102561 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content65% 
IMG OID639789254 
Productintegrase catalytic subunit 
Protein accessionYP_935589 
Protein GI119854984 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATG CCGCGGTGCC CGTGGGTGTG GGGTCGCGCA TCATCTACGA AGGCGAAGCG 
TTAGAGATCG TCGAGATGCA CGTGACGGCT TCGTCCCTGG AAGTGTTGTC GAGGGGTCTG
CGCGGCAACG TGATTCGTCG ATTATCCATG AGTGAATTGC TGATGTCTGA TCGCGCTCGG
GTGCTCAGCT CTGATGAAGG ACCCGCTTCC GATGATCCCG GCGAGGTCGC GGCGGTGACA
TTGTCTGCGG TCTCCGCGGC CGCCCGGAAA CAAGCGCTGG AACGGGCGGC CCACGTGCGA
GAAGTGTTGA CAGGTTACCG CTCCGGATGC CAGGAGACCG CGCTGCCGTA TGAACCGCGC
ACTGCGTACC GGTCCGAGCT GCCGGTGATG GAACGTTATG CCGCCAAGGC GGCCGAGTTG
GGTGTGCAGT CCCGCACGGT CGAGCGGTGG GTTGCGCAGT ACCAGGCGCA CGGGGAAGCG
GGATTGATCT CGCAGCGGGC GGTGCAGCCT GGCATGGGTG GGCGGCAGGA TCCGCGGTGG
CAGCAGACGG CCCTGGAGGT GATGAGCGAG TACACGGATC TGTCGAAGCC CAACGAGGAC
TTGGTGATTC GACGTACCCG TGCGCGTCTG GACGCGCGGT TCGGCGCCGG TGTGGTGAAG
GTGCCGTCGC AGCGCACGGC GTATCGGATC CTGGCCCGGC TCGAGGACCA GCGGCCGCTG
TTTAAGCAGA GCACGAAACG CAACCGCGAC ATCGCTTCCC GATCGAACGA GGTGTACGGC
AAGCTGCATC CGATGCGGCC GGGGGAGTAC CTGCTGATGG ACACCACTCG CCTGGATGTG
TTCGCGATGG ATCCCTACAC GCTGACGTGG GTGAACGCCG AGCTGACGGT CGCGATGGAC
TGGTACAGCC GCTGCGTCAC CGGGTTGCGG CTGACCCCGG TGTCGACGAA ATCCATTGAT
GCGGCCGCGG TGTTGTACGA GACGTTTCGG CCGCAACCGG CGGGGCGGGA CTGGCCGGCC
GAAGCGATGT GGCCCCCACG CGGGATCCCG CGCTCGGTGC TCGTCGAACA GGACGCTCTC
GACCCGGGCA GTGTGCCGGC GGCGACCCCG GCGGTCGTTC CCGAGACGTT GGTCGTCGAT
CACGGCAAAA TCTATGTCAG CGCGCACCTC AACAGCGTGT GTCAGCGGTT GGGGATCTCA
ATCCAGCCGG CCCGGTTGCG CCAACCCCGC GATAAGGGGC CGGTGGAGCG GTTCTTCTCC
ACGCTGCGCG TGGGTCTGCT GCAGGAGCTG CCCGGCTATA AGGGGCAGGA CATTTTCGCG
CGCGGGGTGT CACCGGAACA GGATGCCTGG TTCTACCTCG ATGAGCTGGA AGCCATCATC
CGGGAGTGGA TCGCCGTCAT CTATCACCAC AGCGCCCACG ACAGTCTGTC CGGGCTGGGG
GTGCCGAAGC TGACGATGAC CCCGGCGGAG ATGTTCGCCC ACGGGGTGGC GCGCGCCGGC
TATCTGGAGG TGCCGCGCGA TCCGGATCTC GCCTACGAGT TTCTGCCGGT GGTGTGGCGC
GTCATCCAGC ACTATGGCAT CGACGTGGGA GGGCGCCGCT ACAAAGGAGA CATCGTCGCC
GGACGCGCGA AGGAGAAAAG CCCGTATCCG AACCAGAAGT GGCCGATCGC CTACAACGTC
GACGACATCA CCAAAGTGTA TTTCCGCGAT GACCGCACCC GCCACTGGCA TCCGTTGGTA
TGGGAGCACG CCGACATGCT TGACGCCCCG ATGAGCGAGG AAATCTTGCG TTTCGAGCGC
ACCTTGGCCA AAGCCCAGAA CCGGTATGTC GATGACCCGC TGGCCATCAC TAGCTTCCTG
GAACGCCGCA AACTCAGCGT CGCGAACTCG ATGGCCGAGC GTCGCCGGGC ACTGCGGGTG
GCCCGCGAGC AGTCCAGCCT CATCGGTGAT CTCAACCCGC CACCGGACAG CGCCGAATTG
AGCTCGGTCG CTACAGCACT GGATCGCGAA CCTCGCGAGC GGGCCCACGA CGCCGGCGCT
GATGCGTTCG TTGATGACCT CGACGAGGAA CCCGGCGACC TCGATGACGC CGACGAGCCG
CCGCTGGAGC AGGGCAGCTT CTACGACGGT GTGCTGGAGG TGGAATGA
 
Protein sequence
MTNAAVPVGV GSRIIYEGEA LEIVEMHVTA SSLEVLSRGL RGNVIRRLSM SELLMSDRAR 
VLSSDEGPAS DDPGEVAAVT LSAVSAAARK QALERAAHVR EVLTGYRSGC QETALPYEPR
TAYRSELPVM ERYAAKAAEL GVQSRTVERW VAQYQAHGEA GLISQRAVQP GMGGRQDPRW
QQTALEVMSE YTDLSKPNED LVIRRTRARL DARFGAGVVK VPSQRTAYRI LARLEDQRPL
FKQSTKRNRD IASRSNEVYG KLHPMRPGEY LLMDTTRLDV FAMDPYTLTW VNAELTVAMD
WYSRCVTGLR LTPVSTKSID AAAVLYETFR PQPAGRDWPA EAMWPPRGIP RSVLVEQDAL
DPGSVPAATP AVVPETLVVD HGKIYVSAHL NSVCQRLGIS IQPARLRQPR DKGPVERFFS
TLRVGLLQEL PGYKGQDIFA RGVSPEQDAW FYLDELEAII REWIAVIYHH SAHDSLSGLG
VPKLTMTPAE MFAHGVARAG YLEVPRDPDL AYEFLPVVWR VIQHYGIDVG GRRYKGDIVA
GRAKEKSPYP NQKWPIAYNV DDITKVYFRD DRTRHWHPLV WEHADMLDAP MSEEILRFER
TLAKAQNRYV DDPLAITSFL ERRKLSVANS MAERRRALRV AREQSSLIGD LNPPPDSAEL
SSVATALDRE PRERAHDAGA DAFVDDLDEE PGDLDDADEP PLEQGSFYDG VLEVE