Gene Mkms_5572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5572 
Symbol 
ID4610369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp79506 
End bp81038 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content68% 
IMG OID639789236 
Productintegrase catalytic subunit 
Protein accessionYP_935571 
Protein GI119854966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.588721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCTCG CCAGGGCGCC TGCCCGACAG ACGGTTTCGG GTGGGGCCAT CGAGTGTGAG 
GAGGGCACTG CGGTGTCGTT GGAGGACCAC AAGCGTCGGG AACGGGCGAA TGCGATCGGG
TTGTTCCGCT ATCAGGTGAT CTGCCCCGCG CTGGAAGAGG GGCTCTCGAC CCGGCAGCGG
GGCCGGGTAG TCCGTGAGAT CGCCGGCCGT CGCCATATCG ACCCGTTCGG CACCCAGGTC
CAGATCGCGC GGGCCACCCT GGACCGCTGG ATCCGGCGCT ACCGCGGCGG CGGGTTCGAA
GCGCTGGTCC CTGAACCGCG CCGGCTGGCC ACCCGCACCG ATGTCCAGGT GCTGGAGTTG
GCCGCCTCGC TCAAACGGGA GAACCCGGCC CGCACCGCCG CGCAGGTGGC CCGGATTCTG
CGCACCGCGA CTGGGTGGGC GCCCTCGGAA TCGACGCTGC TGCGCCACTT CCATCGGTGT
GAGCTGATGG GCCCGGCCGC AGGTCAGAGC GCTGAGGTGT TCGGGCGGTT CGAAGCCCCC
GACCCGAATG AACTGTGGGT CGGCGATGCC CTGCACGGCC CGCGGGTCGG GGACCGCAAA
ACCTACCTCT TCGCGTTCCT CGACGACCAT TCTCGGTTGG TGGTCGGGCA CCGCTTCGGG
TTCGCCGAAG ACACCGTGCG CCTGGCGGCG GCGTTGAAGC CGGCGTTGGC TGCTCGCGGA
GTTCCCGCCT CGATCTATGT CGACAACGGG TCCGCGTTCG TCGATGCGTG GTTGCTGCGG
GCGTGCGCGA AACTCGGGAT CCGGCTGGTG CATTCCGCGC CCGGTCGCCC GCAGGGCCGC
GGCAAGATCG AACGGTTCTT CCGCACCGTG CGTGAACAGT TCCTCGTCGA GGTGACCGAC
ACCACCGCTG AGGACCTCGC TGCGGCCGGG GTCGACCATG CTGGTGCGTT GTTGGAGCTC
AACCGGCTGT TCATGGCCTG GGTTGAAACC GAATACCACC GCCGAATCCA TACCGAGACC
GGGCAATCCC CACTGGCCCG CTGGGAAGCC GGCTTCGACC GGCTCGGCCA CCCTGCGGCG
TTGCCGACCG CCGCGGATCT GACCGAAGCG TTCCTGTGGT CGGAGTTCCG GATAGTGACC
AAAACCGCCA CCGTCTCGCT GCATTCCAAC ACCTACCAGG TCGACCCCGC CCTGGTCGGG
CGCCGGGTGG AACTGGTGTT CTCCCCGTTC GACCTGCAGG CCATCGAGGT CCGCGACCAG
GACAGAAGCT ACGGCCAAGC CATAGCCCAC ACGATCACCC GCCACGCCCA CCCCAAAGCC
CGACCCGAAA TCAGCAGCCA GACACCGCCG GCAGCCACCG GTATCGACTA CCTGGCATTG
ACCGCCGCGG CCCACCATGA GCAGCTGCGC GATGACGAAC GCATCGGCTA CCACGCCCTC
TACGGCGCCC AGGCCACGGC CACCGATGAC AATCAGCTTC CCGGTCAACT CGCGCTGCCC
AGCCTCGATC ACAAGGATGG GGTGTCGGCA TGA
 
Protein sequence
MILARAPARQ TVSGGAIECE EGTAVSLEDH KRRERANAIG LFRYQVICPA LEEGLSTRQR 
GRVVREIAGR RHIDPFGTQV QIARATLDRW IRRYRGGGFE ALVPEPRRLA TRTDVQVLEL
AASLKRENPA RTAAQVARIL RTATGWAPSE STLLRHFHRC ELMGPAAGQS AEVFGRFEAP
DPNELWVGDA LHGPRVGDRK TYLFAFLDDH SRLVVGHRFG FAEDTVRLAA ALKPALAARG
VPASIYVDNG SAFVDAWLLR ACAKLGIRLV HSAPGRPQGR GKIERFFRTV REQFLVEVTD
TTAEDLAAAG VDHAGALLEL NRLFMAWVET EYHRRIHTET GQSPLARWEA GFDRLGHPAA
LPTAADLTEA FLWSEFRIVT KTATVSLHSN TYQVDPALVG RRVELVFSPF DLQAIEVRDQ
DRSYGQAIAH TITRHAHPKA RPEISSQTPP AATGIDYLAL TAAAHHEQLR DDERIGYHAL
YGAQATATDD NQLPGQLALP SLDHKDGVSA