Gene Mkms_5785 details


Gene Information

Locus tag: Mkms_5785
Symbol:
ID: 4610452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008703
Strand:
Start bp: 299504
End bp: 300919
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 67%
IMG OID: 639789440
Product: Type IV secretory pathway VirD4 components-like protein
Protein accession: YP_935775
Protein GI: 119855170
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
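
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(299504, 300919)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the Gene Length and Protein Length fields listed above.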


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.435047
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0557182
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCGA CACCGGCCGC CTCCGGCGGC TATCGGCCAC CGGTGACCCC GTTCGCCGGC 
TGGGTCCACG ACGACCGCGG CCGACTCGTG CCGCTGGAGA CCGCCCCGCA TATCGGCGCG
TTCGCCCCGC CGGGCACCGG CAAGACCCGC AAGTGGCTGG CGCAGTCGGC GGTGCTGTGG
CCGGGCCCAG CAGTGGTGTC GTCTTCCAAG GACGACCTCA TGCAGATGGT GGCCTCGCGG
CGCTACGGGC CTGCCGCGCT ACTGGATCTG CGGCCTATCG CTGCGCCGTA CTATCCGGCT
GACCTGCGTC CGTACCGGTT CGATCCCACC GCATGGATCA CCGGACTTGA GGATGCTAAG
GCCGTCGCCC GCACGTTGTT GAGCACCTCG GCGGTCGCGC TGTCGGGCAG CTCCTTTCGC
GGCAGCGACC CCGGCCCGTG GGATGACCTG GCGTTCGCCC CGCTGACCTG CCTACTGTTG
GCCGCGAGCC CGCAAGGGCT GGGGCTGGGC ATCGACTGGG TAATCGAGGC TGCCGAGGAC
GTGACCCTGC CCAACCCCTG GCCCACCTCG CCGCAGATGA GCACTGATGC CTCCTGGGTG
ACCGCGGCCG TCTGGTCGTC CAACCGCTAC TTCGAGGCCC GCGTGCGCAG CGTGTTGAGC
ATGGAGGCCA AGCAACGCGA CTCGGTCAAA ATCACCGTGA CCAAGGTCCT GACTGCCTGG
CTGCAGACGA TGGAACGCGA CAGAGGCCTG CCAGTCCTCG ATGAGGAGTT CCTGGAGGAC
CCCAACGCCA CCATCTATCT GCTGACACCC TTTGATGGCA CGGTCGCCGC GCAGGCGATC
ACTCTGATGG ATCAGCTGAT TAACCGTCAA CGCGTGAAAG TCGCTCAGTG GGACGAGTTT
TCCCGGTTGG GCATGTTTCT CGACGAGATC ACCAACACGC CTTTGCCGCG GCTGCCGCAA
TACATCGCCG AGTCCCGCGG CCTGGGCGTC TCGCTGTGTT TTGCCGCCCA GGCCGGCGAG
CAGCTCGACG CGATCTACGG CCCGCTGCAG GGGGCGGCGA TTCGCGCTGT CGTTCCGGCG
TCGCTGCTGA TGTACGGCTC CCATGAGAAG GACCTGATGG AGTCGGCGTC GTTCTGGAGC
GGCAAGACCA CCCGCAGCCA ACTGTCCTAC GACCACAGCC TGGATTCGAC GTCGACGAGC
CGCACCTTCG GCAATGCCCT TGAGCCTGAG GAACTTATGC CACCCAATGA CTCGTTCGCG
CGGCTGCTGA TCCGCGGCAC CCCCGGGCGC ATGGTCACAC TCATCGACTG GACCGAGTTC
GTGAAGTACC TCGACGAACT GCGGGCAGCG CGGCTCCACC ACAGCAGCGG CGGCCGCCAG
CGATTCAGCG GCCTCGACGC GGCGGCGACG GCATGA
 
Protein sequence
MSATPAASGG YRPPVTPFAG WVHDDRGRLV PLETAPHIGA FAPPGTGKTR KWLAQSAVLW 
PGPAVVSSSK DDLMQMVASR RYGPAALLDL RPIAAPYYPA DLRPYRFDPT AWITGLEDAK
AVARTLLSTS AVALSGSSFR GSDPGPWDDL AFAPLTCLLL AASPQGLGLG IDWVIEAAED
VTLPNPWPTS PQMSTDASWV TAAVWSSNRY FEARVRSVLS MEAKQRDSVK ITVTKVLTAW
LQTMERDRGL PVLDEEFLED PNATIYLLTP FDGTVAAQAI TLMDQLINRQ RVKVAQWDEF
SRLGMFLDEI TNTPLPRLPQ YIAESRGLGV SLCFAAQAGE QLDAIYGPLQ GAAIRAVVPA
SLLMYGSHEK DLMESASFWS GKTTRSQLSY DHSLDSTSTS RTFGNALEPE ELMPPNDSFA
RLLIRGTPGR MVTLIDWTEF VKYLDELRAA RLHHSSGGRQ RFSGLDAAAT A
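
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that grouping, compute the GC fraction, and translate the gene sequence is given below. It is not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the code expects only the unambiguous bases A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two display blocks of the gene sequence listed above.
print(translate("ATGAGTGCGA CACCGGCCGC"))
```

Passing the full gene sequence to `translate` and `gc_content` gives a way to compare the result against the listed protein sequence and GC content field.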