Gene Mkms_5565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5565 
Symbol 
ID4610347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp73062 
End bp74261 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content65% 
IMG OID639789229 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_935564 
Protein GI119854959 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.48027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCC CCCGCCTCGT CGAGTTGATC GACAGGTATC GGACCGCGCA CGGCGTGAGC 
GAATCCGAAG TGGCCCGCCG GATTGGCATG TCCCGCGAGA ACCTGCGCAA GTGGCGGATT
AACGGTGTCA GCCGCCTCCC GGACCGCGAG AATTTGGCGG CTGTCGCCCG TGTGATCGGC
AAGCCCTATC GCGAGGTCGT TTCTGCGGCC TTGTTCGACA CCGGATACCT CACTGACGAC
CAGGCGTCCA CACCGAGGCC TCACGATGAG GTCCTCCACG ACGCGATCAG CGTGCTTACC
GAGGCAACCC GGCTGACCAA TCAACCTATG CGGCAAGCGA CGTCAGGCCA ATGGGAGGTT
GATCCCGATC CCCGCGCCGC GCTGCGCATC GACTGGGCCG CCTTCGTAAC GCTGGCCCTG
GCGGGCACGG CGGCCAACGT TGGGAGCATC GAGGAAGCGC TGGCAGGCCG GCCAGGCTCG
TGGGAGGCCG AAATGGTGCG GCGCACATTA CAAGCCACGG TCTTTGACGA CAAAGACTTA
CTGCGCCATC GCACCGAGCC GGTCGTGGTC GATCTCTGGG TGGAGAGCAT TCTTGCCCAG
ATCGACGACG GCAGCGACGA CGCCTATGCC GACGCACAGC TCGAACTCGA CGCCCGCGCC
GATGCCATTC CTGAGCCCAC CGACCTGCCG CCAGGCCCCT TCTCCCCCAA CGACCCCCGC
ATCGCAGCCG TGGACTGGGT GAATGTGGAC GACAACGGCT ACCTGGTCAT CACCCACGAT
GGCTGGGCCG GCGACGCTGA CGACGTGGCC CTGCTGACCG AGCTCTCAGC CGAGGCCGAA
GCCTCCCGAA ATCCGACCCT CGGCGAGATC GCCTATGAGC AGGCCATGCA GTCGATCGCT
GCCCTGGTCG ACGCGCTCAA TGAGCAGCAG AAACGCGAAT ACACCGAGTA CGCGACCCGG
CTCACCGAGG CCATCCACGC GCAACTCGCG GCCCTAGAGC TGACGGTACC GCTGTCGATC
ACCATCACCC TTGCCCCCGA GACCTGGGCT CACGAGGACT TCGACAAGCA CGCACCGTCG
ACGTACTCAA GCAGCGCCAT CGAGGGCGCC ATCGAAAGCG CCGTCATGGA AACCCCCACC
CCTGCAGCCC TCCCCGGCAG TCCCCTCGAA CGGCTCGAAG CAAGTCGAAA AGGTCAGTAG
 
Protein sequence
MAIPRLVELI DRYRTAHGVS ESEVARRIGM SRENLRKWRI NGVSRLPDRE NLAAVARVIG 
KPYREVVSAA LFDTGYLTDD QASTPRPHDE VLHDAISVLT EATRLTNQPM RQATSGQWEV
DPDPRAALRI DWAAFVTLAL AGTAANVGSI EEALAGRPGS WEAEMVRRTL QATVFDDKDL
LRHRTEPVVV DLWVESILAQ IDDGSDDAYA DAQLELDARA DAIPEPTDLP PGPFSPNDPR
IAAVDWVNVD DNGYLVITHD GWAGDADDVA LLTELSAEAE ASRNPTLGEI AYEQAMQSIA
ALVDALNEQQ KREYTEYATR LTEAIHAQLA ALELTVPLSI TITLAPETWA HEDFDKHAPS
TYSSSAIEGA IESAVMETPT PAALPGSPLE RLEASRKGQ