Gene Mkms_5039 details


Gene Information

Locus tag: Mkms_5039
Symbol:
ID: 4612718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5279774
End bp: 5281015
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 70%
IMG OID: 639794732
Product: hypothetical protein
Protein accession: YP_941018
Protein GI: 119871066
COG category:
COG ID:
TIGRFAM ID:
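
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5279774
end_bp = 5281015
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1242 0 413
```

Under these assumptions the span matches the reported gene length, and 414 codons minus the stop codon give the reported 413 aa.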


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0487726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCAAAC ATCGCAAGGC GCGGCGCACA GCTGCCGCCC CCGCCTTCTT CGCCGGCGCC 
ACCGCCGCGA TCTCCACCGC GCTCGCCCTG GGCCACGCCA CCAACGCGAC CGCCGCCACG
ATTCCCACCG CTGACACGGT GATCGGTGTC GGCGGCTGGC GGAACCCGAC GAGCGACCGG
ATCCCGAACA AGTTCGAGGG CGAGTTGGTA CAGCCCGGGG AGACGTTCGT CGGGGTCCAG
TACCCGGCGG AACTGCCCGT GGACCCCAGC GTGGCCGCCG GTCAGAAACC GTTGGGCGAC
GCGGTCGACG CCGCCTCCGG TTCCGTGCTG ATCGTCGGCT ACTCGGAGGG TTCGCTGGTC
GCCGAGCGGT ACAAGCGCGG ACTCGTCGCG TCCGGTGCGC CGAACACCGA TGACGTCGCG
TTCATGTTCA TCGCCGCCCC GTTCGTCCCC AACGGCGGCG TCTACTCCCG ATTCCCCGGT
CTGCGGCTCC CCGGCTTCAC GAGCACCGGT GCCGCGGCGC CGTCACCTTA CGACGAGACG
TTCGTGACGC TCGAGTACGA CCTGATCGGC GACTTCCCGG CGTATGCGAA CCCGCTGTCA
CTCGCCAACG CCCTCGCGGG TCTGGTCTAC GTGCACGGCG ACCAGGGCCC CGACAACGTC
GACCTGGAGA CCGCCCCGAA GGCGGTGAAG GTGGTCACCA GCGAGGCGGG CGGCACCGAC
ACCTACATCC TGGTCCGTGC CGAACATCTG CCGCTGCTGC AGCCGATCCG CGATCTGGCG
GCCGCCACCG GCACCACCGT GGTGGTCGAA CCGTTCGTCG GCGCCGTCGA ACCGACGCTG
CGCCTGCTCG TCGACATGGG CTACACCGAC CGCGACTACC AGAACGCGGA CAAGCCGACG
CGGTTCTCGC TGATCACCCC GCCCGATCGG ATCATCGAGA CCGCCGCGGC CATGAAGAAG
GTCGCGCCGA AGGCCGTTGC GCCCGACGAG AAGCCGGCCG AGCAGTCCGA ACCCGAGCCG
CAGAAGGCCG ACGAACCCGA GAGCAAGCCT TCGGTGAAGC GGAACACGTT GCTGCGCAAT
GTCTTCGGCG GACCCAAGGG CAAGCACCGG GCCGACACCA CACCGGCAGC CGAACCGGAG
CCGTCACGCC AGGACTCCGC GACGAACGAC CCCGCCGACA ACGGCACCAC CGGTACCGAC
AGCGACACCA GCACCGACGA GAAGGACCAG CCCGCCGCCT GA
 
Protein sequence
MAKHRKARRT AAAPAFFAGA TAAISTALAL GHATNATAAT IPTADTVIGV GGWRNPTSDR 
IPNKFEGELV QPGETFVGVQ YPAELPVDPS VAAGQKPLGD AVDAASGSVL IVGYSEGSLV
AERYKRGLVA SGAPNTDDVA FMFIAAPFVP NGGVYSRFPG LRLPGFTSTG AAAPSPYDET
FVTLEYDLIG DFPAYANPLS LANALAGLVY VHGDQGPDNV DLETAPKAVK VVTSEAGGTD
TYILVRAEHL PLLQPIRDLA AATGTTVVVE PFVGAVEPTL RLLVDMGYTD RDYQNADKPT
RFSLITPPDR IIETAAAMKK VAPKAVAPDE KPAEQSEPEP QKADEPESKP SVKRNTLLRN
VFGGPKGKHR ADTTPAAEPE PSRQDSATND PADNGTTGTD SDTSTDEKDQ PAA
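
The gene sequence starts with TTG while the protein starts with M, which is consistent with translation table 11, where TTG can act as a start codon. The sketch below shows one way to reproduce the translation and to compute a GC percentage with the Python standard library only. The function names `translate_cds` and `gc_percent` are hypothetical helpers, and the codon table is built here by hand from the standard NCBI ordering rather than taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, then third base cycling T, C, A, G.
# Translation table 11 assigns the same amino acids as the standard table.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11; at the first position they are read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still translated as Met
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "TTGGCCAAAC ATCGCAAGGC GCGGCGCACA GCTGCCGCCC CCGCCTTCTT CGCCGGCGCC"
print(translate_cds(first_line))  # prints: MAKHRKARRTAAAPAFFAGA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.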