Gene Mkms_4571 details


Gene Information

Locus tag: Mkms_4571
Symbol:
ID: 4612515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4799848
End bp: 4802142
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 73%
IMG OID: 639794258
Product: hypothetical protein
Protein accession: YP_940552
Protein GI: 119870600
COG category:
COG ID:
TIGRFAM ID:
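
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4799848, 4802142)
print(length)                     # 2295
print(protein_length_aa(length))  # 764
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields above.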


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0747645
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACAGCAC ACACTCCGCC CACGGGCAGG AGCGAAGCGA CTCGGGGGAC CACCGGTGTC 
CCTCTGGGCG CCTGGTTGGC CCAACTACCC GACGAGCGGC TGATCCGGCT GCTCGAGTTG
CGCCCTGACC TCACCCAGCC GCCGCCGGGC ACCATCGCCG CGCTGGCCGC ACGGGCGACG
TCACGACAGT CGGTCAAGGC CGCCACCGAC GGCCTCGATT TCCTGCGGTT GGCCGTGCTC
GATGCGCTGC TGGTGCTGCA CGCCGACACC ACCGCGGTGC CGTTGACCAA GTTGTTCGAG
CTGATCGGGG GGCGCGCGGA CGAGGGTGCG ATCGTCGTCG CGGTCGACGA TCTGCGCGCG
CGGGCTTTGG TGTGGGGTGA CGACGACCAG GTGCGCGTCG CGGCCGAGGC GGCGTCCGGG
CTGCCGTGGT ATCCGGGTCA GGCGGTCGTG GAGACGGCCG AGCACGGCGC CGACGACATC
GCGGAGAAGC TGGCCGGACT CGACACCGCA CAGCGTGAGC TGCTCGAGCG GCTCCTCGAG
GGGTCCCCGG TCGGCCGCAC GCGGGATGCG GCGCCCGGAA CGCCGGCCGA CCGTCCGGTG
CAGCGCCTGC TGGCGGCGGG GCTGCTGCGC CAGGTCGACG ACGACACCGT GATCCTGCCC
CGTCTCGTCG GCCAGGTGCT GCGCGGCGAG GCGCCCGGAC CGACGGAGCT GAACCCACCC
GATCCCGTCA CGACCTCGAC GAAACCGTCC GACGTCGACG CCGTCGCGGC CGGCGCGGCG
ATCGACGTGC TGCGTGAGGT CGACGTGGTG CTCGAGGCGC TCTCGGCGGC CCCGGTGCCG
GAACTGCGCA GCGGCGGCCT CGGGGTGCGC GACCTCAAAC GCCTCGTGAA GGCCACGGGG
ATCGACGAGC GACGGCTGGG GCTGATCCTG GAGGTCGCGT TGGGGGCGGG CCTCATCGCG
GCCGGGATGC CCGAACCGGA TCCGGGCGAC GGGACCAGCA CGTTCTGGGC GCCGACGGTG
GCGGCCGATC GGTTCATCGA GTCACCGACT GCGGTGCGCT GGCACCTTCT GGCCTCGACG
TGGCTCGACC TGCCGGCCAG GCCGGGGCTC ACCGGCAGCC GGGGACCCGA CGGCAAACCG
TATGCGTCGC TGTCGGATGC GCTGTACTCG ACGGCGGCTC CGCTGGACCG CCGGCTGCTG
CTGGCGGTGC TGGCCGACCT GCCCGCAGGT TCGGGGGTCG ACGCGGCGTC GGCCTCTCGG
GCGATGATCT GGCGCAGGCC GCGCTGGGCG GTCCGGCTGC AGCCCGAACC GGTCGGCGGT
CTGCTCACCG AGGCGCACGC ACTCGGCATG GTGGGCCGCG GCGCGATCGC GACACCCACC
CGCAGGCTGC TCGCCGGTGA ACCGCCCGAG GACGTCGTGG CGGCCAAGGC CAAGGTGCTG
CCCGCCCCCA TCGACCATTT CCTGGTTCAG GCCGACCTGA CCGTCGTCGT CCCCGGCCCG
CTCGAACGCG ACCTCGCCGA GCAGCTGGCG GCCGTCGCGG CGGTGGAGTC CGCGGGCGCG
GCGATGGTGT ACCGGGTCAG TGAGGCGTCG ATCCGCCGTG CGCTCGACAC CGGCAAGACC
GCCAGCGAAT TGCACTCGTT CTTCGGGCGG CATTCGAAAA CCCCTGTGCC GCAGGGGTTG
ACATATCTGA TCGACGACGT CGCGCGCCGT CACGGCCAGC TCCGGGTCGG TATGGCGGCG
TCGTTCGTGC GGTGCGAGGA TCCGGCGCTG CTGGCCCAGG CCGTCGCCGC ACCGGCCACC
GGCGCGGTGG AACTGCGGTT GTTGGCGCCG ACGGTGGCGG TGTCGCAGGC GCCGATCGCC
GACGTGCTCG CCGCGCTGCG CAACGCCGGG CTCGCCCCGG CGGCCGAGGA TTCGTCCGGC
GCGATCGTCG ACATCCGCTC CCGCGGTGCC CGGGTGCCGG CACCGGGCCG GCGACGGGTC
TTCCGCCCCG CGCCCACCCC GACCGGCCAG ACGCTCGGTG CGATCGTCGC GGTGCTGCGC
AAGGTCGCCG CCGCGCCGTC CGGGAACATG CGGCTCGATC CGGGCGTTGC GATAACGCAG
TTGCAGGAAG CGGCGCTACA GCAGACTTCG GTGGTGATCG GCTACGTGGA CCCGGCCGGG
GTGGCCACGC AGCGGGTGGT GGCCCCCGTC AACGTCCGCG GCGGCCAGTT GACCGCCTAC
GATCCGGCAT CCGGGCGGGT GCGCGAATTC GCGATTCACC GCGTTACCTC GGTGGTGTCG
GCCGAGAACG AATAA
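
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before it can be analysed. The following is a minimal sketch of that clean-up together with a GC fraction calculation; the helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed above.
first_line = "ATGACAGCAC ACACTCCGCC CACGGGCAGG AGCGAAGCGA CTCGGGGGAC CACCGGTGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field in the Gene Information section.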
 
Protein sequence
MTAHTPPTGR SEATRGTTGV PLGAWLAQLP DERLIRLLEL RPDLTQPPPG TIAALAARAT 
SRQSVKAATD GLDFLRLAVL DALLVLHADT TAVPLTKLFE LIGGRADEGA IVVAVDDLRA
RALVWGDDDQ VRVAAEAASG LPWYPGQAVV ETAEHGADDI AEKLAGLDTA QRELLERLLE
GSPVGRTRDA APGTPADRPV QRLLAAGLLR QVDDDTVILP RLVGQVLRGE APGPTELNPP
DPVTTSTKPS DVDAVAAGAA IDVLREVDVV LEALSAAPVP ELRSGGLGVR DLKRLVKATG
IDERRLGLIL EVALGAGLIA AGMPEPDPGD GTSTFWAPTV AADRFIESPT AVRWHLLAST
WLDLPARPGL TGSRGPDGKP YASLSDALYS TAAPLDRRLL LAVLADLPAG SGVDAASASR
AMIWRRPRWA VRLQPEPVGG LLTEAHALGM VGRGAIATPT RRLLAGEPPE DVVAAKAKVL
PAPIDHFLVQ ADLTVVVPGP LERDLAEQLA AVAAVESAGA AMVYRVSEAS IRRALDTGKT
ASELHSFFGR HSKTPVPQGL TYLIDDVARR HGQLRVGMAA SFVRCEDPAL LAQAVAAPAT
GAVELRLLAP TVAVSQAPIA DVLAALRNAG LAPAAEDSSG AIVDIRSRGA RVPAPGRRRV
FRPAPTPTGQ TLGAIVAVLR KVAAAPSGNM RLDPGVAITQ LQEAALQQTS VVIGYVDPAG
VATQRVVAPV NVRGGQLTAY DPASGRVREF AIHRVTSVVS AENE