Gene Mkms_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2454 
Symbol 
ID4616014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2574171 
End bp2577011 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content68% 
IMG OID639792122 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_938441 
Protein GI119868489 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.302733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.778298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGGG TCCGCCGGAC GCCGATACTG GAACGCATGG CTGACCTCCC AGAAGCGCTC 
GAACCGATCG GCTCGGTGAC CCGGACCGAG GTGGGCCGCG AAGCCAGTGA GCCGATGCGC
GAAGACATCC GGTTGCTGGG CGCCATCCTG GGTGACACGG TCCGCGAGCA GAACGGTGAG
GACGTGTTCG ACCTCGTCGA ACGCGCCCGC GTGGAGTCCT TCCGGGTCCG CCGCTCCGAG
ATCGACCGGG CCGAACTCGC CGGACTGTTC GACGGTATCG GGGTGCGCGC CGCGATACCC
GTCATCCGGG CGTTCACCCA TTTCGCACTG CTGGCCAACG TGGCCGAGGA CATCCACCGC
GAGCGCCGCC GTGCCGTGCA CGTGGCCGCC GGTGAACCGC CGCAGGACAG CAGCCTGGCA
GCGACGTACG CCAAACTCGA TTCCGCACAA CTGGATTCCG ACGAGGTGTC GGCCGCGCTG
AGCGGTGCGC TGGTGTCGCC GGTGATCACC GCCCACCCCA CCGAGACCCG CAGGCGCACG
GTGTTCGACA CCCAGCACCG CATCACCGAG CTCATGCGGC TGCGGGCACA CGGTCACGAG
ACCACCGACG ACGGGCGCGA CATCGAGCTG GAATTGCGCA GGCACATCCT CACGCTGTGG
CAGACGGCGT TGATCAGGCT GTCGCGCTTG AAGATCCAGG ACGAGATCGA GACCGGGCTG
CGCTACTACC GCGCCGCGTT CTTCGAGGTC ATCCCACGCG TCAACGCCGA GGTCCGCTCC
GCACTGCAGA CCCGGTGGCC TGACGCGGAC CTGCTGGCAG AACCGATCCT GAGGCCCGGC
TCCTGGATCG GCGGCGACCG TGACGGCAAT CCCAACGTCA CCGCGGAGGT GGTGCGGCAG
GCCACTTCCC GCGCCGCCTA CACGGCCTTC GAGCACTATT TCGCCGAGCT GACCGGGCTG
GAGCAGGAAC TGTCGATGTC CGCGCGGTTG GTGCACGTGA GCGACGAACT CGCCGCACTG
GCCGATGCCT GTCACGAGGC CGCCCGGGCC GACGAACCGT ACCGGCGCGC GCTGCGGGTG
ATCCACGGCA GGCTCACCGC CACCGCCCTG GAGATTCTCG ACAACCAGCC CGAACACGAA
CTCGACCTCG GCCTGGAGCG CTACGCCACG CCGGCCGAGC TGCTCGCCGA CCTCGACGTC
ATCGATGCGT CGCTGCGGGG CCACGGCAGC GCCGTACTCG CCGACGACCG GTTGCTACGC
CTGCGAGAAG CGGTGCGAGT CTTCGGTTTC CACCTGTCCG GCCTGGACAT GCGGCAGAAC
TCCGACGTCC ACGAGGAGGT CGTGGCGGAA CTGCTGGCGT GGGCGGGTGT GCACGACGAC
TACGCCTCGC TGCCAGAGTC GGACCGTATC GACCTGCTGG TCTCTGAGCT GGCCACCCGT
CGCCCGCTCA CCTCCCAGAA GGCCGAACTG TCGGAACTGG CACGAAAAGA ACTCGACATC
GTCCGGGCCG GGGCGCGTGC GGTGCGGGTG TTCGGCCCGC AGGCGGTGCC CAACTACATC
ATCTCGATGT GTGAGTCGGT GTCGGACATG CTCGAAGCGG CCATCCTGCT CAAAGAGGCC
GGCCTGCTCG ACGTCTCGGA TGACGAACCG TACGCCCCGG TCGGCATCGT CCCGCTGTTC
GAGACGATCG ACGACCTACA GCGCGGATCG TCGATCCTCG AAGCGGCCCT TGACCTTCCG
CTGTACCGCG GCATGGTCAC CGCACGCGGT GACAGCCAGG AGGTCATGCT CGGCTACTCC
GACTCCAACA AGGACGGCGG CTATCTCGCC GCGAACTGGG CGCTCTACCG GGCCGAACTC
GACCTGGTCG AATCGTCCCG CAAGACCGGT ATCCGGTTGC GGCTCTTCCA CGGACGCGGC
GGCACCGTCG GCCGCGGTGG TGGACCGAGC TACGACGCGA TCCTCGCGCA ACCGCCAGGC
GCGGTGAACG GTTCGCTGCG CATCACCGAA CAGGGTGAGG TGATCGCCGC CAAGTACGCC
GAACCGCGGA TCGCGCACCG CAACCTCGAG ACGCTGGTGG CCGCGACGCT GGAATCGACG
CTGCTCGACG TCGAGGGGTT GGGGGACGAG GCCGGGCCGG CTTACGAGGT ACTCGACGAT
CTCGCGGCCA GGGCGCAGCG GGCATACGCC GAACTGGTCC ACGAGACACC GGGATTCGTC
GACTACTTCA AGGCGTCCAC CCCGGTGAGC GAGATCGGGG CGCTCAACAT CGGCAGCAGG
CCGGCCTCAC GCAAGCCGAC GACGTCCATC TCCGATCTGC GGGCCATCCC GTGGGTGCTG
TCGTGGAGCC AGTCCCGGGT CATGCTGCCC GGCTGGTACG GCACCGGCAG CGCCTTCGAG
CAGTACATCG CCGAAGGGGC GGCCGAATCC GAGGACCGGC TGGGGGTGCT GCAGGACCTC
TACCGGCGGT GGCCGTTCTT CCGCACCGTG CTGTCCAACA TGGCGCAGGT GCTGGCGAAG
TCGGATCTCG GTCTGGCCGC GCGGTATTCG GAGCTCGTCG AGGACGAGGA CCTGCGGCGC
CGTGTATTCG ACAAGATCGC CGACGAACAC GAGCGCACCA TCCGGATGCA CAGGCTGATC
ACCGGTCAGG ACGATCTGCT GGCCGACAAC CCGGCGCTGG CCCGCTCGGT GTTCAACCGG
TTCCCGTATC TCGAACCGCT CAACCATCTA CAGGTCGAAC TCCTGCGCCG GTACCGGTCG
GGGGAGGACG ACGAACTGGT GCAGCGCGGG ATCCTGCTCA CGATGAGCGG GCTCGCGACG
GCGCTGCGCA ACAGCGGTTA G
 
Protein sequence
MRRVRRTPIL ERMADLPEAL EPIGSVTRTE VGREASEPMR EDIRLLGAIL GDTVREQNGE 
DVFDLVERAR VESFRVRRSE IDRAELAGLF DGIGVRAAIP VIRAFTHFAL LANVAEDIHR
ERRRAVHVAA GEPPQDSSLA ATYAKLDSAQ LDSDEVSAAL SGALVSPVIT AHPTETRRRT
VFDTQHRITE LMRLRAHGHE TTDDGRDIEL ELRRHILTLW QTALIRLSRL KIQDEIETGL
RYYRAAFFEV IPRVNAEVRS ALQTRWPDAD LLAEPILRPG SWIGGDRDGN PNVTAEVVRQ
ATSRAAYTAF EHYFAELTGL EQELSMSARL VHVSDELAAL ADACHEAARA DEPYRRALRV
IHGRLTATAL EILDNQPEHE LDLGLERYAT PAELLADLDV IDASLRGHGS AVLADDRLLR
LREAVRVFGF HLSGLDMRQN SDVHEEVVAE LLAWAGVHDD YASLPESDRI DLLVSELATR
RPLTSQKAEL SELARKELDI VRAGARAVRV FGPQAVPNYI ISMCESVSDM LEAAILLKEA
GLLDVSDDEP YAPVGIVPLF ETIDDLQRGS SILEAALDLP LYRGMVTARG DSQEVMLGYS
DSNKDGGYLA ANWALYRAEL DLVESSRKTG IRLRLFHGRG GTVGRGGGPS YDAILAQPPG
AVNGSLRITE QGEVIAAKYA EPRIAHRNLE TLVAATLEST LLDVEGLGDE AGPAYEVLDD
LAARAQRAYA ELVHETPGFV DYFKASTPVS EIGALNIGSR PASRKPTTSI SDLRAIPWVL
SWSQSRVMLP GWYGTGSAFE QYIAEGAAES EDRLGVLQDL YRRWPFFRTV LSNMAQVLAK
SDLGLAARYS ELVEDEDLRR RVFDKIADEH ERTIRMHRLI TGQDDLLADN PALARSVFNR
FPYLEPLNHL QVELLRRYRS GEDDELVQRG ILLTMSGLAT ALRNSG