Gene Mkms_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3654 
Symbol 
ID4611586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3849038 
End bp3851413 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content69% 
IMG OID639793332 
Producthypothetical protein 
Protein accessionYP_939638 
Protein GI119869686 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCAGTC AGTCATGTCG CGCGGGAGGT GCGGCGCTGG CGGTGGGTAT CGGCATGCTC 
CTCGCGCCAG GGATCGCGGC AGCAGACCCC TCCGCCGACG CGGCAGGCAC CGATGTCTCC
GCGCATGCAC CGGCCGACAC CCGGCAAGAC GACCACACCG AGAAAGCCGA CGAGGAAACG
GACGCCCCCG AAGACAACGC GGAGGACATC CCCGAGGACA GCGCGGAGGA CGAAGCGGAG
GACGAAGCGG CGCCGGTGGA GGACGAAGAT ACCGACGCGA AAACCGGCCA CCGCGACGCG
GACGACGAAG AACCCACCGA GGATCCCGTC GACGAACCCG TTGCGGACGA CCCCGTTGAA
GACAACGAAG TCGATCCAGA AGAACCCGCC GAATCGCCCG CGCCCGTCGC GACGCTCACC
GACACCGTCG GCGCCGGCGG CACGCCTGCC CCGGTCGAGT CACCCGCGAC GTGGGCCGTA
CTCGCCTGGG CCCGCCGCCA ACCGTTCAGC ACCACCACGG CCGCGAACAC ATCCGCCAGG
CACACGACCG CGTCATCCTC GACGGCGACG CCGGCCACGA CGGTCGACGT CAAGGACTAC
GGGGCGGTCG GTGACGGCGT CACCGACGAC TCGGCGGCGA TCAAGGCCGC CGAGGCCGCG
CTGGCCTCGG GTCAGCGCCT CTACTTCCCC GAGGGCAGTT ACCGGTTCGC CCAGCAGAAC
CCCGCCGGCA ACGCCGCGGT CCTGCTCAAG GGTCTCTCCG ACGTCACGGT GGAGTTCGCA
CCGCATGCCC GGCTGCTGAT GGACAACCTC GACGCCGCCG AGCACGGCAC CAGCCACGGC
ATCCGCGTCG AGGGCGCGGC GTCGAACGTG ACGATCCTCA ACGCCACGAT CGAGTGGAAG
ACCCGACCAT CCGCGCGCAG CTTCGGCGAC GGGTTCTCGA TCCTCGGGTG GGCGTCGAAC
ACCGCGCCCC CGCCGGGCTG GACCGGATCG ACCGGAACGG TCTCCAACGT GTCGCTCGTC
AACGCCACGG TGATCAACGC GCCGCAGACC GGCGCGATCT TCATGGGCGC CTCCGACGTG
ACCGTCACGA ACTTCACCGC GATCGGCACG CTGGCCGACG GGTTGCACTT CAACGCGAAC
CGCCGGGTGA CCGTGCACGG GCTCCTCGCG CAGAACACCG GCGACGACGG CCTGGCGTTC
GTCACCTACT ACGACCCGAC CCTGCCGTGG ACCTACGGGC CCGGCGACGG CCCGTTCAAC
CAGCCCGGCC TCGGCGAGTG GAACAACGGC GGTTCGGTGG CGACGAACAT CACCGTGACG
GGTGGGGCGG CCAGCGGGGT GCGCGTCCAG GGTGGTTATG ACATCACGAT CACCGATGTC
ACCGTGACCG GTAAGGAGTT CGGCCTCCAG GTCAACTCCG CCAAGGCCAC CGGTCCGGGC
GACTGGACGA GTCTGGCGTC GCGCGACATC TCCATCTCCG ACGTGACCAT CAGCGCTACC
GTGACAGGAA TCGTCCTGGC CACCAACAAC ATCGACGGCA CCGAGGCCTC CATGTGGTGG
GACTTCTCGG GCCTGACGAT CAGCGACGTC ACCATCCACA ACTCCCGCAA CTGGTCGCTC
GCCGTCGAGA CGCCGGCGAG CACCACGAGC AGATTCGCCG GCGTCACCCT GCGCAACATT
CATGCCGAAG TCGACGCGGA CGTCGGCCCA CTCGGCGGCG GCAACGGCGG CATCCTGCTT
GCGTCGCTTC GGGATTCCGT GATCGACGGT GTGCGCCTGG TGTCGGTCCA CGGTAGCGAC
ATCAACGTCG TCGGCGCGGC TCAGATCCGC AGTCAGTACA GCGTCGCCGA TCTGCCGTCG
TCGAACCTGA CGATCGACGA TCTGGTCCTC GAGGGCCCGG GTCGGATCCT GATCCAGGAC
ATCGCCGGCC TGGACGTCGG GACTGTGGCG TCCCACGGCG CCAACAGCGC CGCCATCGAA
CTCTTCCGCG TCAAGTCCGC CTCGTTCGAC ACCATCGGGG CGTACCTGCC CGGCCGCGGC
AACGGGGCGG GCTGGGGCGT ACGGCTGCTG CAGGTCCACG ACCTCGACGT GGCGAACATC
GAGGTGATCA CCGACGACCA CATCGGAACA TCCTGGTGGG CAGTCGAACT CGGCGGCGGC
AATCCTGCAC AGGACATCGC CGGCGCCGGT GTGCGCATCG ACGACGTCAC CTACGTCAGC
GGTCGTGACG CCACGGACTC CGACATCGTG GTCCAGGGTG GACCGTACGG ACCGGTCGAC
TGGTACATCA ACGCAACCTG GCGCCACGAG GGTGAGGCGT CGCCGCTGTG GCGCGCCGGT
CTGTGGGGCG ACGCGATCCC CTCGCTCACA TCCTGA
 
Protein sequence
MVSQSCRAGG AALAVGIGML LAPGIAAADP SADAAGTDVS AHAPADTRQD DHTEKADEET 
DAPEDNAEDI PEDSAEDEAE DEAAPVEDED TDAKTGHRDA DDEEPTEDPV DEPVADDPVE
DNEVDPEEPA ESPAPVATLT DTVGAGGTPA PVESPATWAV LAWARRQPFS TTTAANTSAR
HTTASSSTAT PATTVDVKDY GAVGDGVTDD SAAIKAAEAA LASGQRLYFP EGSYRFAQQN
PAGNAAVLLK GLSDVTVEFA PHARLLMDNL DAAEHGTSHG IRVEGAASNV TILNATIEWK
TRPSARSFGD GFSILGWASN TAPPPGWTGS TGTVSNVSLV NATVINAPQT GAIFMGASDV
TVTNFTAIGT LADGLHFNAN RRVTVHGLLA QNTGDDGLAF VTYYDPTLPW TYGPGDGPFN
QPGLGEWNNG GSVATNITVT GGAASGVRVQ GGYDITITDV TVTGKEFGLQ VNSAKATGPG
DWTSLASRDI SISDVTISAT VTGIVLATNN IDGTEASMWW DFSGLTISDV TIHNSRNWSL
AVETPASTTS RFAGVTLRNI HAEVDADVGP LGGGNGGILL ASLRDSVIDG VRLVSVHGSD
INVVGAAQIR SQYSVADLPS SNLTIDDLVL EGPGRILIQD IAGLDVGTVA SHGANSAAIE
LFRVKSASFD TIGAYLPGRG NGAGWGVRLL QVHDLDVANI EVITDDHIGT SWWAVELGGG
NPAQDIAGAG VRIDDVTYVS GRDATDSDIV VQGGPYGPVD WYINATWRHE GEASPLWRAG
LWGDAIPSLT S