Gene Mjls_3586 details


Gene Information

Locus tag: Mjls_3586
Symbol:
ID: 4879297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3782188
End bp: 3784542
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 69%
IMG OID: 640140892
Product: hypothetical protein
Protein accession: YP_001071854
Protein GI: 126436163
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
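
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3782188
END_BP = 3784542


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2355 784
```

Both results agree with the Gene Length and Protein Length fields listed above.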


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCAGTC AGTCATGTCG CGCGGGAGGT GCGGCGCTGG CGGTGGGTAT CGGCATGCTC 
CTCGCGCCAG GGATCGCGGC AGCAGACCCC TCCGCCGACG CGGCAGGCAC CGATGTCTCC
GCGCATGCAC CGGCCGACAC CCGGCAAGAC GACCACACCG AGAAAGCCGA CGAGGAAACG
GACGCCCCCG AAGACAACGC GGAGGACATC CCCGAGGACA GCGCGGAGGA CGAAGCGGAG
GACGAAGATA CCGACGCGAA AACCGGCCAC CGCGACGCGG ACGACGAAGA ACCCACCGAG
GATCCCGTCG ACGAACCCGT TGCGGACGAC CCCGTTGCGG ACAATGAAGA CGATCCAGAA
GAACCCGCCG AATGGCCCGC GCCCGTCGCG ACGCTCACCG ACACTGTCGG CGCCGGCGGC
ACGCCTGCCC CGGTCGAGTC ACCCGCGACG TGGGCCGTAC TCGCCTGGGC CCGCCGCCAA
CCGTTCAGCA CCACCACGGC CGCGCACACA TCCGCCAGGC ACACGACCGC GTCATCCTCG
ACGGCGACGC CGGCCACGAC CGTCGACGTC AGGGACTACG GCGCGGTCGG CGACGGCGTC
ACCGACGACT CGGCGGCGAT CAAGGCCGCC GAGGCCGCGC TGGCCTCCGG CCAGCGTCTC
CACTTCCCCG AGGGGAGTTA CCGGTTCGCC CAGCAGAACC CGAATGGCGG CGCGGCGGTC
CTACTCAAGG GTCTCTCCGA CGTCACGGTG GAGTTCGCAC CGTGCGCCCG GCTACTGATG
GACAACCTCG ACGCCGCCGG GCACGGCACC AGCCACGGCA TCCGCGTCGA GGGCGCGGCG
TCGAACGTGA CGATCCTCAA CGCCACGATC GAGTGGAAGA CCCGACCGTC TGCGCGCAGC
TTCGGCGACG GGTTCTCGAT CCTCGGGTGG GCGTCGAACA CCCCGCCCCC GCCGGGTTGG
ACCGGATCGA CCGGAACGGT CTCCAACGTG TCGCTCGTCA ACGCCACGGT GATCAACGCG
CCGCAGACCG GCGCGATCTT CATGGGCGCC TCCGACGTGA CCGTCACGAA CTTCACGGCG
ATCGGCACGC TGGCCGACGG GTTGCACTTC AACGCGAACC GCCGGGTGAC CGTGCACGGG
CTCCTCGCGC AGAACACCGG CGACGACGGC CTGGCGTTCG TCACCTACTA CGACCCGACC
CTGCCGTGGA CGTACGGGCC CGGCGACGGC CCGTTCAACC AGCCCGGCCT CGGCGAGTGG
AACAACGGCG GTTCGGTGGC GACGAACATC ACCGTGACGG GTGGGGCGGC CAGCGGGGTG
CGCGTCCAGG GTGGTTACGA CATCACGATC ACCGATGTCA CCGTGACCGG TAAGGAGTTC
GGCCTCCAGG TCAACTCCGC CAAGGCCACC GGTCCGGGCG ACTGGACGAG TCTGGCGTCG
CGCGACATCG CCGTCTCCGA TGTGACCATC AGCGATACCG TGACAGGAAT CGTCCTGGCC
ACCAACAACA TCGACGGCAC CGAGGCCTCC ATGTGGTGGG ACTTCTCGGG CCTGACGATC
AGCGACGTCA CCATCCACAA CTCCCGCAAC TGGTCGCTAG CCGTCGAGAC GCCGGCGAGC
ACCACGAGCA GATTCGCCGG CGTCACCCTG CGCAACATCC ATGCCGAAGT CGACGCGGAC
GTCGGCCCAC TCGGCGGCGG CAACGGCGGC ATCCTGCTTG CGTCGCTTCG GGATTCCGTG
ATCGACGGTG TGCGCCTGGT GTCGGTCCAC GGTAGCGACA TCAACGTCGT CGGCGCGGCT
CAGATCCGCA GTCAGTACAG CGTCGCCGAT CTGCCGTCGT CGAACCTGAC GATCGACGAT
CTGGTCCTCG AGGGCCCGGG TCGGATCCTG ATCCAGGACA TCGCCGGCCT GGACGTCGGG
GCTGTGGCGT CCCACGGCGC CAACAGCGCC GCCATCGAAC TCTTCCGCGT CAAGTCCGCC
TCGTTCGACA CCATCGGGGC GTACCTGCCC GGCCGCGGCA ACGGGGCGGG CTGGGGCGTA
CGGCTGCTGC AGGTCCACGA CCTCGACGTG GCGAACATCG AGGTGATCAC CGACGACCAC
ATCGGAACAT CCTGGTGGGC AGTGGAACTC GGCGGTGGCA ATCCTGCACA GGACATCGCC
GGTGCCGGTG TGCGCATCGA CAACATCACC TACGTCAGCG GTCGTGACGC CACGGACTCC
GACATCGTGG TCCAGGGTGG ACCGTACGGA CCGGTGGACT GGTACATCAA CGCAACCTGG
CTCCACGAGG GCGAGGCGTC GCCGCTGTGG CGCGCCGGTC TGTGGGGCGA CGCGATCCCC
CCGCTCACAT CCTGA
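
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a rough sketch with hypothetical helper names; it is shown on the first block only and has not been run here against the full sequence, for which the record reports 69%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten-base block of the gene sequence above.
first_block = "GTGGTCAGTC"
print(gc_fraction(first_block))
```

Pasting the complete gene sequence into a string and passing it to `gc_fraction` would give the value to compare with the GC content field.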
 
Protein sequence
MVSQSCRAGG AALAVGIGML LAPGIAAADP SADAAGTDVS AHAPADTRQD DHTEKADEET 
DAPEDNAEDI PEDSAEDEAE DEDTDAKTGH RDADDEEPTE DPVDEPVADD PVADNEDDPE
EPAEWPAPVA TLTDTVGAGG TPAPVESPAT WAVLAWARRQ PFSTTTAAHT SARHTTASSS
TATPATTVDV RDYGAVGDGV TDDSAAIKAA EAALASGQRL HFPEGSYRFA QQNPNGGAAV
LLKGLSDVTV EFAPCARLLM DNLDAAGHGT SHGIRVEGAA SNVTILNATI EWKTRPSARS
FGDGFSILGW ASNTPPPPGW TGSTGTVSNV SLVNATVINA PQTGAIFMGA SDVTVTNFTA
IGTLADGLHF NANRRVTVHG LLAQNTGDDG LAFVTYYDPT LPWTYGPGDG PFNQPGLGEW
NNGGSVATNI TVTGGAASGV RVQGGYDITI TDVTVTGKEF GLQVNSAKAT GPGDWTSLAS
RDIAVSDVTI SDTVTGIVLA TNNIDGTEAS MWWDFSGLTI SDVTIHNSRN WSLAVETPAS
TTSRFAGVTL RNIHAEVDAD VGPLGGGNGG ILLASLRDSV IDGVRLVSVH GSDINVVGAA
QIRSQYSVAD LPSSNLTIDD LVLEGPGRIL IQDIAGLDVG AVASHGANSA AIELFRVKSA
SFDTIGAYLP GRGNGAGWGV RLLQVHDLDV ANIEVITDDH IGTSWWAVEL GGGNPAQDIA
GAGVRIDNIT YVSGRDATDS DIVVQGGPYG PVDWYINATW LHEGEASPLW RAGLWGDAIP
PLTS
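
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this using only the standard library. It assumes the amino acid assignments of the standard genetic code (which table 11 shares) and, as a simplification, treats only ATG, GTG and TTG as start codons; `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGGTCAGTC AGTCATGTCG CGCGGGAGGT"))  # MVSQSCRAGG
```

The output matches the first ten residues of the protein sequence listed above.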