Gene Mmcs_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3581 
Symbol 
ID4112413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3814556 
End bp3816931 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content69% 
IMG OID638032718 
Producthypothetical protein 
Protein accessionYP_640744 
Protein GI108800547 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCAGTC AGTCATGTCG CGCGGGAGGT GCGGCGCTGG CGGTGGGTAT CGGCATGCTC 
CTCGCGCCAG GGATCGCGGC AGCAGACCCC TCCGCCGACG CGGCAGGCAC CGATGTCTCC
GCGCATGCAC CGGCCGACAC CCGGCAAGAC GACCACACCG AGAAAGCCGA CGAGGAAACG
GACGCCCCCG AAGACAACGC GGAGGACATC CCCGAGGACA GCGCGGAGGA CGAAGCGGAG
GACGAAGCGG CGCCGGTGGA GGACGAAGAT ACCGACGCGA AAACCGGCCA CCGCGACGCG
GACGACGAAG AACCCACCGA GGATCCCGTC GACGAACCCG TTGCGGACGA CCCCGTTGAA
GACAACGAAG TCGATCCAGA AGAACCCGCC GAATCGCCCG CGCCCGTCGC GACGCTCACC
GACACCGTCG GCGCCGGCGG CACGCCTGCC CCGGTCGAGT CACCCGCGAC GTGGGCCGTA
CTCGCCTGGG CCCGCCGCCA ACCGTTCAGC ACCACCACGG CCGCGAACAC ATCCGCCAGG
CACACGACCG CGTCATCCTC GACGGCGACG CCGGCCACGA CGGTCGACGT CAAGGACTAC
GGGGCGGTCG GTGACGGCGT CACCGACGAC TCGGCGGCGA TCAAGGCCGC CGAGGCCGCG
CTGGCCTCGG GTCAGCGCCT CTACTTCCCC GAGGGCAGTT ACCGGTTCGC CCAGCAGAAC
CCCGCCGGCA ACGCCGCGGT CCTGCTCAAG GGTCTCTCCG ACGTCACGGT GGAGTTCGCA
CCGCATGCCC GGCTGCTGAT GGACAACCTC GACGCCGCCG AGCACGGCAC CAGCCACGGC
ATCCGCGTCG AGGGCGCGGC GTCGAACGTG ACGATCCTCA ACGCCACGAT CGAGTGGAAG
ACCCGACCAT CCGCGCGCAG CTTCGGCGAC GGGTTCTCGA TCCTCGGGTG GGCGTCGAAC
ACCGCGCCCC CGCCGGGCTG GACCGGATCG ACCGGAACGG TCTCCAACGT GTCGCTCGTC
AACGCCACGG TGATCAACGC GCCGCAGACC GGCGCGATCT TCATGGGCGC CTCCGACGTG
ACCGTCACGA ACTTCACCGC GATCGGCACG CTGGCCGACG GGTTGCACTT CAACGCGAAC
CGCCGGGTGA CCGTGCACGG GCTCCTCGCG CAGAACACCG GCGACGACGG CCTGGCGTTC
GTCACCTACT ACGACCCGAC CCTGCCGTGG ACCTACGGGC CCGGCGACGG CCCGTTCAAC
CAGCCCGGCC TCGGCGAGTG GAACAACGGC GGTTCGGTGG CGACGAACAT CACCGTGACG
GGTGGGGCGG CCAGCGGGGT GCGCGTCCAG GGTGGTTATG ACATCACGAT CACCGATGTC
ACCGTGACCG GTAAGGAGTT CGGCCTCCAG GTCAACTCCG CCAAGGCCAC CGGTCCGGGC
GACTGGACGA GTCTGGCGTC GCGCGACATC TCCATCTCCG ACGTGACCAT CAGCGCTACC
GTGACAGGAA TCGTCCTGGC CACCAACAAC ATCGACGGCA CCGAGGCCTC CATGTGGTGG
GACTTCTCGG GCCTGACGAT CAGCGACGTC ACCATCCACA ACTCCCGCAA CTGGTCGCTC
GCCGTCGAGA CGCCGGCGAG CACCACGAGC AGATTCGCCG GCGTCACCCT GCGCAACATT
CATGCCGAAG TCGACGCGGA CGTCGGCCCA CTCGGCGGCG GCAACGGCGG CATCCTGCTT
GCGTCGCTTC GGGATTCCGT GATCGACGGT GTGCGCCTGG TGTCGGTCCA CGGTAGCGAC
ATCAACGTCG TCGGCGCGGC TCAGATCCGC AGTCAGTACA GCGTCGCCGA TCTGCCGTCG
TCGAACCTGA CGATCGACGA TCTGGTCCTC GAGGGCCCGG GTCGGATCCT GATCCAGGAC
ATCGCCGGCC TGGACGTCGG GACTGTGGCG TCCCACGGCG CCAACAGCGC CGCCATCGAA
CTCTTCCGCG TCAAGTCCGC CTCGTTCGAC ACCATCGGGG CGTACCTGCC CGGCCGCGGC
AACGGGGCGG GCTGGGGCGT ACGGCTGCTG CAGGTCCACG ACCTCGACGT GGCGAACATC
GAGGTGATCA CCGACGACCA CATCGGAACA TCCTGGTGGG CAGTCGAACT CGGCGGCGGC
AATCCTGCAC AGGACATCGC CGGCGCCGGT GTGCGCATCG ACGACGTCAC CTACGTCAGC
GGTCGTGACG CCACGGACTC CGACATCGTG GTCCAGGGTG GACCGTACGG ACCGGTCGAC
TGGTACATCA ACGCAACCTG GCGCCACGAG GGTGAGGCGT CGCCGCTGTG GCGCGCCGGT
CTGTGGGGCG ACGCGATCCC CTCGCTCACA TCCTGA
 
Protein sequence
MVSQSCRAGG AALAVGIGML LAPGIAAADP SADAAGTDVS AHAPADTRQD DHTEKADEET 
DAPEDNAEDI PEDSAEDEAE DEAAPVEDED TDAKTGHRDA DDEEPTEDPV DEPVADDPVE
DNEVDPEEPA ESPAPVATLT DTVGAGGTPA PVESPATWAV LAWARRQPFS TTTAANTSAR
HTTASSSTAT PATTVDVKDY GAVGDGVTDD SAAIKAAEAA LASGQRLYFP EGSYRFAQQN
PAGNAAVLLK GLSDVTVEFA PHARLLMDNL DAAEHGTSHG IRVEGAASNV TILNATIEWK
TRPSARSFGD GFSILGWASN TAPPPGWTGS TGTVSNVSLV NATVINAPQT GAIFMGASDV
TVTNFTAIGT LADGLHFNAN RRVTVHGLLA QNTGDDGLAF VTYYDPTLPW TYGPGDGPFN
QPGLGEWNNG GSVATNITVT GGAASGVRVQ GGYDITITDV TVTGKEFGLQ VNSAKATGPG
DWTSLASRDI SISDVTISAT VTGIVLATNN IDGTEASMWW DFSGLTISDV TIHNSRNWSL
AVETPASTTS RFAGVTLRNI HAEVDADVGP LGGGNGGILL ASLRDSVIDG VRLVSVHGSD
INVVGAAQIR SQYSVADLPS SNLTIDDLVL EGPGRILIQD IAGLDVGTVA SHGANSAAIE
LFRVKSASFD TIGAYLPGRG NGAGWGVRLL QVHDLDVANI EVITDDHIGT SWWAVELGGG
NPAQDIAGAG VRIDDVTYVS GRDATDSDIV VQGGPYGPVD WYINATWRHE GEASPLWRAG
LWGDAIPSLT S