Gene Mmcs_0949 details

Gene Information

Locus tag: Mmcs_0949
Symbol:
ID: 4109789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1049867
End bp: 1051456
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 66%
IMG OID: 638030073
Product: PE-PPE-like protein
Protein accession: YP_638120
Protein GI: 108797923
COG category: [R] General function prediction only
COG ID: [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
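
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1049867
end_bp = 1051456
gene_length_bp = 1590
protein_length_aa = 529


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the reported gene length (1590 bp) and protein length (529 aa) agree with the start and end coordinates.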


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAAGT CTGCAAACGC AACGATCATC GTCATCGTCT CGCTCATCGC CACCGTCGGC 
CTCTGGCTCG CGTCGACCTT CGCGGCAGCG ATCGCCTTCG GCGCCATCGC GCTCATCGTG
CCGGGCACCG GAACGCACAA CGTCACCACC GACACGCAGT ACCGGGAGAA TGCGGCCAAC
CGCTACATCG ACCCGTCCGG CGTGCCGTGT ACGTCAGAGG ACGGCTGCGA CCTCCGAGGC
GTCGACTATC CCGCCAGCTT CTGGCCGATT CCGCTTCCAG GCTGGTGCCC CGGCCTGACG
TGCGACACCT GGAACAAGTC GGTGGGCGAA GGCGTCACCA ACCTCAACAC CGACCTCGCC
GGCATCCTGG CCGACCCGGC CAACGACGAC GAGGACATCA TCGTCTTCGG CTACTCCCAG
GGTGGCGCCG TCGTCTCCCG CGCCATGTAC GACATCGCCG AACTCGACGA GGAGACCCGG
GACCGCATCA CCGTCGTCAC CATCGGCAAC ATCAACAACC CCCAGGGGCT GTGGTCGCGA
CTGAGCTTCC TGCGCTACAT CCCGCTCCTC GACGTGTCCT TCGGGCCGCA ACTGCCCACC
GACATCGGCG TCAAGAGCAC CAACTACTCC TTCGAGTACG ACCCCGTCGG CGACGCCCCG
CAGTACTGGG GCAACCCCTT GGCGATGCTC AACGCGCTCA TGGCATTCGA GTACGTCCAC
GGTTACTACC TCGACCCCAA CAGCAACGGG CCCAACGACT CGATGCCCTA CGGCTACGAC
AACGTCAGCC TCGCCGCCGC GATCGCGGCC GCGCCCAAGC GCGTCCACGG TGACGCCACC
TTCGTGCTCA TCCCGCAGCG CGGCACACTC CCGATCTTCA TGCCGCTCGT CGATCTCGGA
AACGCCACCG GCACATCCGC ATTCATCGAA CCCGTCATCC GGCTGCTGCA GCCGGTGACC
AAGCTGCTGA TCGACCTGGG CTACGACCGC ACCACCAACC CGGGCATTGC GCGGAACCTG
TCGATCCTGC CGTTCAACCC CTTCGCCTTC AATCCCGTCC AGTTCTCCGT GGACTTCGTC
GAAGCCATCG TCCAGGGCAT CGAAGACGCC TTCAACGGCG GCAGCATGAT CGCGGTGCCG
GTCCCGACAC CGTCGGAATC TGAAACCAAC GACGTGCTTG CCGCGGGCAA GGCGCTGGGC
CGGCTGGCCG CGGATGAAGC GGAGGAAGGC CTCACCCCGG TCGTCGAGCA GGTCTCGACC
GGCAGCGAGG AGTCGGTCGA AGACCCCTCC GAAAGCGACA TCGCGGGCAC CCCGCCGATC
GCCGACGAAG CCGCGACCAA CGACGAGCAG GAACAGATCG AGGCCCCGAA GGACGAAGAG
GCCCAGGAAG TAGCCACCAA GGAAGAGGAC CTCGACGAGA CCGAGAACCT GCAGGAGGAC
CTCGGGGAAG AGGAACTCCC GCAGGAGGAC GTTGAGAGCG AGCAAGAGGA CTCCGCCGAG
GAGACCGAAG AAGCCGAGGA CACCGACGAC GTCGAGAACG CCGAGGCCGA GGCCGAGACC
GCAGCTCCCG AAGAGAAGGC GGCTGCCTGA
 
Protein sequence
MRKSANATII VIVSLIATVG LWLASTFAAA IAFGAIALIV PGTGTHNVTT DTQYRENAAN 
RYIDPSGVPC TSEDGCDLRG VDYPASFWPI PLPGWCPGLT CDTWNKSVGE GVTNLNTDLA
GILADPANDD EDIIVFGYSQ GGAVVSRAMY DIAELDEETR DRITVVTIGN INNPQGLWSR
LSFLRYIPLL DVSFGPQLPT DIGVKSTNYS FEYDPVGDAP QYWGNPLAML NALMAFEYVH
GYYLDPNSNG PNDSMPYGYD NVSLAAAIAA APKRVHGDAT FVLIPQRGTL PIFMPLVDLG
NATGTSAFIE PVIRLLQPVT KLLIDLGYDR TTNPGIARNL SILPFNPFAF NPVQFSVDFV
EAIVQGIEDA FNGGSMIAVP VPTPSESETN DVLAAGKALG RLAADEAEEG LTPVVEQVST
GSEESVEDPS ESDIAGTPPI ADEAATNDEQ EQIEAPKDEE AQEVATKEED LDETENLQED
LGEEELPQED VESEQEDSAE ETEEAEDTDD VENAEAEAET AAPEEKAAA
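
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below shows one possible way to strip the blocking, compute a GC fraction, and translate DNA into protein. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the two tables differ only in which codons may act as start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order.
# Assumption: table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGCGTAAGT CTGCAAACGC AACGATCATC GTCATCGTCT CGCTCATCGC CACCGTCGGC"
last_row = "GCAGCTCCCG AAGAGAAGGC GGCTGCCTGA"

print(translate(first_row))  # MRKSANATIIVIVSLIATVG
print(translate(last_row))   # AAPEEKAAA
```

The two translated fragments match the start and end of the listed protein sequence. The last row can be translated on its own because the rows before it total 1560 bases, a multiple of 3, so the reading frame is preserved. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.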