Gene Mmcs_1004 details

Gene Information

Locus tag: Mmcs_1004
Symbol:
ID: 4109843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1108198
End bp: 1109610
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 72%
IMG OID: 638030127
Product: glycosyl transferase family protein
Protein accession: YP_638174
Protein GI: 108797977
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
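
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1413 bp) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1108198
END_BP = 1109610


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1413 470
```

Both values agree with the Gene Length and Protein Length fields of the record.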


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGGGC CAAGGCTGCC CGACGGCTTT GCTGTGCAGG TCGACCGCCG GGTCAAGGTG 
CTCGACGAGG GCGCTGCGCT GCTCGGCGGT TCGCCCACCC GGCTGCTGCG GCTGGCGCCG
GCCGCGCAGA CCATGCTCAG CGGTGGCCGC CTCGAGGTGC ACGACGCCAC GAGCGCACAG
TTGGCGCGCA CCCTGCTCGA CGCCACCGTC GCCCACCCAC GCCCGGCCAG CGGCCCGTCG
CACCGCGACG TGACGGTCGT TATTCCAGTG CGCGACAACA TCTCCGGTCT GCAACGCCTG
CTCGCGTCGC TGCGCGGCCT GCGCGTCATC GTCGTCGACG ACGGATCGGC GACGCCGATC
GAGTGCACCC ACATGTCCGG CGTGCACTGC GACGTCCGGG TGATCCGCCA CGACCGCAGC
AGGGGACCCG CGGCCGCCCG CAACACCGGC GCGGCGGCCT GCTCGACCGA CTTCGTGGCG
TTCCTCGACT CCGACGTGCT ACCGCGTCGC GGATGGCTCG AGGCGTTGCT GGGGCACTTC
TGCGACCCCG CGGTCGCGCT GGTCGCACCC CGCATCGTCG GGCTGCGCAC CGCCGACAAC
CCGGTGGCCC GGTACGAGGC GGTGCGCTCG TCGCTCGACC TCGGGCACCG GGAGGCGCCC
GTGGTGCCCT ACGGTCCGGT GTCCTACGTG CCGAGCGCCG CGATCATCTG CCGGCGGCGC
GCGCTCGACG AGGTAGGCGG GTTCGACGAG ACGATGCACT CCGGTGAGGA CGTCGACCTG
TGCTGGCGGC TGGTGGAGGC GGGAGCCCGA CTGCGCTACG AGCCGATCGC CCTGGTCGCC
CACGACCACC GGACCGACCT GGGGGAGTGG TTCCTCCGCA AGGCGTTCTA CGGCAAGTCC
GCGGCGCCGC TGGCGGTTCG GCATCCGGGC AAGACCGCGC CGCTGGTGAT CTCGGGCTGG
ACCCTGGTGG TCTGGGTGCT GATGGCGATG GGCAGCTGTA TCGGCTACCT GGCCTCGATG
CTCGCCGCGG CCCTGACCGC CCGGCGGGTG GCCAACTCGT TGAGCTCGGT ACGGACCGAA
CCGCGCCAGG TGGCCGCCAT CGCCGCGCAG GGACTGTGGT CGGCGGCGCT GCAACTGGCG
TCGGCGATCT GCCGGCACTA CTGGCCGATC GCCCTGCTGG CGGCGCTGGT GTCGCGCCGC
TGCCGGCAGG CGGTGTTGAT CGCCGCGGTG GTCGACGGCG TGGTCGACTG GGCCGCGCGG
CGCGGTAACA CCGACGACGA CACCAAACAG GTCGGGTTGC TGACCTATGT GCTGTTGCGC
CGGCTCGACG ACATCGCCTA CGGGCTCGGC CTGTGGACCG GGGTGGTGCG CGAACGGCAC
CTCGGGGCGC TCAAACCCCA GATCCGGACG TAA
 
Protein sequence
MTGPRLPDGF AVQVDRRVKV LDEGAALLGG SPTRLLRLAP AAQTMLSGGR LEVHDATSAQ 
LARTLLDATV AHPRPASGPS HRDVTVVIPV RDNISGLQRL LASLRGLRVI VVDDGSATPI
ECTHMSGVHC DVRVIRHDRS RGPAAARNTG AAACSTDFVA FLDSDVLPRR GWLEALLGHF
CDPAVALVAP RIVGLRTADN PVARYEAVRS SLDLGHREAP VVPYGPVSYV PSAAIICRRR
ALDEVGGFDE TMHSGEDVDL CWRLVEAGAR LRYEPIALVA HDHRTDLGEW FLRKAFYGKS
AAPLAVRHPG KTAPLVISGW TLVVWVLMAM GSCIGYLASM LAAALTARRV ANSLSSVRTE
PRQVAAIAAQ GLWSAALQLA SAICRHYWPI ALLAALVSRR CRQAVLIAAV VDGVVDWAAR
RGNTDDDTKQ VGLLTYVLLR RLDDIAYGLG LWTGVVRERH LGALKPQIRT
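
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation step, together with a GC-percentage helper, under these assumptions: the input is an unambiguous A/C/G/T sequence on the coding strand, and whitespace between the 10-base blocks can be discarded. All names (`translate_cds`, `gc_percent`, and so on) are hypothetical helpers, not functions from any library.

```python
# Codon table in the usual NCBI layout: bases ordered T, C, A, G.
# Table 11 assigns the same amino acids as the standard code and
# differs only in which codons may act as initiation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_block = "GTGACCGGGC CAAGGCTGCC CGACGGCTTT"
print(translate_cds(first_block))  # MTGPRLPDGF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` in the same way gives values that can be compared with the protein sequence and the GC content field of the record.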