Gene Mmcs_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3573 
Symbol 
ID4112405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3804893 
End bp3806197 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content71% 
IMG OID638032708 
Productglycosyl transferase family protein 
Protein accessionYP_640736 
Protein GI108800539 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAG TACTGATCGC GTCCATGTCG CCGATAGGCC ACCTCGGGCC GTTGCTCAAC 
CTCGCCCGCG GCCTCGTCGA CCGCGGTGAC CGGGTCACGG TCCTGACCTC GGCCGCGCGC
GCCGGGATGA TCCGCGCGGC CGGGGCACGA CCCCGCGCCC TGCCGCCGCA GACCGACATC
GACGAGAGCC GGCTCAACGA AAGCCTCCCC GGGCGCGAGA AGACGTCCGG CATCAAACGG
GTCGACTTCG ACATCACCAA CGTCTTCGTG ACCCCGATGC CCCATCAGGC GGCGGCCCTG
GCCGAGGCGT TCGCCGAGAC ACGGTATGAC GCCGTCATCG TCGACGCGAT GTTCCTGGGC
ATCCTGCCGT TCCTGCTCGG TGAACACGCC GCCCGCCCAC CGGTGCTGGC CTACTCGACC
ACGCCGCTGT TGATCAGCAG CCGGGACACC GCTCCTCCGG GGTTGGGTCT GCCGCCGTCG
TCGAGCCCGC TCGGGCGGCT GCGCAACTCG GCGCTGACCA CGCTGACGCA CCGGGTCCTC
CTGCGAGGCT GCCACCGGGC CGCCGACGAG GCGCTGCACC GGATGAACAG CCGCCCGCTG
CCGATGTTCG TCACCGACGC CGCGTTGCTC GCCGACCGCT TCATCGCCCC TACCGTCCCC
GAATTCGACT ATCCGCGCGG CGATCTGCCG CCTCATGTGC GCTACGTGGG CGCCGTGCAT
CCCGCACGGA CGCAGACGTT CACCCCGCCC CCGTGGTGGG GGGCGCTCGA CGGCGAACGC
CCGGTGGTGC ACGTCACCCA GGGCACCGTC GACAACGCCG ACCCCCGGCG GCTACTGCTG
CCGACCGTCG AGGCGCTGGC CGGTGAGGAG GTCACCGTGG TGGTCACCAC CGGTGGCCGT
GGACTTTCCG TACCTCACAC CGCCCTGCCG ACGAATACCC ATGTGGCCGA ATTCATTCCG
CACGACGTGT TGCTTCCGAA GGTCGACGTG ATGGTCACCA ACGGCGGGTT CGGTGCGGTG
CAGCGCGCGC TGTCCCTCGG CGTGCCGCTC GTGGTCGCGG GCGACACCGA GGACAAGCCG
GAGGTCGCCG CGCGCGTCGC CTGGACCGGT GCCGGTGTCG ACCTGCGCAC CGGCACGCCG
ACTCCCGGTG CGATCCGCTC GGCGGTCCGC GACGTGCTCG ACCGCGCGCA CTACCGGGAG
AACGCCCGAC GGCTCGAGGT CGCCTTCACA CGCCGCGACG GGGTGGCCGA GATCGCCGCG
GTGATCGACG AAGTCCTCGC CGAGCGTCGT CAGACAGTGC GGTGA
 
Protein sequence
MPEVLIASMS PIGHLGPLLN LARGLVDRGD RVTVLTSAAR AGMIRAAGAR PRALPPQTDI 
DESRLNESLP GREKTSGIKR VDFDITNVFV TPMPHQAAAL AEAFAETRYD AVIVDAMFLG
ILPFLLGEHA ARPPVLAYST TPLLISSRDT APPGLGLPPS SSPLGRLRNS ALTTLTHRVL
LRGCHRAADE ALHRMNSRPL PMFVTDAALL ADRFIAPTVP EFDYPRGDLP PHVRYVGAVH
PARTQTFTPP PWWGALDGER PVVHVTQGTV DNADPRRLLL PTVEALAGEE VTVVVTTGGR
GLSVPHTALP TNTHVAEFIP HDVLLPKVDV MVTNGGFGAV QRALSLGVPL VVAGDTEDKP
EVAARVAWTG AGVDLRTGTP TPGAIRSAVR DVLDRAHYRE NARRLEVAFT RRDGVAEIAA
VIDEVLAERR QTVR