Gene Mmcs_3115 details

Gene Information

Locus tag: Mmcs_3115
Symbol:
ID: 4111947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3298419
End bp: 3299684
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 69%
IMG OID: 638032245
Product: glycosyl transferase family protein
Protein accession: YP_640278
Protein GI: 108800081
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
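
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates as listed in the record above.
START_BP = 3298419
END_BP = 3299684


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1266 421
```

Both results agree with the Gene Length and Protein Length fields in the record.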


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.448802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAATTCG TGGTGGCCGT CCACGGCACC CGTGGCGATG TCGAACCCTG TGCGGCCGTC 
GGGCTCGAAC TCGCCCGGCG CGGGCACGAG GTGCGGACTG CCGTGCCGCC CAACCTCATA
TCCTTCGTCG AGGAATGCGG ACTCGGTGTC CCGGTGTCCT ACGGCGTCGA TTCGCAGCAA
CAACTCGACG CGGACATCTT CCGCGAGTGG TACCGGCTGC GGAACCCGAT GACGGTGCTG
CGCGAAGCGC GCGAGTACGT CGTCGAGGGA TGGGCGGAGA TGAGCCGTTC GCTCGACGCG
CTTGCCGACG GCGCCGACCT GATCCTCACC GGTACGACGT ACCAGGAACT CGCCGCCAAT
GTGGCTGCGG CGCACGCTAT CCCGCTGGCC GCGCTGCACT ACTTCCCGGT GCGCCCGAGC
ACCAAGTCGC TGCCGGTACC CGTGCCGTCC GCGGTGGTCG GCCCGGTCTG GGCCGTCGGT
GAGTGGGCGC ACTGGCGGGT GCTCAAACAG GCCGAGGACG AGCAACGGCG CGAGCTGGGC
CTGCCTCCGG CGAGCACCCG CGCGGTGCGT CGCATGCTCG ACGACGGCGC GTTGGAAATC
CAGGCCTACG ACCGGGTTTT CTTTCCCGGA CTGGCCGAGG AGTGGGGTCC GCAGCGGCCC
CTGGTCGGCG GTATCACCCT CGAGAAGAAC ACCGACGCCG ACGACGATGT GGTCTCCTGG
ACAGCCGCCG GGACACCGCC CGTCTACTTC GGATTCGGCA GCATGCCGGT GAAGTCGCCC
GCCGACGCGG TGGCGATGAT AGAAGCGGCG TGCGCCGATC TCGGCGAGCG GGCGCTGATC
TGCTCGGGAG TGTGGGACGT CGACGAACTG CCGCACGCTG CGCACGTGAA GATCGTGCGG
AGCGTCAACC ACGCGGCGGT CTTCCCGTTG TGCCGCGCCG TGGTTCACCA CGGTGGCGCG
GGTACGACGG CGGCCGGTGT CCGCGCCGGT GTTCCCACGC TGGTGCTGTG GGTGGGTGCC
GAACAACCGA TCTGGGGTTC GCGGGTCAAA CACCTCGGTG TGGGTGATTA CCAACGGTTC
TCGTCCACCA CACGCAAATC GCTGCGTCGC GCCCTGAGCA GGGTGCTGGG ACCGCGATAC
GTCGAGCGCG CACGCGAGGT CGCCGCAGCG ATGACGAAAC CGGCCTCGAG TGTGGGTACC
GCGGCCGACC TTCTCGAAGA TGCGGCGCGT CAGGAGCGCC GACATGGTCA GACGATCTCG
CCGTAG
 
Protein sequence
MKFVVAVHGT RGDVEPCAAV GLELARRGHE VRTAVPPNLI SFVEECGLGV PVSYGVDSQQ 
QLDADIFREW YRLRNPMTVL REAREYVVEG WAEMSRSLDA LADGADLILT GTTYQELAAN
VAAAHAIPLA ALHYFPVRPS TKSLPVPVPS AVVGPVWAVG EWAHWRVLKQ AEDEQRRELG
LPPASTRAVR RMLDDGALEI QAYDRVFFPG LAEEWGPQRP LVGGITLEKN TDADDDVVSW
TAAGTPPVYF GFGSMPVKSP ADAVAMIEAA CADLGERALI CSGVWDVDEL PHAAHVKIVR
SVNHAAVFPL CRAVVHHGGA GTTAAGVRAG VPTLVLWVGA EQPIWGSRVK HLGVGDYQRF
SSTTRKSLRR ALSRVLGPRY VERAREVAAA MTKPASSVGT AADLLEDAAR QERRHGQTIS
P