Gene Mmcs_3924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3924 
Symbol 
ID4112754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4179488 
End bp4181353 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content72% 
IMG OID638033067 
Productglycosyl transferase family protein 
Protein accessionYP_641085 
Protein GI108800888 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.73215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGAG TGACCCTGAC CCTTGACGCC GAGCAGACGC AGAATCGCCC GAATGATCGC 
CCGAGAGCGC GATTCTCTGT CTGGTCGCCG AGGATCGGGC TCGGCGTGCT GCTGGCCGGG
ACCGCCGTGC TGTACCTGTG GAACCTGTCG GCCAGCGGAT GGGCCAACGC GTTCTATTCG
GCCGCCGCGC AGGCCGGGTC CCAGAACTGG ACGGCCATGC TGTTCGGGTC CAGTGATGCG
GCCAACGCCA TCACCGTCGA CAAGACGCCC GCGGCGCTGT GGGTGATGGA CCTGTCGGTA
CGGGTGTTCG GGCTGAACTC GTGGAGCATC CTGGCGCCGC AGGCCCTGAT GGGAGTGGCC
GCCGTCGCGG TGCTGTACGC GGCGGTGCGG CGTGTCAGCG GACCGGGCGC CGCGCTGCTG
GCCGGCGCGG TGCTCGCGGT GACCCCCGTG GCCGCGTTGA TGTTCCGGTT CAACAACCCC
GACGCGCTGC TGGTCCTGCT GCTCGTCGTG GCCGGCTACT GCGTGACCCG GGCCTGCGAA
CCCGATGCGC GCCGATGGTG GCTGATCGCC GCCGGGGTGG CCGTCGGATT CGGCTTCCTG
GCCAAGATGC TGCAGGCATT CCTCGTGCTC CCGGGTTTCG TGGCGGCCTA TCTGCTCGCC
GGCAGCCGTC CGGTGGGCCG CCGGATCCTC GACCTGGCAG GCGCGGCCGC GGCGATGGTG
GCGGCCGCCG GCTGGTATCT GCTGCTCGCC GAGCTGTGGC CGGCCGACTC CCGGCCATAC
ATCGGCGGAT CGCAGCACAA CAGCATCGTC GAACTGGCCT TGGGTTACAA CGGTTTCGGC
CGACTCACCG GTGACGAACC GGGTGGGTTG GGCAACCTCA ACCACGACGT CGGGGCGGGG
CGGCTGTTCG GTTTCGGGAT GGGTCTCGAC ATCGCGTGGC TGCTGCCCGC GGCGCTGATC
TGCCTCGGCG CCGCGCTGCT GCTCACCCGC CGGACACCCC GCACCGACAC CACCCGCGCG
GCCCTGCTCA GCTGGGGCGG GTGGCTGGTC GTGACGGCCG TGGTGTTCAG CTTCGCCAAC
GGCATCGTGC ACTCGTACTA CACGGTCGCG CTGGCACCGG CGATCGCCGC GGTCATCGGC
ATCGGCTCAC ACCTGCTGTG GCGCAACAGG TCCCGACCGT GGTGTGCCGT GTCCATGGCC
GGTGCAGTGC TCGTCACCGC GGTGCTGGCC GCGGTGCTGC TGTCGCGCAA CGCCGACTGG
ATGCCGTGGC TGCGGGCGGC CGTCGCGGTC GGGGGAGTCG GTGCTGCGGT GCTGCTGATC
GTGGCGGGCC GGCTGCCCGA CGGTGTCGTC CGCGCCGCCG CCGGACTGGC CGTCGTGGTG
TGTCTCGCAT CGCCCGCGGC CTATTCGGTC GCCACCGCGG CGGCCCCGCA CACCGGCGCC
ATCCCGTCGG TGGGGCCGGC GCGCGGGGGT TTCGGCGGAC CGCCCGGACT GCTGAGCTCA
CCCGAGCCCG GTGAACAGCT CACCGCGCTG CTGGCCCGCG ACGCCCACGC GTACCGGTGG
ACCGCCGCGG TGGTCGGGTC GAACAACGCG GCCGGCTACC AATTGGCAAG CGGCGCACCG
GTGATGGCGC TGGGCGGGTT CAACGGCACC GATCCGGCGC CCACCCTCGA ACAGTTCCAA
CGTCACGTCG CCGACGGCGA TGTGCACTAC TTCATCGGAA GCCGCTCACC CCTCGGCTTC
GGCCGCGGCG CCGAGCAGAG CGGCAGCCGG GCCGCCGCGG ACATCGCGGA CTGGGTGCAG
GCGCGTTACC CGGGGCGAAC CGTCGACGGT GTCGTCGTCT ACGACCTCAC CCGGGCCCCG
GCGTGA
 
Protein sequence
MGRVTLTLDA EQTQNRPNDR PRARFSVWSP RIGLGVLLAG TAVLYLWNLS ASGWANAFYS 
AAAQAGSQNW TAMLFGSSDA ANAITVDKTP AALWVMDLSV RVFGLNSWSI LAPQALMGVA
AVAVLYAAVR RVSGPGAALL AGAVLAVTPV AALMFRFNNP DALLVLLLVV AGYCVTRACE
PDARRWWLIA AGVAVGFGFL AKMLQAFLVL PGFVAAYLLA GSRPVGRRIL DLAGAAAAMV
AAAGWYLLLA ELWPADSRPY IGGSQHNSIV ELALGYNGFG RLTGDEPGGL GNLNHDVGAG
RLFGFGMGLD IAWLLPAALI CLGAALLLTR RTPRTDTTRA ALLSWGGWLV VTAVVFSFAN
GIVHSYYTVA LAPAIAAVIG IGSHLLWRNR SRPWCAVSMA GAVLVTAVLA AVLLSRNADW
MPWLRAAVAV GGVGAAVLLI VAGRLPDGVV RAAAGLAVVV CLASPAAYSV ATAAAPHTGA
IPSVGPARGG FGGPPGLLSS PEPGEQLTAL LARDAHAYRW TAAVVGSNNA AGYQLASGAP
VMALGGFNGT DPAPTLEQFQ RHVADGDVHY FIGSRSPLGF GRGAEQSGSR AAADIADWVQ
ARYPGRTVDG VVVYDLTRAP A