Gene Mjls_3939 details


Gene Information

Locus tag: Mjls_3939
Symbol:
ID: 4879648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 4160895
End bp: 4162760
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 72%
IMG OID: 640141251
Product: glycosyl transferase family protein
Protein accession: YP_001072205
Protein GI: 126436514
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
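
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, expected protein length in aa).

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the terminal stop.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record: Start bp, End bp.
gene_length, protein_length = check_cds_lengths(4160895, 4162760)
print(gene_length, protein_length)  # 1866 621
```

Under these assumptions the computed values agree with the listed Gene Length (1866 bp) and Protein Length (621 aa).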


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.186492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGAG TGACCCTGAC CCTTGACGCC GAGCAGACGC AGAATCGCCC GAATGATCGC 
CCGAGAGCGC GATTCTCTGT CTGGTCGCCG AGGATCGGGC TCGGCGTGCT GCTGGCCGGG
ACCGCCGTGC TGTACCTGTG GAACCTGTCG GCCAGCGGAT GGGCCAACGC GTTCTATTCG
GCCGCCGCGC AGGCCGGGTC CCAGAACTGG ACGGCCATGC TGTTCGGGTC CAGTGATGCG
GCCAACGCCA TCACCGTCGA CAAGACGCCC GCGGCGCTGT GGGTGATGGA CCTGTCGGTA
CGGGTGTTCG GGCTGAACTC GTGGAGCATC CTGGCGCCGC AGGCCCTGAT GGGAGTGGCC
GCCGTCGCGG TGCTGTACGC GGCGGTGCGG CGTGTCAGCG GACCGGGCGC CGCGCTGCTG
GCCGGCGCGG TGCTCGCGGT GACCCCCGTG GCCGCGTTGA TGTTCCGGTT CAACAACCCC
GACGCGCTGC TGGTCCTGCT GCTCGTCGTG GCCGGCTACT GCGTGACCCG GGCCTGCGAA
CCCGATGCGC GCCGATGGTG GCTGATCGCC GCCGGGGTGG CCGTCGGATT CGGCTTCCTG
GCCAAGATGC TGCAGGCATT CCTCGTGCTC CCGGGTTTCG TGGCGGCCTA TCTGCTCGCC
GGCAGCCGTC CGGTGGGCCG CCGGATCCTC GACCTGGCAG GCGCGGCCGC GGCGATGGTG
GCGGCCGCCG GCTGGTATCT GCTGCTCGCC GAGCTGTGGC CGGCCGACTC CCGGCCATAC
ATCGGCGGAT CGCAGCACAA CAGCATCGTC GAACTGGCCT TGGGTTACAA CGGTTTCGGC
CGACTCACCG GTGACGAACC GGGTGGGTTG GGCAACCTCA ACCACGACGT CGGGGCGGGG
CGGCTGTTCG GTTTCGGGAT GGGTCTCGAC ATCGCGTGGC TGCTGCCCGC GGCGCTGATC
TGCCTCGGCG CCGCGCTGCT GCTCACCCGC CGGACACCCC GCACCGACAC CACCCGCGCG
GCCCTGCTCA GCTGGGGCGG GTGGCTGGTC GTGACGGCCG TGGTGTTCAG CTTCGCCAAC
GGCATCGTGC ACTCGTACTA CACGGTCGCG CTGGCACCGG CGATCGCCGC GGTCATCGGC
ATCGGCTCAC ACCTGCTGTG GCGCAACAGG TCCCGACCGT GGTGTGCCGT GTCCATGGCC
GGTGCAGTGC TCGTCACCGC GGTGCTGGCC GCGGTGCTGC TGTCGCGCAA CGCCGACTGG
ATGCCGTGGC TGCGGGCGGC CGTCGCGGTC GGGGGAGTGG GTGCTGCGGT GCTGCTGATC
GTGGCGGGCC GGCTGCCCGA CGGTGTCGTC CGCGCCGCCG CCGGACTGGC CGTCGTGGTG
TGTCTCGCAT CGCCCGCGGC CTATTCGGTC GCCACCGCGG CGGCCCCGCA CACCGGCGCC
ATCCCGTCGG TGGGGCCGGC GCGCGGGGGT TTCGGCGGAC CGCCCGGACT GCTGAGCTCA
CCCGAGCCCG GTGAACAGCT CACCGCGCTG CTGGCCCGCG TCGCCCACGC GTACCGGTGG
ACCGCCGCGG TGGTCGGGTC GAACAACGCG GCCGGCTACC AATTGGCAAG CGGCGCACCG
GTGATGGCGC TGGGCGGGTT CAACGGCACC GATCCGGCGC CCACCCTCGA ACAGTTCCAA
CGTCACGTCG CCGACGGCGA TGTGCACTAC TTCATCGGAA GCCGCTCACC CCTCGGCTTC
GGCCGCGGCG CCGAGCAGAG CGGCAGCCGG GCCGCCGCGG ACATCGCGGA CTGGGTGCAG
GCGCGTTACC CGGGGCGAAC CGTCGACGGT GTCGTCGTCT ACGACCTCAC CCGGGCCCCG
GCGTGA
 
Protein sequence
MGRVTLTLDA EQTQNRPNDR PRARFSVWSP RIGLGVLLAG TAVLYLWNLS ASGWANAFYS 
AAAQAGSQNW TAMLFGSSDA ANAITVDKTP AALWVMDLSV RVFGLNSWSI LAPQALMGVA
AVAVLYAAVR RVSGPGAALL AGAVLAVTPV AALMFRFNNP DALLVLLLVV AGYCVTRACE
PDARRWWLIA AGVAVGFGFL AKMLQAFLVL PGFVAAYLLA GSRPVGRRIL DLAGAAAAMV
AAAGWYLLLA ELWPADSRPY IGGSQHNSIV ELALGYNGFG RLTGDEPGGL GNLNHDVGAG
RLFGFGMGLD IAWLLPAALI CLGAALLLTR RTPRTDTTRA ALLSWGGWLV VTAVVFSFAN
GIVHSYYTVA LAPAIAAVIG IGSHLLWRNR SRPWCAVSMA GAVLVTAVLA AVLLSRNADW
MPWLRAAVAV GGVGAAVLLI VAGRLPDGVV RAAAGLAVVV CLASPAAYSV ATAAAPHTGA
IPSVGPARGG FGGPPGLLSS PEPGEQLTAL LARVAHAYRW TAAVVGSNNA AGYQLASGAP
VMALGGFNGT DPAPTLEQFQ RHVADGDVHY FIGSRSPLGF GRGAEQSGSR AAADIADWVQ
ARYPGRTVDG VVVYDLTRAP A
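
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence. This is a minimal sketch under stated assumptions, not the database's own method. It uses a hand-built codon table whose amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied with its 10-base spacing.
first_row = "ATGGGTCGAG TGACCCTGAC CCTTGACGCC GAGCAGACGC AGAATCGCCC GAATGATCGC"
print(translate(first_row))  # MGRVTLTLDAEQTQNRPNDR
```

The result matches the first 20 residues of the protein sequence (MGRVTLTLDA EQTQNRPNDR). Passing the full gene sequence to the same function should reproduce the whole protein, since the final codon TGA is a stop.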