Gene Mjls_3937 details


Gene Information

Locus tag: Mjls_3937
Symbol:
ID: 4879646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 4157674
End bp: 4159542
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 71%
IMG OID: 640141249
Product: glycosyl transferase family protein
Protein accession: YP_001072203
Protein GI: 126436512
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
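
The coordinates and lengths listed above agree with each other, and the short Python sketch below checks that arithmetic. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4157674
end_bp = 4159542
gene_length_bp = 1869
protein_length_aa = 622

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinate span matches Gene Length
print(gene_length_bp % 3 == 0)                    # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # codons minus stop matches Protein Length
```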


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.260108
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGTGA CCACCGAGGC ACCACCGGCG GCGGCGCCCG CCACCTCCGG AGGCCGCGCG 
CGCTGGGAGC GGCCGGCGCT GCTCACCCTG CTCGCCGCCA CCGCCGTGCT CTACCTGTGG
GGGCTGGGCT CCTCGGGCTG GGCCAACGAA TACTACGCCG CGGCCGTGCA GGCCGGCACC
CAGAACTGGA CGGCCTGGCT GTTCGGGTCG CTGGACCCCG GCAACTCCAT CACCGTCGAC
AAGCCACCGG CTTCGCTCTG GCTGATGGTG TTGTCCGCCA GGTTGTTCGG GTTCAGCGCG
TTTGCGATGC TGTTACCGCA GGCGCTGATG GGTGTGGCGA CGGTGGCGGT GCTGTTCGCC
GCGGTGCGAC GGGTCAGCGG TGCGGGCGCC GGGATGGTCG CCGGTGCGGT GATGGCTACG
ATGCCGGTGG CGGCGTTGAT GTTCCGCTTC AACAATCCCG ACGCGCTGCT GGTGCTGCTG
CTCGTCGTCG CCGCGTACTG CATGGTGCGG GCGATCGAGA CCGCGAGTAC GCGCTGGATG
GTCCTCGTCG GCTGCGCGTT GGGGTTCGCG TTCCTCACCA AGATGCTGCA GGCCTTCCTC
GTGATGCCCG GTCTGGCGCT GGCGTTCCTG GTGGCGGCGC CGGTGGCGTT GTGGCGGCGG
ATCGGCACGC TCGCCGTCGG CGCGGTGTCG ATGGTGGTGT CGGCGGGATG GTTCATCGCT
CTGGTCGAGG TGTGGCCGGC GTCGTCGCGT CCCTACATCG GCGGTTCGAC CGACAACAGC
CTGCTGCAGT TGGCCCTGGG CTACAACGGC ATCCAGCGAA TCGCCGGTGG CGGGGGACCG
GGCGGCGGGC CCGGCGGCGG TCCGGGGGAC GGACCGGGTC GCGGCGCGAA TCTGTTCTTC
GGCGGTGAGC CTGGGATCGG ACGCCTGTTC GGGCATTCGA TGGGTGTCGA GGCCTCGTGG
CTCCTGCCCG CGGCGCTGAT CGGCCTGGCC GCCGGCATCT GGTTCACCCG CCGCGCCGTG
CGCACCGACG CGGTACGCGC GAGCCTGCTG CTGTGGGGCG GGTGGCTGCT GGTCACCGGC
GTCGTGTTCA GTTTCATGGA CGGCACGATC CACCCGTACT ACACGGTGGC GCTGTCGCCC
GCGGTGGCCG CGCTGGTCGG CATCGCGGTC GTGGAGTGCT GGCGCGGCAG GCGCTACCTT
CAGCCCCGCC TCGCGCTGGC CGCGATGATG GCGGCGACGG GCGTCTGGGC GTTCGTGTTG
CTCGTCCGCA CCCCGGACTG GCTGCCGTGG CTGCGCTGGG TGGTGCTCGC GCTCGCGATT
TTGGTCGCGG CGATCCTGGT GGTCGGTGCG CACCGGCTGA AGCGGGCCGC GACAGCCGTC
GTCGTCGCCG CGGCGCTGGC CGGCCTCGCC GCGCCCACCG CCTTCGCGGT CTACAACGTG
GCGCACCCCG CGAGTGGTCC CGGCACCATG TCCGGTCCCG CACGCGGCGA CGCCTTCGGA
GGAATGCCAC CGGGAGGCCC CCGCGGCGAC CGGGACGACG CCGCCGTGGC GGAGCTGGTC
CGAGGTGTCG ACAGCCGTTG GGCGGCAGCC AGTGTCGGGT CGATGGGATC GGCGGGTCTG
CAGTTGGACT CCGGGGCCTC GATCATGGCG ATCGGCGGGT TCACCGGCTC GGACGCCTCG
CCGACACTCG CGCAGTTCCA GCAGTACGTC GCCGACGGTG ATGTCCGGTA TTTCATCGGC
AGTGACAGGG GTGGTCCACC CGGCTTCGGG CGCGACGGCA CCGCCGCGGA GATCACCGCG
TGGGTGCAGG AGAACTTCAC CCCCGTTCAG GTTGGTGGAG CGACCGTCTA CGACCTGCAA
TCCGGCTGA
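
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a minimal Python sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not expected to equal the 71% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGACAGTGA CCACCGAGGC ACCACCGGCG GCGGCGCCCG CCACCTCCGG AGGCCGCGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.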
 
Protein sequence
MTVTTEAPPA AAPATSGGRA RWERPALLTL LAATAVLYLW GLGSSGWANE YYAAAVQAGT 
QNWTAWLFGS LDPGNSITVD KPPASLWLMV LSARLFGFSA FAMLLPQALM GVATVAVLFA
AVRRVSGAGA GMVAGAVMAT MPVAALMFRF NNPDALLVLL LVVAAYCMVR AIETASTRWM
VLVGCALGFA FLTKMLQAFL VMPGLALAFL VAAPVALWRR IGTLAVGAVS MVVSAGWFIA
LVEVWPASSR PYIGGSTDNS LLQLALGYNG IQRIAGGGGP GGGPGGGPGD GPGRGANLFF
GGEPGIGRLF GHSMGVEASW LLPAALIGLA AGIWFTRRAV RTDAVRASLL LWGGWLLVTG
VVFSFMDGTI HPYYTVALSP AVAALVGIAV VECWRGRRYL QPRLALAAMM AATGVWAFVL
LVRTPDWLPW LRWVVLALAI LVAAILVVGA HRLKRAATAV VVAAALAGLA APTAFAVYNV
AHPASGPGTM SGPARGDAFG GMPPGGPRGD RDDAAVAELV RGVDSRWAAA SVGSMGSAGL
QLDSGASIMA IGGFTGSDAS PTLAQFQQYV ADGDVRYFIG SDRGGPPGFG RDGTAAEITA
WVQENFTPVQ VGGATVYDLQ SG
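
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the method used to produce this record. It builds the codon table from the 64 amino-acid assignments that translation table 11 shares with the standard code, and stops at the first stop codon. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code; the tables differ in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACAGTGA CCACCGAGGC ACCACCGGCG"))  # MTVTTEAPPA
```

The output matches the first block of the protein sequence, and the final nine bases (TCCGGCTGA) translate to SG followed by a stop, which matches the end of the protein.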