Gene Mkms_3996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3996 
Symbol 
ID4611936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4210749 
End bp4212617 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content71% 
IMG OID639793680 
Productglycosyl transferase family protein 
Protein accessionYP_939978 
Protein GI119870026 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0580758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.973427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTGA CCACCGAGGC ACCACCGGCG GCGGCGCCCG CCACCTCCGG AGGCCGCGCG 
CGCTGGGAGC GGCTGGCGCT GCTCACCCTG CTCGCCGCCA CCGCCGTGCT CTACCTGTGG
GGGCTGGGCT CCTCGGGCTG GGCCAACGAA TACTACGCCG CGGCCGTGCA GGCCGGCACC
CAGAACTGGA CGGCCTGGCT GTTCGGGTCG CTGGACCCCG GCAACTCCAT CACCGTCGAC
AAGCCACCGG CTTCGCTCTG GCTGATGGTG TTGTCCGCCA GGTTGTTCGG GTTCAGCGCG
TTTGCGATGC TGTTACCGCA GGCGCTGATG GGTGTGGCGA CGGTGGCGGT GCTGTTCGCC
GCGGTGCGAC GGGTCAGCGG TGCGGGCGCC GGGATGGTCG CCGGTGCGGT GATGGCTACG
ATGCCGGTGG CGGCGTTGAT GTTCCGCTTC AACAATCCCG ACGCGCTGCT GGTGCTGCTG
CTCGTCGTCG CCGCGTACTG CATGGTGCGG GCGATCGAGA CCGCGAGTAC GCGCTGGATG
GTCCTCGTCG GCTGCGCGTT GGGGTTCGCG TTCCTCACCA AGATGCTGCA GGCCTTCCTC
GTGATGCCCG GTCTGGCGCT GGCGTTCCTG GTGGCGGCGC CGGTGGCGTT GTGGCGGCGG
ATCGGCACGC TCGCCGTCGG CGCGGTGTCG ATGGTGGTGT CGGCGGGATG GTTCATCGCT
CTGGTCGAGG TGTGGCCGGC GTCGTCGCGT CCCTACATCG GCGGTTCGAC CGACAACAGC
CTGCTGCAGT TGGCCCTGGG CTACAACGGC ATCCAGCGAA TCGCCGGTGG CGGGGGACCG
GGCGGCGGGC CCGGCGGCGG TCCGGGGGAC GGACCGGGTC GCGGCGCGAA TCTGTTCTTC
GGCGGTGAGC CTGGGATCGG ACGCCTGTTC GGGCATTCGA TGGGTGTCGA GGCCTCGTGG
CTCCTGCCCG CGGCGCTGAT CGGCCTGGCC GCCGGCATCT GGTTCACCCG CCGCGCCGTG
CGCACCGACG CGGTACGCGC GAGCCTGCTG CTGTGGGGCG GGTGGCTGCT GGTCACCGGC
GTCGTGTTCA GTTTCATGGA CGGCACGATC CACCCGTACT ACACGGTGGC GCTGTCGCCC
GCGGTGGCCG CGCTGGTCGG CATCGCGGTC GTGGAGTGCT GGCGCGGCAG GCGCTACCTT
CAGCCCCGCC TCGCGCTGGC CGCGATGATG GCGGCGACGG GCGTCTGGGC GTTCGTGTTG
CTCGTCCGCA CCCCGGACTG GCTGCCGTGG CTGCGCTGGG TGGTGCTCGC GCTCGCGATT
TTGGTCGCGG CGATCCTGGT GGTCGGTGCG CACCGGCTGA AGCGGGCCGC GACAGCCGTC
GTCGTCGCCG CGGCGCTGGC CGGCCTCGCC GCGCCCACCG CCTTCGCGGT CTACAACGTG
GCGCACCCCG CGAGTGGTCC CGGCACCATG TCCGGTCCCG CACGCGGCGA CGCCTTCGGA
GGAATGCCAC CGGGAGGCCC CCGCGGCGAC CGGGACGACG CCGCCGTGGC GGAGCTGGTC
CGAGGTGTCG ACAGCCGTTG GGCGGCAGCC AGTGTCGGGT CGATGGGATC GGCGGGTCTG
CAGTTGGACT CCGGGGCCTC GATCATGGCG ATCGGCGGGT TCACCGGCTC GGACGCCTCG
CCGACACTCG CGCAGTTCCA GCAGTACGTC GCCGACGGTG ATGTCCGGTA TTTCATCGGC
AGTGACAGGG GTGGTCCACC CGGCTTCGGG CGCGACGGCA CCGCCGCGGA GATCACCGCG
TGGGTGCAGG AGAACTTCAC CCCCGTTCAG GTTGGTGGAG CGACCGTCTA CGACCTGCAA
TCCGGCTGA
 
Protein sequence
MTVTTEAPPA AAPATSGGRA RWERLALLTL LAATAVLYLW GLGSSGWANE YYAAAVQAGT 
QNWTAWLFGS LDPGNSITVD KPPASLWLMV LSARLFGFSA FAMLLPQALM GVATVAVLFA
AVRRVSGAGA GMVAGAVMAT MPVAALMFRF NNPDALLVLL LVVAAYCMVR AIETASTRWM
VLVGCALGFA FLTKMLQAFL VMPGLALAFL VAAPVALWRR IGTLAVGAVS MVVSAGWFIA
LVEVWPASSR PYIGGSTDNS LLQLALGYNG IQRIAGGGGP GGGPGGGPGD GPGRGANLFF
GGEPGIGRLF GHSMGVEASW LLPAALIGLA AGIWFTRRAV RTDAVRASLL LWGGWLLVTG
VVFSFMDGTI HPYYTVALSP AVAALVGIAV VECWRGRRYL QPRLALAAMM AATGVWAFVL
LVRTPDWLPW LRWVVLALAI LVAAILVVGA HRLKRAATAV VVAAALAGLA APTAFAVYNV
AHPASGPGTM SGPARGDAFG GMPPGGPRGD RDDAAVAELV RGVDSRWAAA SVGSMGSAGL
QLDSGASIMA IGGFTGSDAS PTLAQFQQYV ADGDVRYFIG SDRGGPPGFG RDGTAAEITA
WVQENFTPVQ VGGATVYDLQ SG