Gene Mmcs_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0939 
Symbol 
ID4109779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1036970 
End bp1038259 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content57% 
IMG OID638030063 
ProductO-antigen polymerase 
Protein accessionYP_638110 
Protein GI108797913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.51136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTCA CGCATAATAA CTTGACGATC GCCACCAGTG CTCCACGGTT TGGTCTCGTT 
CTCGCGGCAT GGACGTGTGT CGCAGTTGTC CTGCCGGTTT ACCGAATACT CCCCGAACCG
GCAAGAATCG CATGGTTCTG CGCAACCTTT GGCCTGATGG TGCTTTGGTT GGTGTTAGGA
CGCGTTGCCA GACCGCTGTA CCCCGTAATT TGGATTCTTG CAGGCTATGG TGCTGTCGTC
GCAACGGTCA CAGCCACGGG TCGAGCATCT GTGCCGGACA ACTTGTTTAC GGGCAGTCAG
CTAGCCATCC TTCTTGGAGT TGGCCCCTTT GTACTTCGAT GGCTTGTCTT GAACATTCCC
GATTTTACCC GAACGGTGTG CATTGCATTC CTGATTGGCC AAACGTGCTC GTCGGCGGCC
GGTATTGCCC AGATCATGGG GACTTCGGTC TTTGGCTTTG CCACGGTACA GGGGCGCGCC
CCCGGACTCT CAGCTCACCC GAATGTCCTC GGATTGCTTT CCTGTCTCGC GCTCTTGGTT
TGTGTACAGG CGCTTGTTCA CGAGCGTCAG CCGCGCATCT TGATCGGTGC CGCTGGTGCG
AGTGCCATAA ATATCGGCGG CCTCCTGTCA ACCGGCAGCC TTAGTTCTCT TATGGCCGGC
GCGGCGGGTT TGCTCGTTAC CGCAATCTGC CTCCGCGACC AAATCAAGCA CCTCAGTAGA
ATAATCGTGG GAACAGCAAT CGTCTCGTGG GTTGTTCTAA CTTATACTGA CTTCGCGGAC
AATATGCGTA CGCCAGCAGA CCGATACTTA CAGGTCACGG GGCAGACCGA TGCAGAGAGC
ACTTGGGAGA TCCGGCAACG GACGTACCAA TTTGCTTGGG ACGCAATTAG AGAGGATCCT
CTTTTCGGCG TTGGTTTGCC GGTGAAGTTC GGAGCAACGT TCGACGGGAT CACCCTCACG
CACAACTTTC TTCTACGATC CTGGTTTCAG GGGGGTATTG CGCTGGCTCT CCTCGGCTCC
CTGATCGTCC TTGCTGTTCT TATTGTTGCC ATGAAAGCGC TTCGTCATAA GGACAACGGT
CTCGCGGCCG GCGTCCTCGT GACGGTGATG GCGTTCGCGT TGACCTCCGC ATTCTTCGAG
CAGCCCAACT ATTGGTTGCC TGCTCTGCTG GCGTGGGCGG CGCTTAGGCC GTGGAGCAAG
CCGGAAAGCG CGCCTGAGCT TGTTACGGGA AACAATGGTG CTGCCCCGCC GGGACTGATC
ATTGCGGGCA CTTCGACGCC TTCGCCGTGA
 
Protein sequence
MRVTHNNLTI ATSAPRFGLV LAAWTCVAVV LPVYRILPEP ARIAWFCATF GLMVLWLVLG 
RVARPLYPVI WILAGYGAVV ATVTATGRAS VPDNLFTGSQ LAILLGVGPF VLRWLVLNIP
DFTRTVCIAF LIGQTCSSAA GIAQIMGTSV FGFATVQGRA PGLSAHPNVL GLLSCLALLV
CVQALVHERQ PRILIGAAGA SAINIGGLLS TGSLSSLMAG AAGLLVTAIC LRDQIKHLSR
IIVGTAIVSW VVLTYTDFAD NMRTPADRYL QVTGQTDAES TWEIRQRTYQ FAWDAIREDP
LFGVGLPVKF GATFDGITLT HNFLLRSWFQ GGIALALLGS LIVLAVLIVA MKALRHKDNG
LAAGVLVTVM AFALTSAFFE QPNYWLPALL AWAALRPWSK PESAPELVTG NNGAAPPGLI
IAGTSTPSP