Gene Mmcs_3471 details


Gene Information

Locus tag: Mmcs_3471
Symbol:
ID: 4112303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3692991
End bp: 3694640
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 70%
IMG OID: 638032606
Product: AMP-dependent synthetase and ligase
Protein accession: YP_640634
Protein GI: 108800437
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
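
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 3692991
END_BP = 3694640
GENE_LENGTH_BP = 1650
PROTEIN_LENGTH_AA = 549


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions, 3694640 - 3692991 + 1 = 1650 bp, and 1650 / 3 - 1 = 549 aa, so the listed fields agree with each other.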


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.164344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG GATTCCAGCA TGTGCACAGC GACCTGACCG ATGGCTTCGT CCCATTCCCC 
GAAGACCGCG CCGAGCAGTA CCGGCGGGCC GGGTACTGGA CCGGGCGGCC CCTCGAGTCC
CTTCTCCTTG ACGCGGCACA CCGACGGCCC GATCACCCGG CCGTCGTCGA CGTCGACGGA
TCCCTCACGT TCGCGGAGCT GACCGCGCGC GCCGACACCG TCGCCGCGGC GCTGGCCGGA
CTCGGGATCC GTCCTGGTGA CCGGGTGCTG CTGCAACTAC CCAACTCCGT GCGTTTCGCC
GTCGCGTTCT TCGGTCTGCT GCGGGCGGGC GCGGTGCCGG TGATGTGCCT GCCCGGCCAC
CGCACCGCCG AGCTGGGACA CTTCGCCGAC GTCAGCGGCG CGGTGGCGCT GATCGTGCCC
GACGAGGTCG GCGGCTTCGA CTACCGCGAG ATGGCCGCCC AGTTGGTGGC GGACCGGCCC
ACGCTGCGGC ACGTGCTCGT CGACGGTGAC CCGGGACCGT TCCTATCGTG GGCGGCACTG
ATCGACAGCG GCGGCGTGGC ACCGGAGATC GGACCCGTCG ACACCTCGCT GCCTGCCCTG
CTGCTGGTCT CCGGCGGAAC GACCGGACTG CCCAAGCTGA TCCCCCGCAC CCACGACGAC
TACGTCTACA CCGCGGTGTC GAGCGCGCAG GCGTGCCACT TCACCCCCGA CGACGTCTAC
CTGGTGGCGT TGCCGGCCGG CCACAACTTC CCGTTGGCGT GCCCCGGCAT GCTGGGGGCA
ATGACCGTGG GGGCGACGAC GGTGTTCACC GCCGATCCCA GCCCGGAGGA GGCGTTCGCG
CTCATCGACA AACACCAGGT GACCGTCACC GGGCTGGTCA ACGCGCTGGG CAAGCTGTGG
GCGCAGGCGT GCGACTGGGA ACCGGTGCTG CCGACGTCGC TGCGCCTGGT GCAGGTGGGC
GGTTCGCGGA TGAGCCCGGA GGAGGCGCGG TTCATCCTCG ATCGCCTGAC CCCCGGGCTG
TCCCAGATCT TCGGAATGGC CGAGGGCATG CTGAATTTCA CGCGGCCGGG AGATCCGCTC
GACGTCGTGG TGCACACCCA GGGCAGGCCG GTCTCCCCGC ACGACGAGAT GCGGGTGGTC
GACGAGTCCG GTGTCGAGGT CGCACCGGGT GAAGAGGGCG AACTGCTGGT GCGCGGTCCC
AACACACTCA ACGGGTACTA CCGGGCCGAC GAGGCCAACG CCCGGTGCTT CAGCCCGGAC
GGCTTCTACC GCACCGGGGA CCGGGTGCGG ATCTTCGCCG ACGGCCCGCT GGCCGGCAAC
GTCGAGGTGA CCGGCCGGAT CAAGGACGTC ATCCACCGCG GCGGTGAGAC GGTTTCGGCG
ACCGACCTCG AGGACCATCT GCTGACCCAC CCCGCGATCT ATGCGGCGGC CGCCGTCGCG
CTACCCGACG ACTATCTCGG TGAGAAGATC TGCGCCGCAG TGGTGTTCCG CGGCAAGCAG
CTCACGCTCG CCGAACTCAA CGCATTCCTC GACGAACGCG GGGCGTCCAC TCACGCCCGG
CCCGACGTGC TGGCTGCGAT GCCGTCGCTG CCGCTGACCG CGGTCGGCAA GGTCGACAAG
AAGAAGCTCG TCGCCCAGCT GACGGGGTGA
 
Protein sequence
MSTGFQHVHS DLTDGFVPFP EDRAEQYRRA GYWTGRPLES LLLDAAHRRP DHPAVVDVDG 
SLTFAELTAR ADTVAAALAG LGIRPGDRVL LQLPNSVRFA VAFFGLLRAG AVPVMCLPGH
RTAELGHFAD VSGAVALIVP DEVGGFDYRE MAAQLVADRP TLRHVLVDGD PGPFLSWAAL
IDSGGVAPEI GPVDTSLPAL LLVSGGTTGL PKLIPRTHDD YVYTAVSSAQ ACHFTPDDVY
LVALPAGHNF PLACPGMLGA MTVGATTVFT ADPSPEEAFA LIDKHQVTVT GLVNALGKLW
AQACDWEPVL PTSLRLVQVG GSRMSPEEAR FILDRLTPGL SQIFGMAEGM LNFTRPGDPL
DVVVHTQGRP VSPHDEMRVV DESGVEVAPG EEGELLVRGP NTLNGYYRAD EANARCFSPD
GFYRTGDRVR IFADGPLAGN VEVTGRIKDV IHRGGETVSA TDLEDHLLTH PAIYAAAAVA
LPDDYLGEKI CAAVVFRGKQ LTLAELNAFL DERGASTHAR PDVLAAMPSL PLTAVGKVDK
KKLVAQLTG
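
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch, the following shows one way to strip that layout, compute a GC fraction, and translate a nucleotide string. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts; this gene begins with ATG, so that difference does not matter here). The helper names clean, gc_fraction, and translate are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments assumed to match
# translation table 11 for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and the last line of the gene sequence from this record.
print(translate("ATGAGCACCG GATTCCAGCA"))            # prints: MSTGFQ
print(translate("AAGAAGCTCG TCGCCCAGCT GACGGGGTGA"))  # prints: KKLVAQLTG
```

Passing the full gene sequence to gc_fraction and translate would allow a comparison against the GC content and protein sequence listed in the record.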