Gene Mmcs_4272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4272 
Symbol 
ID4113102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4539567 
End bp4540841 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content73% 
IMG OID638033417 
Productaminodeoxychorismate synthase component I 
Protein accessionYP_641433 
Protein GI108801236 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.424437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGATCG AGCGGCTCGG CGACCTCGGC GATGCGCCCA CGGTGCTCGC CGCGGTCGCC 
TCGGCCGGTG CCGCGCTGGG ACTCCCGCCG CCCGCCGCAC TGCTCGGGGA CTGGTTCGGG
TCCACGGCCG TCATCGCCCC GTCGGTGACG ATCGCACCCG TCGCGCAGAT CGACGTGTTC
GACGTGCCGC CCGGCGCCGG GAACGCCGTC GGCGGAGGGT GGTTCGGATA CCTGTCCTAC
CCGGATCCCG GCGCCGACGG CGCCGGCCCG CGCATCCCCG CGGCGGCGGG CGGCTGGTCG
GACTGTGTGC TGCGCCGGGA CCGCGAGGGC TGCTGGTGGC ATGAAAGCCT CAGCGGCACA
ACGGCTCCCG CCTGGTTGCT CGACGCCGTC AGGTCCCCCG GGGCGCCGTC GGCCTACGAG
ATCGGCTGGA CGCCACCGGA TCGCGACACC CACCGCCGGG GGGTGACCGA CTGCCTGGCC
GCGATCGCCG CCGGCGAGGT GTATCAGGCC TGCGTCTGCA CACAGTTCAC CGGACGGCTC
CGCGGATCGC CGCTGGACTT CTTCGTCGAC ACCGCCCGGC GCACCACCCC GGCCCGCGCC
GCATACCTGG CCGGCGACTG GGGTGCGGTG GCGTCGCTGT CCCCGGAACT GTTCCTGCGC
CGCCGCGGCA CGGCGGTGAC GTCCAGCCCG ATCAAGGGCA CGCTGCCGTC CTCGGCGGAT
CCGCTCGAGC TGCGGGCCTC GGTCAAGGAC GTGGCGGAGA ACATCATGAT CGTCGACCTG
GTCCGCAACG ACCTCGGCCG GATCGCGCGG ACGGGAACGG TGACGGTGCC CGAACTGCTG
GCCGTCCGCC CCGCCCCCGG GGTCTGGCAT CTGGTGTCGA CGGTCACCGC GGACGTCCCC
GTCGACCTCC CGATGTCCGA CGTGCTCGAC GCGACGTTCC CGCCGGCGTC GGTCACCGGC
ACCCCGAAGG GCAGAGCGCG CAGCCTGCTG CGGCACTGGG AGCCGAAGCG ACGCGGAATC
TATTGCGGCA CAATCGGTCT CGCCTCCCCA GCGGCGGGGT GCGAATTGAA CGTGGCGATC
CGGACGGTGG AGTTCGGCGC CGACGGGTCG GCCGTGCTCG GCGTCGGCGG CGGCATCACC
GCCGACTCCG ATCCCGACCG CGAATGGGAC GAATGCCTGC ACAAGGCCGC ATCCATCGTC
GGCCCCTGTT CGCCGGTGCC GAGCGCCGAA CTGATCGCGA CGACGCACTG GTCGGTCACC
GCCGAACAGC CCTAA
 
Protein sequence
MRIERLGDLG DAPTVLAAVA SAGAALGLPP PAALLGDWFG STAVIAPSVT IAPVAQIDVF 
DVPPGAGNAV GGGWFGYLSY PDPGADGAGP RIPAAAGGWS DCVLRRDREG CWWHESLSGT
TAPAWLLDAV RSPGAPSAYE IGWTPPDRDT HRRGVTDCLA AIAAGEVYQA CVCTQFTGRL
RGSPLDFFVD TARRTTPARA AYLAGDWGAV ASLSPELFLR RRGTAVTSSP IKGTLPSSAD
PLELRASVKD VAENIMIVDL VRNDLGRIAR TGTVTVPELL AVRPAPGVWH LVSTVTADVP
VDLPMSDVLD ATFPPASVTG TPKGRARSLL RHWEPKRRGI YCGTIGLASP AAGCELNVAI
RTVEFGADGS AVLGVGGGIT ADSDPDREWD ECLHKAASIV GPCSPVPSAE LIATTHWSVT
AEQP