Gene Mkms_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4358 
Symbol 
ID4612300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4578772 
End bp4580046 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content73% 
IMG OID639794043 
Productaminodeoxychorismate synthase component I 
Protein accessionYP_940339 
Protein GI119870387 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0126279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGATCG AGCGGCTCGG CGACCTCGGC GATGCGCCCA CGGTGCTCGC CGCGGTCGCC 
TCGGCCGGTG CCGCGCTGGG ACTCCCGCCG CCCGCCGCAC TGCTCGGGGA CTGGTTCGGG
TCCACGGCCG TCATCGCCCC GTCGGTGACG ATCGCACCCG TCGCGCAGAT CGACGTGTTC
GACGTGCCGC CCGGCGCCGG GAACGCCGTC GGCGGAGGGT GGTTCGGATA CCTGTCCTAC
CCGGATCCCG GCGCCGACGG CGCCGGCCCG CGCATCCCCG CGGCGGCGGG CGGCTGGTCG
GACTGTGTGC TGCGCCGGGA CCGCGAGGGC TGCTGGTGGC ATGAAAGCCT CAGCGGCACA
ACGGCTCCCG CCTGGTTGCT CGACGCCGTC AGGTCCCCCG GGGCGCCGTC GGCCTACGAG
ATCGGCTGGA CGCCACCGGA TCGCGACACC CACCGCCGGG GGGTGACCGA CTGCCTGGCC
GCGATCGCCG CCGGCGAGGT GTATCAGGCC TGCGTCTGCA CACAGTTCAC CGGACGGCTC
CGCGGATCGC CGCTGGACTT CTTCGTCGAC ACCGCCCGGC GCACCACCCC GGCCCGCGCC
GCATACCTGG CCGGCGACTG GGGTGCGGTG GCGTCGCTGT CCCCGGAACT GTTCCTGCGC
CGCCGCGGCA CGGCGGTGAC GTCCAGCCCG ATCAAGGGCA CGCTGCCGTC CTCGGCGGAT
CCGCTCGAGC TGCGGGCCTC GGTCAAGGAC GTGGCGGAGA ACATCATGAT CGTCGACCTG
GTCCGCAACG ACCTCGGCCG GATCGCGCGG ACGGGAACGG TGACGGTGCC CGAACTGCTG
GCCGTCCGCC CCGCCCCCGG GGTCTGGCAT CTGGTGTCGA CGGTCACCGC GGACGTCCCC
GTCGACCTCC CGATGTCCGA CGTGCTCGAC GCGACGTTCC CGCCGGCGTC GGTCACCGGC
ACCCCGAAGG GCAGAGCGCG CAGCCTGCTG CGGCACTGGG AGCCGAAGCG ACGCGGAATC
TATTGCGGCA CAATCGGTCT CGCCTCCCCA GCGGCGGGGT GCGAATTGAA CGTGGCGATC
CGGACGGTGG AGTTCGGCGC CGACGGGTCG GCCGTGCTCG GCGTCGGCGG CGGCATCACC
GCCGACTCCG ATCCCGACCG CGAATGGGAC GAATGCCTGC ACAAGGCCGC ATCCATCGTC
GGCCCCTGTT CGCCGGTGCC GAGCGCCGAA CTGATCGCGA CGACGCACTG GTCGGTCACC
GCCGAACAGC CCTAA
 
Protein sequence
MRIERLGDLG DAPTVLAAVA SAGAALGLPP PAALLGDWFG STAVIAPSVT IAPVAQIDVF 
DVPPGAGNAV GGGWFGYLSY PDPGADGAGP RIPAAAGGWS DCVLRRDREG CWWHESLSGT
TAPAWLLDAV RSPGAPSAYE IGWTPPDRDT HRRGVTDCLA AIAAGEVYQA CVCTQFTGRL
RGSPLDFFVD TARRTTPARA AYLAGDWGAV ASLSPELFLR RRGTAVTSSP IKGTLPSSAD
PLELRASVKD VAENIMIVDL VRNDLGRIAR TGTVTVPELL AVRPAPGVWH LVSTVTADVP
VDLPMSDVLD ATFPPASVTG TPKGRARSLL RHWEPKRRGI YCGTIGLASP AAGCELNVAI
RTVEFGADGS AVLGVGGGIT ADSDPDREWD ECLHKAASIV GPCSPVPSAE LIATTHWSVT
AEQP