Gene Mkms_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3112 
Symbol 
ID4610947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3258587 
End bp3260146 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID639792783 
Productanthranilate synthase component I 
Protein accessionYP_939096 
Protein GI119869144 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.89202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGACGA CCGCCGCTTC CGCCTTCGAC TCCTCGCGCG AGCGTTCGTC GCTGGCCACG 
ACCACGTCTC GCGAGGACTT CCGGGCACTG GCAGCCGAGC ACCGCGTGGT GCCGGTGGTC
CGCAAGGTGC TCGCCGACAG CGAGACCCCG CTGTCGGCGT ACCGCAAGCT CGCCGCCAAC
CGGCCCGGCA CGTTCCTGCT CGAATCGGCC GAGAACGGCA GGTCGTGGTC GCGGTGGTCG
TTCATCGGGG CGGGCGCACC GTCGGCGCTG ACGGTCCGCG ACGGCGAGGC GGTGTGGTTG
GGCGTGACGC CGAAGGATGC GCCGAGCGGT GGTGATCCAC TGCAGGCACT GCGGTCCACG
CTGGCGCTGC TGGAGACCGC GCCGCTGCCG GGCCTGCCGC CGCTGTCGAG CGGTCTGGTC
GGGTTCTTCG CCTATGACAT GGTGCGGCAG CTGGAGCGGC TGCCGTCGCT GGCCGTCGAC
GATCTCGGAC TGCCCGACAT GCTGCTGCTG TTGGCCACCG ACATCGCCGC CGTCGACCAC
CACGAGGGCA CCATCACGCT GATCGCCAAC GCGGTGAACT GGAACGGCAC CGACGAGAAC
GTGGACGGCG CGTATGACGA CGCCGTCGCC CGGCTCGACG TGATGACCAA GGCGCTGGGG
CAGTCGCTGC CCTCGTCGGT GGCCACGTTC GCCCGGCCGG CCCCGACGCA CCGGGCGCAG
CGCACCGTCG AGGAGTACAC CGCGATCGTC GAGAAGCTCG TCGGCGACAT CGAGGCCGGT
GAGGCGTTCC AGGTGGTGCC GTCGCAACGC TTCGAGATGG ACACCGTCGC CGATCCGCTC
GATGTGTACC GGATGCTGCG GGTCACCAAT CCCAGTCCGT ACATGTATCT GCTGAACGTG
CCGGATGAGA CTGGGGGACT GGACTTCTCG GTGGTCGGGT CGAGTCCGGA GGCGCTGGTG
ACCGTCGCCG ACGGGAAGGC CACGACGCAC CCGATCGCCG GCACCCGCTG GCGCGGCGAC
ACCGAGGAAG AGGACCTGCT GCTCGAGAAG GAGCTGCTGG CCGACGAGAA GGAACGCGCC
GAACACCTGA TGCTGGTGGA CCTGGGCCGT AACGATCTGG GCCGGGTGTG TGAACCCGGC
ACCGTGCGGG TCGAGGACTA CAGCCACATC GAGCGGTACA GCCACGTCAT GCACCTGGTG
TCGACGGTCA CCGGACGTCT CGCCGAGGGC ATGACCGCGC TCGACGCGGT GACGGCCTGT
TTCCCGGCGG GCACGCTGTC GGGCGCCCCG AAGGTGCGGG CCATGGAGCT CATCGAGGAG
GTCGAGAAGA CCCGCCGCGG GCTCTACGGC GGGGTGCTGG GCTACCTCGA CTTCGCGGGC
AACGCCGATT TCGCGATCGC CATCCGGACC GCGCTGATGC GCGACGGGGT CGCCTACGTC
CAGGCCGGCG GGGGAGTCGT GGCCGACTCC AACGGGCCGT ACGAGTTCAA CGAGGCCACC
AATAAGGCCA AGGCGGTGCT GGCCGCCGTC GCCGCCGCCG AAACCCTGCG CGAACCATGA
 
Protein sequence
MQTTAASAFD SSRERSSLAT TTSREDFRAL AAEHRVVPVV RKVLADSETP LSAYRKLAAN 
RPGTFLLESA ENGRSWSRWS FIGAGAPSAL TVRDGEAVWL GVTPKDAPSG GDPLQALRST
LALLETAPLP GLPPLSSGLV GFFAYDMVRQ LERLPSLAVD DLGLPDMLLL LATDIAAVDH
HEGTITLIAN AVNWNGTDEN VDGAYDDAVA RLDVMTKALG QSLPSSVATF ARPAPTHRAQ
RTVEEYTAIV EKLVGDIEAG EAFQVVPSQR FEMDTVADPL DVYRMLRVTN PSPYMYLLNV
PDETGGLDFS VVGSSPEALV TVADGKATTH PIAGTRWRGD TEEEDLLLEK ELLADEKERA
EHLMLVDLGR NDLGRVCEPG TVRVEDYSHI ERYSHVMHLV STVTGRLAEG MTALDAVTAC
FPAGTLSGAP KVRAMELIEE VEKTRRGLYG GVLGYLDFAG NADFAIAIRT ALMRDGVAYV
QAGGGVVADS NGPYEFNEAT NKAKAVLAAV AAAETLREP