Gene Mmcs_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3053 
Symbol 
ID4111885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3230305 
End bp3231864 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID638032183 
Productanthranilate synthase component I 
Protein accessionYP_640216 
Protein GI108800019 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0820885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGACGA CCGCCGCTTC CGCCTTCGAC TCCTCGCGCG AGCGTTCGTC GCTGGCCACG 
ACCACGTCTC GCGAGGACTT CCGGGCACTG GCAGCCGAGC ACCGCGTGGT GCCGGTGGTC
CGCAAGGTGC TCGCCGACAG CGAGACCCCG CTGTCGGCGT ACCGCAAGCT CGCCGCCAAC
CGGCCCGGCA CGTTCCTGCT CGAATCGGCC GAGAACGGCA GGTCGTGGTC GCGGTGGTCG
TTCATCGGGG CGGGCGCACC GTCGGCGCTG ACGGTCCGCG ACGGCGAGGC GGTGTGGTTG
GGCGTGACGC CGAAGGATGC GCCGAGCGGT GGTGATCCAC TGCAGGCACT GCGGTCCACG
CTGGCGCTGC TGGAGACCGC GCCGCTGCCG GGCCTGCCGC CGCTGTCGAG CGGTCTGGTC
GGGTTCTTCG CCTATGACAT GGTGCGGCAG CTGGAGCGGC TGCCGTCGCT GGCCGTCGAC
GATCTCGGAC TGCCCGACAT GCTGCTGCTG TTGGCCACCG ACATCGCCGC CGTCGACCAC
CACGAGGGCA CCATCACGCT GATCGCCAAC GCGGTGAACT GGAACGGCAC CGACGAGAAC
GTGGACGGCG CGTATGACGA CGCCGTCGCC CGGCTCGACG TGATGACCAA GGCGCTGGGG
CAGTCGCTGC CCTCGTCGGT GGCCACGTTC GCCCGGCCGG CCCCGACGCA CCGGGCGCAG
CGCACCGTCG AGGAGTACAC CGCGATCGTC GAGAAGCTCG TCGGCGACAT CGAGGCCGGT
GAGGCGTTCC AGGTGGTGCC GTCGCAACGC TTCGAGATGG ACACCGTCGC CGATCCGCTC
GATGTGTACC GGATGCTGCG GGTCACCAAT CCCAGTCCGT ACATGTATCT GCTGAACGTG
CCGGATGAGA CTGGGGGACT GGACTTCTCG GTGGTCGGGT CGAGTCCGGA GGCGCTGGTG
ACCGTCGCCG ACGGGAAGGC CACGACGCAC CCGATCGCCG GCACCCGCTG GCGCGGCGAC
ACCGAGGAAG AGGACCTGCT GCTCGAGAAG GAGCTGCTGG CCGACGAGAA GGAACGCGCC
GAACACCTGA TGCTGGTGGA CCTGGGCCGT AACGATCTGG GCCGGGTGTG TGAACCCGGC
ACCGTGCGGG TCGAGGACTA CAGCCACATC GAGCGGTACA GCCACGTCAT GCACCTGGTG
TCGACGGTCA CCGGACGTCT CGCCGAGGGC ATGACCGCGC TCGACGCGGT GACGGCCTGT
TTCCCGGCGG GCACGCTGTC GGGCGCCCCG AAGGTGCGGG CCATGGAGCT CATCGAGGAG
GTCGAGAAGA CCCGCCGCGG GCTCTACGGC GGGGTGCTGG GCTACCTCGA CTTCGCGGGC
AACGCCGATT TCGCGATCGC CATCCGGACC GCGCTGATGC GCGACGGGGT CGCCTACGTC
CAGGCCGGCG GGGGAGTCGT GGCCGACTCC AACGGGCCGT ACGAGTTCAA CGAGGCCACC
AATAAGGCCA AGGCGGTGCT GGCCGCCGTC GCCGCCGCCG AAACCCTGCG CGAACCATGA
 
Protein sequence
MQTTAASAFD SSRERSSLAT TTSREDFRAL AAEHRVVPVV RKVLADSETP LSAYRKLAAN 
RPGTFLLESA ENGRSWSRWS FIGAGAPSAL TVRDGEAVWL GVTPKDAPSG GDPLQALRST
LALLETAPLP GLPPLSSGLV GFFAYDMVRQ LERLPSLAVD DLGLPDMLLL LATDIAAVDH
HEGTITLIAN AVNWNGTDEN VDGAYDDAVA RLDVMTKALG QSLPSSVATF ARPAPTHRAQ
RTVEEYTAIV EKLVGDIEAG EAFQVVPSQR FEMDTVADPL DVYRMLRVTN PSPYMYLLNV
PDETGGLDFS VVGSSPEALV TVADGKATTH PIAGTRWRGD TEEEDLLLEK ELLADEKERA
EHLMLVDLGR NDLGRVCEPG TVRVEDYSHI ERYSHVMHLV STVTGRLAEG MTALDAVTAC
FPAGTLSGAP KVRAMELIEE VEKTRRGLYG GVLGYLDFAG NADFAIAIRT ALMRDGVAYV
QAGGGVVADS NGPYEFNEAT NKAKAVLAAV AAAETLREP