Gene Information |
Locus tag | Mmcs_3053 |
Symbol | |
ID | 4111885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 3230305 |
End bp | 3231864 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638032183 |
Product | anthranilate synthase component I |
Protein accession | YP_640216 |
Protein GI | 108800019 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0820885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGACGA CCGCCGCTTC CGCCTTCGAC TCCTCGCGCG AGCGTTCGTC GCTGGCCACG ACCACGTCTC GCGAGGACTT CCGGGCACTG GCAGCCGAGC ACCGCGTGGT GCCGGTGGTC CGCAAGGTGC TCGCCGACAG CGAGACCCCG CTGTCGGCGT ACCGCAAGCT CGCCGCCAAC CGGCCCGGCA CGTTCCTGCT CGAATCGGCC GAGAACGGCA GGTCGTGGTC GCGGTGGTCG TTCATCGGGG CGGGCGCACC GTCGGCGCTG ACGGTCCGCG ACGGCGAGGC GGTGTGGTTG GGCGTGACGC CGAAGGATGC GCCGAGCGGT GGTGATCCAC TGCAGGCACT GCGGTCCACG CTGGCGCTGC TGGAGACCGC GCCGCTGCCG GGCCTGCCGC CGCTGTCGAG CGGTCTGGTC GGGTTCTTCG CCTATGACAT GGTGCGGCAG CTGGAGCGGC TGCCGTCGCT GGCCGTCGAC GATCTCGGAC TGCCCGACAT GCTGCTGCTG TTGGCCACCG ACATCGCCGC CGTCGACCAC CACGAGGGCA CCATCACGCT GATCGCCAAC GCGGTGAACT GGAACGGCAC CGACGAGAAC GTGGACGGCG CGTATGACGA CGCCGTCGCC CGGCTCGACG TGATGACCAA GGCGCTGGGG CAGTCGCTGC CCTCGTCGGT GGCCACGTTC GCCCGGCCGG CCCCGACGCA CCGGGCGCAG CGCACCGTCG AGGAGTACAC CGCGATCGTC GAGAAGCTCG TCGGCGACAT CGAGGCCGGT GAGGCGTTCC AGGTGGTGCC GTCGCAACGC TTCGAGATGG ACACCGTCGC CGATCCGCTC GATGTGTACC GGATGCTGCG GGTCACCAAT CCCAGTCCGT ACATGTATCT GCTGAACGTG CCGGATGAGA CTGGGGGACT GGACTTCTCG GTGGTCGGGT CGAGTCCGGA GGCGCTGGTG ACCGTCGCCG ACGGGAAGGC CACGACGCAC CCGATCGCCG GCACCCGCTG GCGCGGCGAC ACCGAGGAAG AGGACCTGCT GCTCGAGAAG GAGCTGCTGG CCGACGAGAA GGAACGCGCC GAACACCTGA TGCTGGTGGA CCTGGGCCGT AACGATCTGG GCCGGGTGTG TGAACCCGGC ACCGTGCGGG TCGAGGACTA CAGCCACATC GAGCGGTACA GCCACGTCAT GCACCTGGTG TCGACGGTCA CCGGACGTCT CGCCGAGGGC ATGACCGCGC TCGACGCGGT GACGGCCTGT TTCCCGGCGG GCACGCTGTC GGGCGCCCCG AAGGTGCGGG CCATGGAGCT CATCGAGGAG GTCGAGAAGA CCCGCCGCGG GCTCTACGGC GGGGTGCTGG GCTACCTCGA CTTCGCGGGC AACGCCGATT TCGCGATCGC CATCCGGACC GCGCTGATGC GCGACGGGGT CGCCTACGTC CAGGCCGGCG GGGGAGTCGT GGCCGACTCC AACGGGCCGT ACGAGTTCAA CGAGGCCACC AATAAGGCCA AGGCGGTGCT GGCCGCCGTC GCCGCCGCCG AAACCCTGCG CGAACCATGA
|
Protein sequence | MQTTAASAFD SSRERSSLAT TTSREDFRAL AAEHRVVPVV RKVLADSETP LSAYRKLAAN RPGTFLLESA ENGRSWSRWS FIGAGAPSAL TVRDGEAVWL GVTPKDAPSG GDPLQALRST LALLETAPLP GLPPLSSGLV GFFAYDMVRQ LERLPSLAVD DLGLPDMLLL LATDIAAVDH HEGTITLIAN AVNWNGTDEN VDGAYDDAVA RLDVMTKALG QSLPSSVATF ARPAPTHRAQ RTVEEYTAIV EKLVGDIEAG EAFQVVPSQR FEMDTVADPL DVYRMLRVTN PSPYMYLLNV PDETGGLDFS VVGSSPEALV TVADGKATTH PIAGTRWRGD TEEEDLLLEK ELLADEKERA EHLMLVDLGR NDLGRVCEPG TVRVEDYSHI ERYSHVMHLV STVTGRLAEG MTALDAVTAC FPAGTLSGAP KVRAMELIEE VEKTRRGLYG GVLGYLDFAG NADFAIAIRT ALMRDGVAYV QAGGGVVADS NGPYEFNEAT NKAKAVLAAV AAAETLREP
|
| |
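The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an uncounted stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    n = gene_length(3230305, 3231864)
    print(n, protein_length(n))  # prints "1560 519"
```

The two printed values agree with the Gene Length and Protein Length fields above. Passing the full Gene sequence string to gc_percent gives a value to compare against the GC content field.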