Gene Mjls_3069 details


Gene Information

Locus tag: Mjls_3069
Symbol:
ID: 4878782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3209930
End bp: 3211489
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 70%
IMG OID: 640140369
Product: anthranilate synthase component I
Protein accession: YP_001071339
Protein GI: 126435648
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
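
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated 1560 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(3209930, 3211489)
    print(length)                  # 1560
    print(protein_length(length))  # 519
```

Both values match the Gene Length and Protein Length fields of this record.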


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0340194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.379754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGACGA CCGCCGCTTC CGCCTTCGAC TCCTCGCGCG AGCGTTCGTC GCTGGCCACG 
ACCACGTCTC GCGAGGACTT CCGGGCACTG GCAGCCGAGC ACCGCGTGGT GCCGGTGGTC
CGCAAGGTGC TCGCCGACAG CGAGACCCCG CTGTCGGCGT ACCGCAAGCT CGCCGCCAAC
CGGCCCGGCA CGTTCCTGCT CGAATCGGCC GAGAACGGCA GGTCGTGGTC GCGGTGGTCG
TTCATCGGGG CGGGCGCACC GTCGGCGCTG ACAGTTCGCG ACGGCGAGGC GGTGTGGTTG
GGCGTGACGC CGAAGGATGC GCCGAGCGGT GGTGATCCGC TGCAGGCACT GCGGTCCACG
CTGGCGCTGC TGGAGACCGC GCCGCTGCCG GGCCTGCCGC CGCTGTCGAG CGGTCTGGTC
GGGTTCTTCG CCTATGACAT GGTGCGGCGG CTGGAGCGGC TGCCGTCGCT GGCCGTCGAC
GATCTCGGAC TGCCCGACAT GCTGCTGCTG TTGGCCACCG ACATCGCCGC CGTCGACCAC
CACGAGGGCA CCATCACGCT GATCGCCAAC GCGGTGAACT GGAACGGCAC CGACGAGAAC
GTGGACGGCG CGTATGACGA CGCAGTCGCC CGGCTCGACG TGATGACCAA GGCGCTGGGG
CAGTCGCTGC CCTCGTCGGT GGCCACGTTC GCCCGGCCGG CCCCGACGCA CCGGGCGCAG
CGCACCGTCG AGGAGTACAC CGCGATCGTC GAGAAGCTCG TCGGCGACAT CGAGGCCGGT
GAGGCGTTCC AGGTGGTGCC GTCGCAACGC TTCGAGATGG ACACCGTCGC CGATCCGCTC
GATGTGTACC GGATGCTGCG GGTCACCAAC CCCAGCCCGT ACATGTACCT GCTGAACGTG
CCGGATGAGA CTGGGGGACT GGACTTCTCG GTGGTCGGGT CGAGTCCGGA GGCGCTGGTG
ACCGTCGCCG ACGGGAAGGC CACGACGCAC CCGATCGCCG GCACCCGCTG GCGCGGCGAC
ACCGAGGAAG AGGACCTGCT GCTCGAGAAG GAGCTGCTGG CCGACGAGAA GGAACGCGCC
GAACACCTGA TGCTGGTGGA CCTGGGCCGT AACGATCTGG GCCGGGTGTG TGAACCCGGC
ACCGTGCGGG TCGAGGACTA CAGCCACATC GAGCGGTACA GCCACGTCAT GCACCTGGTG
TCGACGGTCA CCGGACGTCT CGCCGAGGGC ATGACCGCGC TCGACGCGGT GACGGCCTGT
TTCCCGGCGG GCACGCTGTC GGGCGCCCCG AAGGTGCGGG CCATGGAGCT CATCGAGGAG
GTCGAGAAGA CCCGCCGCGG GCTCTACGGC GGGGTGCTGG GCTACCTCGA CTTCGCGGGC
AACGCCGATT TCGCGATCGC CATCCGGACC GCGCTGATGC GCGACGGGGT CGCCTACGTC
CAGGCCGGCG GGGGAGTCGT GGCCGACTCC AACGGGCCGT ACGAGTTCAA CGAGGCCACC
AACAAGGCCA AGGCGGTGCT GGCCGCCGTC GCCGCCGCCG AAACCCTGCG CGAACCATGA
 
Protein sequence
MQTTAASAFD SSRERSSLAT TTSREDFRAL AAEHRVVPVV RKVLADSETP LSAYRKLAAN 
RPGTFLLESA ENGRSWSRWS FIGAGAPSAL TVRDGEAVWL GVTPKDAPSG GDPLQALRST
LALLETAPLP GLPPLSSGLV GFFAYDMVRR LERLPSLAVD DLGLPDMLLL LATDIAAVDH
HEGTITLIAN AVNWNGTDEN VDGAYDDAVA RLDVMTKALG QSLPSSVATF ARPAPTHRAQ
RTVEEYTAIV EKLVGDIEAG EAFQVVPSQR FEMDTVADPL DVYRMLRVTN PSPYMYLLNV
PDETGGLDFS VVGSSPEALV TVADGKATTH PIAGTRWRGD TEEEDLLLEK ELLADEKERA
EHLMLVDLGR NDLGRVCEPG TVRVEDYSHI ERYSHVMHLV STVTGRLAEG MTALDAVTAC
FPAGTLSGAP KVRAMELIEE VEKTRRGLYG GVLGYLDFAG NADFAIAIRT ALMRDGVAYV
QAGGGVVADS NGPYEFNEAT NKAKAVLAAV AAAETLREP
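
The protein sequence follows from the gene sequence under translation table 11, which uses the standard codon assignments but allows alternative initiation codons such as GTG to be read as methionine at the start of a CDS. The sketch below is a minimal illustration of that relationship, not the pipeline that produced this record. The name `translate_cds`, its `is_cds_start` parameter, and the reduced start-codon set (ATG, GTG, TTG) are assumptions made for the example.

```python
# Standard codon assignments (shared by translation table 11), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplified set of initiation codons (assumption: table 11 permits more than these).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    If is_cds_start is True and the first codon is an initiation codon,
    it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGCAGACGA CCGCCGCTTC CGCCTTCGAC"))  # MQTTAASAFD
```

Applied to the first 30 bases of the gene sequence, this yields MQTTAASAFD, the first ten residues of the protein sequence shown above. The sketch raises a KeyError on ambiguous bases such as N, which a production tool would need to handle.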