Gene GYMC61_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_0066 
Symbol 
ID8523850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp79152 
End bp80573 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content57% 
IMG OID 
ProductAnthranilate synthase 
Protein accessionYP_003251248 
Protein GI261417566 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGC GGCGCAGGCG ACTGAAACGA ACTATCGATT ATCGCGGACG GGATTGGTTC 
CGCCAGTACG AACAGCTGGC CTATAGCCGG CCTCATCATG TGTTGCTTGA GAGCGGCCAA
GGGGGAAGGT ACAGCATTAT CGGCCTTGAC CCGATCGGAG TGATCCGCGC TGACGAGCGG
CGGCTCATCA TCAAGCAGCG CGGGGTTGAA ACGGTGCTGG ATGGCTCGCC GCTCGAAGGG
CTCCGGCAAT GGCTTCGGTG TTTCGCCGTG CCGGATGAGG GGGAGTCGTT GCCTTGCCAA
GGCGGGCTGA TCGGTTTCAT TAGCTATGAT GCGGTTCGCT ATCTGGAACG GCTCCCGGTG
CTCGCGCAAG ATGATTTGCG GCTGCCGCTC ATGTATTTTT TCCTCTTTGA CGACGTAGCG
ATTTATGATC ACCAAACTGA ACAGCTTCAT TTGCTTGCCT ACGCGAATGA AGGGGAGGAA
AGCGAAGCAA ACCGGCGGCT TGCGCGGCAT GCACGGATGT GGCTCGAGGA TCGGAATGAA
GCGCTGGTCT GGCCGCTTGC TGCCTCGACA GCTGCGCCGT CTGTTTCAAT GACAAAAGAG
AGGTTTATGG ATGCGGTTCG CCGCGTGCAA CGCTATATTG CGGCGGGCGA TGTGTTTCAA
GTCAACTTAT CGGTGCGTCA GTCGCAGCCG CTGGTGACGC ATCCATTTGC CGTCTACAAG
CAGTTGCGGA TGCTGAATCC GTCCCCTTAC ATGGCGTATT TGCATACCCC GGAATTTCAA
GTCGTCAGCG GCTCGCCGGA GTTGCTCGTC CGCAAGCGGG GATGGCGTCT TGAGACGCGG
CCGATTGCCG GCACACGTTC GCGCGGTCGG ACGGCGGCAG AGGATGAACA AATTGCTCGC
AAGTTGCTTG CGAGTGAAAA GGAGCGGGCC GAGCACGCCA TGCTCGTTGA TCTTGAACGG
AATGACCTTG GGCGCGTCTG TGCATACGGG ACGGTTCGAG TTGACGAATG GATGACCGTC
GAAAAGTATT CTCATGTGAT GCATATCGTT TCTCACGTAT CCGGCACCAT GACGACGGAG
CACGATGCGT TTTCCGTCAT TCGCGCCATG TTTCCCGGCG GGACGATCAC CGGCGCTCCG
AAAGTGCGAA CAATGGAAAT TATTGAAGAG TTGGAACCGG TCCGCCGCGG CTTGTACACG
GGCTCGATCG GTTGGATCGA TTTTCAAGGA AACATGGAGC TAAACATCGC CATTCGAACG
ATGGTCGTCA AAGACGGATT GGCGCATGTA CAGGCAGGAG CGGGCATCGT CATCGATTCC
AACCCAGAGC ATGAGTACAA GGAATGTTTA AAGAAAGCGG CTGCCCTTTG GAAAGCGAAA
GAGCTGAGCG AAGCAGAGGC ATTATTTCCG AGCACGAGGT GA
 
Protein sequence
MEQRRRRLKR TIDYRGRDWF RQYEQLAYSR PHHVLLESGQ GGRYSIIGLD PIGVIRADER 
RLIIKQRGVE TVLDGSPLEG LRQWLRCFAV PDEGESLPCQ GGLIGFISYD AVRYLERLPV
LAQDDLRLPL MYFFLFDDVA IYDHQTEQLH LLAYANEGEE SEANRRLARH ARMWLEDRNE
ALVWPLAAST AAPSVSMTKE RFMDAVRRVQ RYIAAGDVFQ VNLSVRQSQP LVTHPFAVYK
QLRMLNPSPY MAYLHTPEFQ VVSGSPELLV RKRGWRLETR PIAGTRSRGR TAAEDEQIAR
KLLASEKERA EHAMLVDLER NDLGRVCAYG TVRVDEWMTV EKYSHVMHIV SHVSGTMTTE
HDAFSVIRAM FPGGTITGAP KVRTMEIIEE LEPVRRGLYT GSIGWIDFQG NMELNIAIRT
MVVKDGLAHV QAGAGIVIDS NPEHEYKECL KKAAALWKAK ELSEAEALFP STR