Gene Information |
Locus tag | GYMC61_0476 |
Symbol | |
ID | 8524282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 478646 |
End bp | 480172 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | anthranilate synthase component I |
Protein accession | YP_003251640 |
Protein GI | 261417958 |
COG category | |
COG ID | |
TIGRFAM ID | |
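The length fields in this record follow from its coordinates. A minimal sketch of that arithmetic is below, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 478646
end_bp = 480172

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1527
print(protein_length)  # 508
```

Both results agree with the Gene Length and Protein Length rows.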
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTGCTG ATGGGCTGGC CGCTTTTTTG GCGGAAGCGA ACGAATTTCG AACCATCCCG ATTGTGCGCA AATTTGTAGC CGATGTCATT GAGCCGCTCG GCGTGTTTGC CAATTTGCGC GAGGAAGCCG TGTTTTTGCT TGAAAGCAAG GATGACGAAT CGCCGTGGGC GCGCTATTCG TTCATCGGTG TGGCGCCGTT TTTGACGCTG GAGAGCGAAA CCGGCGAAAC GTTTTCGGTG AAAGACGAAA ACGGGAACGA ACAAATCACC GCACCGACGC TGAAAGAAGC GTTTCAATGG GTCGAGCGGA CGCTTGCCGT CAAGCCGCTT GCCGAGACGG TGCCGTTTAC AGGCGGTGCA GTTGGGTTTT TAGGCTACGA TTTCATTTCC GCCATCGAAA AAGTGCCGCG CCACAAAAAC CGCGATGTGC CTATGAAGAC GGCCTATTTC GTGTTTTGTG AATCGCTGTT TGCGTTTGAC CAAAAAAAGC GTGAGTTGCT CGTCATTCAC TATATTCGGT TGAGCGGCAA CGAAACGGAG GAAGAGAAAA TCGAAGCGTA CCGCGCGGCC GAACGGCGCA TGGCCGATCT CGCGGCGAAA GCGGCCCGCC CTCAAGCCGA GCAGCCGCTC TTGCCGGCGG AAAGCGAATC GGGGCGGACC GCCTCGTTTG CCAAAGCGGT GTCCAACTAT GACAAAAAAC AGTTTTTGCG CGATGTGGAG GCCGTGAAAC GGTACATCGC GGCCGGCGAT GTGTTTCAAG CGGTTCTGTC GCAGCGCTTC TGCGTGCCGG TTCAAGCCGG AGGCTTTGCC ATTTATCGGC TGCTCCGATA CATCAACCCG TCGCCGTACA TGTTTTATTT CCAGCTTGAC GGTGTGGAAA TTGTCGGCAG TTCGCCGGAA AAACTGATTC AAGTGCACCG TCGGCGCGTT GAAATCGATC CGATTGCCGG CACACGGCGG CGCGGCCGAT CGCCGGAGGA AGACGAGAGG CTGGCTGATG AGCTGTACCA CGACCCGAAA GAACGGGCTG AGCATTATAT GCTTGTCGAT TTGGCGCGCA ACGACATCGG CCGGGTCGCC AAGTATGGAA CGGTCGAGGT GCCGGTGTTG CTTCAGATCG GCAAATTTTC ACACGTGATG CACTTAATTT CCAAAGTCGT CGGTGAACTG GACGACAACG TCCACCCGAT CGATGCACTC CTTGCCGCCT TTCCGGCTGG AACGGTGAGC GGGGCGCCGA AAGTGCGGGC GATGCAAATT TTGCAGGAAC TCGAGCCGAC AGCAAGGGGG CTGTATGCCG GAGCGATTGC CTATATCGGG TTTGATGGCA ACATCGATTC GTGCATCGCC ATCCGGACAG CGGTCGTGAA AGACGGCTAT GCCTATGTCC AAGCCGGCGC CGGCATTGTC GCCGACTCCG TTCCGGAACT GGAGTGGAAA GAGACGCGCA ATAAGGCGAG CGCCTTGATG AATGCGATTG AACAAGCGGA ACGACTATTT GCTAAAGGGG AGAGGGCCGT ATGTTGA
Protein sequence | MSADGLAAFL AEANEFRTIP IVRKFVADVI EPLGVFANLR EEAVFLLESK DDESPWARYS FIGVAPFLTL ESETGETFSV KDENGNEQIT APTLKEAFQW VERTLAVKPL AETVPFTGGA VGFLGYDFIS AIEKVPRHKN RDVPMKTAYF VFCESLFAFD QKKRELLVIH YIRLSGNETE EEKIEAYRAA ERRMADLAAK AARPQAEQPL LPAESESGRT ASFAKAVSNY DKKQFLRDVE AVKRYIAAGD VFQAVLSQRF CVPVQAGGFA IYRLLRYINP SPYMFYFQLD GVEIVGSSPE KLIQVHRRRV EIDPIAGTRR RGRSPEEDER LADELYHDPK ERAEHYMLVD LARNDIGRVA KYGTVEVPVL LQIGKFSHVM HLISKVVGEL DDNVHPIDAL LAAFPAGTVS GAPKVRAMQI LQELEPTARG LYAGAIAYIG FDGNIDSCIA IRTAVVKDGY AYVQAGAGIV ADSVPELEWK ETRNKASALM NAIEQAERLF AKGERAVC
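The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It assumes the gene sequence is given 5' to 3' in the coding frame (the record lists the strand as +), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper name `translate` is hypothetical, and only the first 30 bases are used as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code;
# assumed here to match table 11 for amino acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGTCTGCTG ATGGGCTGGC CGCTTTTTTG"))  # MSADGLAAFL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, with the trailing TGA codon read as the stop.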