Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2143 |
Symbol | |
ID | 7976953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2209318 |
End bp | 2210844 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798959 |
Product | anthranilate synthase component I |
Protein accession | YP_002950119 |
Protein GI | 239827495 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.105763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT GCACAATCAC CACTTTTTTG GAAGATGCGG TTCATTTCCA GACCATTCCG ATTGTACGCC GCTTTTTTGC AGATGTGTTT GAACCAGTGA AAATATTTGA AAATTTAAAA AGTGAAGCGG TGTTTTTGCT AGAAAGCAAA GATGATCAAT CGCCATGGGC CCGCTATTCC TTTATCGGTT TATCGCCGTT TTTAACGATT GAGAGTGAAA CCGGCCATAC GTTTTCTGTG ATGGATGAGC GCGGCGAAGA AATGATGAAA GCTTCTTCGT TAAAAGAAGC ATTTCTGTTC ATTGAGCAGA AGCTGCGCGT CAAGCAGCTT GCAAGCGAGA TACCATTTAC CGGCGGCGCG GTCGGTTTTT TAGGATATGA TTTCATTTCC GCAATCGAAA AAGTCCCAAT TCATTCCAAT CGCGATATTT CCTTGAAGAC GGGTTATTTT ACGTTTTGTG AATCGTTGAT CGTGTTTGAT CATCATAAGC GTGAGATTGT GTTTATTCAT TATGTCCGCA TATCCGACAA GGACACAGAA GAAGCGAAGA AGAGAAAGTA CGGGGAAGGG TTAAAGCGGA TCGAAACATT GATGAAAAAA GCAGCAAGCG GAAGGGAAGA GCCATTGTTA TTGCTTCATC ACGATGACGA AGAAACGCGT GTTTCCTTTC AAGGTGTCAT ATCCAATTAT GACAAAGCAT CATTTATGAG GGATGTAGAA ACCATTAAAA GCTATATCGC CAACGGAGAT GTTTTTCAGG CGGTATTGTC CCAGCGATTT ACCGTTCCGA TTCAAGTGAG CGGGTTTCAC ATTTATCGAA TGCTGCGTCA TATTAATCCG TCGCCATATA TGTTTTATTT TCAGCTAGAC GGTATCGAAA TTGTCGGAAG CTCTCCGGAA AAGTTAATTC AAGTGCATAA CCGTCATATG GAAATCCACC CGATCGCCGG GACTAGAAGA AGAGGACGGT CGGCAGAAGA AGATGAGCAT CTTCAAAGGG AGCTTTACAA TGACCCGAAA GAGAGAGCCG AACATTATAT GTTAGTAGAT TTAGCCCGCA ACGATATCGG AAAAGTAGCG AAATATGGTA CAGTCGAGAC GCCAGTGTTA ATGGAAATTG GAAAGTTCTC CCATGTCATG CATCTGATTT CGAAAGTAAC GGGTGTGTTA AAGGAAGGAA TTCACCCGAT TGACGCGCTG TTAGCGGCCT TTCCAGCCGG AACGGTAAGC GGTGCGCCGA AAGTAAGGGC GATGCAAATT TTACAGGAGC TAGAACCGAC GGCAAGAAAT TTATATGCTG GAACGATTGC CTACATCGGT TTTGACGGCA ATATTGATTC ATGTATCGCG ATTCGCACTG CGATTGTAAA AGACGGTTAT GCTTACGTGC AAGCAGGCGC GGGAATTGTC GCGGATTCCG TTCCAGAATT GGAGTGGAAA GAAACGCGCA ATAAAGCGAG CGCCTTAATC AAAGCGATGG AACGTGCCGA ACGATTGTTT GCGAAAGGAG AGAATATATA TGTTTAA
|
Protein sequence | MKNCTITTFL EDAVHFQTIP IVRRFFADVF EPVKIFENLK SEAVFLLESK DDQSPWARYS FIGLSPFLTI ESETGHTFSV MDERGEEMMK ASSLKEAFLF IEQKLRVKQL ASEIPFTGGA VGFLGYDFIS AIEKVPIHSN RDISLKTGYF TFCESLIVFD HHKREIVFIH YVRISDKDTE EAKKRKYGEG LKRIETLMKK AASGREEPLL LLHHDDEETR VSFQGVISNY DKASFMRDVE TIKSYIANGD VFQAVLSQRF TVPIQVSGFH IYRMLRHINP SPYMFYFQLD GIEIVGSSPE KLIQVHNRHM EIHPIAGTRR RGRSAEEDEH LQRELYNDPK ERAEHYMLVD LARNDIGKVA KYGTVETPVL MEIGKFSHVM HLISKVTGVL KEGIHPIDAL LAAFPAGTVS GAPKVRAMQI LQELEPTARN LYAGTIAYIG FDGNIDSCIA IRTAIVKDGY AYVQAGAGIV ADSVPELEWK ETRNKASALI KAMERAERLF AKGENIYV
|
| |