Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_1248 |
Symbol | trpE |
ID | 2816497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 1203177 |
End bp | 1204592 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637788192 |
Product | anthranilate synthase component I |
Protein accession | YP_017863 |
Protein GI | 47526514 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000139959 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAG AGGAATTTAT AAAACAGAAA GAACAAAGAA AAACATTTTT GGTAATCGCT GAAGAAGAAG GAGATAGCAT TACGCCAATT TCTTTATATA GACGTATGAA AGGTAAAAAG AAATTTTTAT TAGAAAGCTC ACAGCTTCAT CAAGATAAAG GGCGTTATTC TTACTTAGGT TGTAATCCAT ATGGTGAAGT GAAAAGCGTT GGTACGGAAG TGGAACGAAC GATTTACGGC AGGGCAGAAA AGTTGCAAAG TAACGTACTA CAAGTGTTAG AAGAAATAAT CGCACCATCA CAAGTAGACA GTCCATTTCC ATTTTGCGGA GGAGCAGTTG GATACATTGG CTATGACGTC ATTCGGCAAT ATGAAAACAT TGGAGCAGAT TTACATGATC CATTGAATAT TCCAGAAGTA CACCTTTTAC TGTACCGTGA GTTTATCGTG TATGACCACT TACGCCAAAA GTTGTCGTTT GTATATGTAT GCAGGGAAGA TGATTCAGCA GATTATGAAG AAGTATACGA AAGGCTGCGA GTATACAAAG AGGAAGTGCT ACAGGGAGAA GAAGCGGAAG TAACTGAAAT AAGATCAACA TTATCATTCA CTTCTTCTAT AACGGAAAGA GAATTTTGCG TGATGGTAGA AACGGCGAAA GAACACATCG GGGCCGGGGA CATATTTCAA GTTGTATTAT CTCAGCGTTT GCAAAGCGAA TGTATTGGAG ATCCATTCGC GTTATATCGA AAACTTCGAA TTGCCAATCC ATCACCATAT ATGTTTTATA TCGATTTTCA AGATTATGTT GTACTCGGTT CTTCGCCAGA AAGTTTGTTA TCAGTAAGGG AGGATAAAGT GATGACGAAT CCAATTGCTG GTACGAGGCC GAGAGGGAAA ACGAAGGAGG AAGATACGGA GATTGAAAAA GAACTGTTAG AAAATGAGAA AGAGCGAGCG GAGCATATGA TGCTTGTAGA TCTTGGGCGA AATGATATTG GTAGAGTGAG TGAAATCGGC TCAGTTACGA TAGATAAATA TATGAAAGTA GAAAAATATT CTCACGTTAT GCACATTGTA TCTGAAGTTT ACGGAACATT GCGAAAACAA ATGAGCGGAT TTGATGCATT AGCGTACTGT TTACCAGCGG GGACGGTATC AGGTGCTCCG AAAATTAGAG CGATGGAAAT TATAAATGAG CTAGAGAATG AAAAAAGAAA TGTGTACGCC GGTGCAGTTG GATACGTTAG TTTTTCAGGG AATCTTGATA TGGCGCTCGC CATTCGAACA ATGGTTGTAA AGGATGAAAA AGCATACGTT CAGGCAGGAG CGGGTATTGT TTACGATTCA GATCCAGTAG CTGAATATGA AGAAACATTA AATAAAGCGA GAGCGCTATT GGAGGTAATG AAATGA
|
Protein sequence | MTKEEFIKQK EQRKTFLVIA EEEGDSITPI SLYRRMKGKK KFLLESSQLH QDKGRYSYLG CNPYGEVKSV GTEVERTIYG RAEKLQSNVL QVLEEIIAPS QVDSPFPFCG GAVGYIGYDV IRQYENIGAD LHDPLNIPEV HLLLYREFIV YDHLRQKLSF VYVCREDDSA DYEEVYERLR VYKEEVLQGE EAEVTEIRST LSFTSSITER EFCVMVETAK EHIGAGDIFQ VVLSQRLQSE CIGDPFALYR KLRIANPSPY MFYIDFQDYV VLGSSPESLL SVREDKVMTN PIAGTRPRGK TKEEDTEIEK ELLENEKERA EHMMLVDLGR NDIGRVSEIG SVTIDKYMKV EKYSHVMHIV SEVYGTLRKQ MSGFDALAYC LPAGTVSGAP KIRAMEIINE LENEKRNVYA GAVGYVSFSG NLDMALAIRT MVVKDEKAYV QAGAGIVYDS DPVAEYEETL NKARALLEVM K
|
| |