Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B4053 |
Symbol | trpE |
ID | 7183741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 1193525 |
End bp | 1194943 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643549012 |
Product | anthranilate synthase component I |
Protein accession | YP_002444683 |
Protein GI | 218896272 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000211788 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 97 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGA AAGAAGAATT TATAAAACAA AAAAGAGAGA GAAAGACATT TTTAGTAATC ACTGAAGAAG AAGGAGATAG CATTACGCCA ATTTCTTTAT ATAGACGTAT GAAAGGGAAG AAGAAATTTT TATTAGAAAG TTCACAGCTT CATCAAGATA AAGGGCGTTA TTCTTACTTA GGATGTAATC CTTATGGAGA GGTGACAAGC GTTGGTACGG AAGTAGAAAG AATGATATAT GGGCAAACAG AAAAGTTAAA AGGTAACGTA CTACAAGTGT TAGAAGAAGT AATCGCATCA TCACAAGTAG ATAGCCCATT TCCATTTTGC GGAGGAGCAG TTGGTTATAT TGGATATGAT GTCATTCGGC AGTATGAAAA CATTGGAGCG GATTTACACG ACCCATTAAA TATTCCGGAA GTACACCTTT TACTATATCG TGAGTTTATC GTTTACGATC ATTTACGCCA AAAGTTGTCG TTTGTATATG TATGCAGGGA AGATGATTCA GCAGATTATG AGGAAGTATA CGAAAGGCTA CGAGTATACA AAGAGGAAGT GCTACAAGGA GAAGAAGCTG AAGTAAATGC AATACAATCC ACATTATCAT TCACTTCTTC TATAACGGAA AAAGAGTTTT GTGAGATGGT AGAAATGGCG AAAGAACATA TAAGAGCTGG GGACATATTC CAAGTCGTAC TGTCACAGCG TTTGCAAAGT GAATGTATTG GTGATCCATT CGCGTTATAC CGAAAACTTC GAATTGCAAA TCCATCACCA TATATGTTCT ATATCGATTT TCAAGATTAT GTTGTACTCG GTTCTTCACC GGAAAGTTTG CTATCAGTAA GGGAGAATAA AGTGATGACG AATCCAATTG CTGGTACGAG GCCGAGGGGG AAAACGAAGA GGGAAGATGA GGAAATCGAA AAAGAACTGT TGGAAGATGA GAAAGAACGA GCGGAGCATA TGATGCTTGT AGATCTTGGG CGAAATGATA TTGGCAGAGT GAGTGAAATT GGATCCGTGA CGATAGATAA ATATATGAAA GTAGAAAAAT ATTCTCACGT TATGCACATT GTATCTGAAG TTTACGGAAC ATTGCGAAAA CAAACGAGCG GATTTGATGC GTTAGCGTAT TGCTTACCAG CAGGGACAGT TTCTGGAGCT CCGAAAATTA GAGCGATGGA AATTATAAAT GAGCTAGAGA ATGAAAAAAG AAACGTATAC GCCGGAGCAG TTGGATACGT TAGTTTTTCA GGGAATCTTG ATATGGCACT TGCCATTCGA ACGATGGTCG TAAAGGATGA AAAAGCATAC GTTCAGGCCG GAGCAGGAGT CGTTTACGAT TCAGATCCAG TAGCTGAATA TGAAGAAACA TTAAATAAAG CGAGAGCGCT ATTGGAGGTA ATGAAATGA
|
Protein sequence | MMTKEEFIKQ KRERKTFLVI TEEEGDSITP ISLYRRMKGK KKFLLESSQL HQDKGRYSYL GCNPYGEVTS VGTEVERMIY GQTEKLKGNV LQVLEEVIAS SQVDSPFPFC GGAVGYIGYD VIRQYENIGA DLHDPLNIPE VHLLLYREFI VYDHLRQKLS FVYVCREDDS ADYEEVYERL RVYKEEVLQG EEAEVNAIQS TLSFTSSITE KEFCEMVEMA KEHIRAGDIF QVVLSQRLQS ECIGDPFALY RKLRIANPSP YMFYIDFQDY VVLGSSPESL LSVRENKVMT NPIAGTRPRG KTKREDEEIE KELLEDEKER AEHMMLVDLG RNDIGRVSEI GSVTIDKYMK VEKYSHVMHI VSEVYGTLRK QTSGFDALAY CLPAGTVSGA PKIRAMEIIN ELENEKRNVY AGAVGYVSFS GNLDMALAIR TMVVKDEKAY VQAGAGVVYD SDPVAEYEET LNKARALLEV MK
|
| |