Gene BCG9842_B4053 details

Gene Information

Locus tag: BCG9842_B4053
Symbol: trpE
ID: 7183741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 1193525
End bp: 1194943
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 39%
IMG OID: 643549012
Product: anthranilate synthase component I
Protein accession: YP_002444683
Protein GI: 218896272
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
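
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS is complete with a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 1193525
END_BP = 1194943


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon.

    Assumes the CDS is unspliced and ends with exactly one stop codon.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Under those assumptions both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.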


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000211788
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACGA AAGAAGAATT TATAAAACAA AAAAGAGAGA GAAAGACATT TTTAGTAATC 
ACTGAAGAAG AAGGAGATAG CATTACGCCA ATTTCTTTAT ATAGACGTAT GAAAGGGAAG
AAGAAATTTT TATTAGAAAG TTCACAGCTT CATCAAGATA AAGGGCGTTA TTCTTACTTA
GGATGTAATC CTTATGGAGA GGTGACAAGC GTTGGTACGG AAGTAGAAAG AATGATATAT
GGGCAAACAG AAAAGTTAAA AGGTAACGTA CTACAAGTGT TAGAAGAAGT AATCGCATCA
TCACAAGTAG ATAGCCCATT TCCATTTTGC GGAGGAGCAG TTGGTTATAT TGGATATGAT
GTCATTCGGC AGTATGAAAA CATTGGAGCG GATTTACACG ACCCATTAAA TATTCCGGAA
GTACACCTTT TACTATATCG TGAGTTTATC GTTTACGATC ATTTACGCCA AAAGTTGTCG
TTTGTATATG TATGCAGGGA AGATGATTCA GCAGATTATG AGGAAGTATA CGAAAGGCTA
CGAGTATACA AAGAGGAAGT GCTACAAGGA GAAGAAGCTG AAGTAAATGC AATACAATCC
ACATTATCAT TCACTTCTTC TATAACGGAA AAAGAGTTTT GTGAGATGGT AGAAATGGCG
AAAGAACATA TAAGAGCTGG GGACATATTC CAAGTCGTAC TGTCACAGCG TTTGCAAAGT
GAATGTATTG GTGATCCATT CGCGTTATAC CGAAAACTTC GAATTGCAAA TCCATCACCA
TATATGTTCT ATATCGATTT TCAAGATTAT GTTGTACTCG GTTCTTCACC GGAAAGTTTG
CTATCAGTAA GGGAGAATAA AGTGATGACG AATCCAATTG CTGGTACGAG GCCGAGGGGG
AAAACGAAGA GGGAAGATGA GGAAATCGAA AAAGAACTGT TGGAAGATGA GAAAGAACGA
GCGGAGCATA TGATGCTTGT AGATCTTGGG CGAAATGATA TTGGCAGAGT GAGTGAAATT
GGATCCGTGA CGATAGATAA ATATATGAAA GTAGAAAAAT ATTCTCACGT TATGCACATT
GTATCTGAAG TTTACGGAAC ATTGCGAAAA CAAACGAGCG GATTTGATGC GTTAGCGTAT
TGCTTACCAG CAGGGACAGT TTCTGGAGCT CCGAAAATTA GAGCGATGGA AATTATAAAT
GAGCTAGAGA ATGAAAAAAG AAACGTATAC GCCGGAGCAG TTGGATACGT TAGTTTTTCA
GGGAATCTTG ATATGGCACT TGCCATTCGA ACGATGGTCG TAAAGGATGA AAAAGCATAC
GTTCAGGCCG GAGCAGGAGT CGTTTACGAT TCAGATCCAG TAGCTGAATA TGAAGAAACA
TTAAATAAAG CGAGAGCGCT ATTGGAGGTA ATGAAATGA
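
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a rough sketch, under the assumption that the sequence contains only unambiguous bases (A, C, G, T), of how the GC content field could be recomputed from the pasted text. `clean_sequence` and `gc_content` are hypothetical helper names, not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only to show the cleaning step.
first_line = "ATGATGACGA AAGAAGAATT TATAAAACAA AAAAGAGAGA GAAAGACATT TTTAGTAATC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported in the Gene Information section.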
 
Protein sequence
MMTKEEFIKQ KRERKTFLVI TEEEGDSITP ISLYRRMKGK KKFLLESSQL HQDKGRYSYL 
GCNPYGEVTS VGTEVERMIY GQTEKLKGNV LQVLEEVIAS SQVDSPFPFC GGAVGYIGYD
VIRQYENIGA DLHDPLNIPE VHLLLYREFI VYDHLRQKLS FVYVCREDDS ADYEEVYERL
RVYKEEVLQG EEAEVNAIQS TLSFTSSITE KEFCEMVEMA KEHIRAGDIF QVVLSQRLQS
ECIGDPFALY RKLRIANPSP YMFYIDFQDY VVLGSSPESL LSVRENKVMT NPIAGTRPRG
KTKREDEEIE KELLEDEKER AEHMMLVDLG RNDIGRVSEI GSVTIDKYMK VEKYSHVMHI
VSEVYGTLRK QTSGFDALAY CLPAGTVSGA PKIRAMEIIN ELENEKRNVY AGAVGYVSFS
GNLDMALAIR TMVVKDEKAY VQAGAGVVYD SDPVAEYEET LNKARALLEV MK