Gene GWCH70_2142 details


Gene Information

Locus tag: GWCH70_2142
Symbol: trpD
ID: 7976952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2208306
End bp: 2209325
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 47%
IMG OID: 644798958
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_002950118
Protein GI: 239827494
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
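
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 2208306
END_BP = 2209325


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1020 339"
```

Both results match the Gene Length and Protein Length fields reported in the record.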


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAGC AACTTCTTGC CAAATGTATT GAGGGGTATA CGTTGACAGA AGAAGAAGCG 
TATGAGGCGA TGATGATGAT AATGTCTGGA GAAGCCTCTG CCAGCCAGAT TGCCAGTTTC
TTATCCATTT TGCGATTGCG CGGGGAAACG GTCGATGAAT TGACAGGATT GGTGAAAGCG
ATGCGCAACC GAATGATGAC GCTCGATTAT GAAGAAGAAG CCATTGATAC GTGCGGAACA
GGGGGAGACG GGGCATCGAC ATTCAACATT TCCACTGCGG CGGCGATCGT CGTATCATCA
CTTGGTGTCA AAGTGGCCAA ACATGGCAAC CGAGCGGTTT CCTCAAAAAG CGGAAGCGCG
GACGTATTAG AAGCGCTTCA TATTGATATT CAAGCAACCC CGGAGGAAGC GAAACGAGCG
CTAAAAACGA AAGGATTGGC TTTTTTGTTC GCGCCGCTAT ATCATTCCGC CATGAAATAT
GCTGCATTGC CGCGTAAAGA AATCGGGTTT CGCACTGTAT TCAATTTAAT TGGACCACTT
TCGAATCCAG CACGGTGCAA GCGGCAAGTA ATCGGTGTTT ATTCGACACA ATACGCGGAA
AAACTTGCGG AAACTCTTCA CCGACTTGGT TCGGAACACG TTTTATTGGT AACCGGAAAA
GACGGACTTG ATGAATGCAG CATTTCGGCG GAAACAGATG TAGTCGAACT GAAACATGGT
GAAATTCGCC GTTTCACGAT CGCGCCGGAA CAATATGGGC TCGCGCGTGG AAAGTTAGAA
CACGTTCAAG TTCGTACGGT TCAACAAAGT GCTGAACTAT TAAAGGCAGT ATTGGAAGGA
AGAGCAAACG AAAGCGCGAT CAATATCGTC ATTCTTAATG CTGGCGTTGC GTTATATGCA
GCGGGGAAAG CAGCGACGAT TCGCGAAGGG GTCGAAATGG CAAAAGAAGC GATGATGACA
AAGAAAGCCT ATGAACAATT TGAGCGACTG CGCATGAAAG AGGTAGAAAA GTATGCTTGA
 
Protein sequence
MFKQLLAKCI EGYTLTEEEA YEAMMMIMSG EASASQIASF LSILRLRGET VDELTGLVKA 
MRNRMMTLDY EEEAIDTCGT GGDGASTFNI STAAAIVVSS LGVKVAKHGN RAVSSKSGSA
DVLEALHIDI QATPEEAKRA LKTKGLAFLF APLYHSAMKY AALPRKEIGF RTVFNLIGPL
SNPARCKRQV IGVYSTQYAE KLAETLHRLG SEHVLLVTGK DGLDECSISA ETDVVELKHG
EIRRFTIAPE QYGLARGKLE HVQVRTVQQS AELLKAVLEG RANESAINIV ILNAGVALYA
AGKAATIREG VEMAKEAMMT KKAYEQFERL RMKEVEKYA
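
The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is an illustrative helper and not the database's own tooling: `clean`, `translate`, and `gc_content` are hypothetical names, and the codon string is the standard genetic code, whose codon assignments translation table 11 shares (table 11 differs in which start codons it permits, which this sketch does not model).

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence above.
print(translate("ATGTTTAAGC AACTTCTTGC C"))  # prints "MFKQLLA"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the reported GC content.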