Gene B21_01247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01247 
SymboltrpD 
ID8112854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1304787 
End bp1306382 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content56% 
IMG OID644847498 
Producthypothetical protein 
Protein accessionYP_002999071 
Protein GI251784767 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0512] Anthranilate/para-aminobenzoate synthases component II
[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CGTACAACCT GGCAGATCAG 
TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA
ATTGAACGCC TGGCGACGAT GAGCAATCCG GTACTGATGC TTTCTCCTGG CCCCGGTGTG
CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCT TGCGTGGCAA GCTGCCCATT
ATTGGCATTT GCCTCGGACA TCAGGCAATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG
GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT
GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAACATT
CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCAGT ACGTCACGAT
GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTCACCAC CCAGGGCGCT
CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAGC CAACACGCTG
CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACGCTTA GCCAACAAGA AAGCCACCAG
CTGTTTTCAG CGGTGGTGCG TGGCGAGCTG AAGCCGGAAC AACTGGCGGC GGCGCTGGTG
AGCATGAAAA TTCGCGGTGA GCACCCGAAC GAGATCGCCG GAGCAGCAAC CGCGCTACTG
GAAAACGCCG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CTGATATCGT CGGTACTGGC
GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT
GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG TTCGTCCGAT
CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG
GATGAGTTAG GTGTATGTTT CCTCTTTGCG CCGAAGTATC ACACCGGATT CCGCCACGCG
ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT
AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG
ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG
ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AGCTGCATGA CGGCGAAATT
AAGAGCTATC AATTGACCGC TGAAGATTTT GGCCTGACTC CCTACCACCA GGAGCAACTG
GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC
GACGCCGCCC ATGAAGCAGC CGTCGCTGCG AACGTCGCCA TGTTAATGCG CCTGCATGGC
CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT
TACGACAGAG TTACCGCACT GGCGGCACGA GGGTAA
 
Protein sequence
MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV 
PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF
AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA
RLLEQTLAWA QQKLEPANTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV
SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC
GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA
MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG
MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG
DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G