Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01247 |
Symbol | trpD |
ID | 8112854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1304787 |
End bp | 1306382 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644847498 |
Product | hypothetical protein |
Protein accession | YP_002999071 |
Protein GI | 251784767 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0512] Anthranilate/para-aminobenzoate synthases component II [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CGTACAACCT GGCAGATCAG TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA ATTGAACGCC TGGCGACGAT GAGCAATCCG GTACTGATGC TTTCTCCTGG CCCCGGTGTG CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCT TGCGTGGCAA GCTGCCCATT ATTGGCATTT GCCTCGGACA TCAGGCAATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAACATT CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCAGT ACGTCACGAT GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTCACCAC CCAGGGCGCT CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAGC CAACACGCTG CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACGCTTA GCCAACAAGA AAGCCACCAG CTGTTTTCAG CGGTGGTGCG TGGCGAGCTG AAGCCGGAAC AACTGGCGGC GGCGCTGGTG AGCATGAAAA TTCGCGGTGA GCACCCGAAC GAGATCGCCG GAGCAGCAAC CGCGCTACTG GAAAACGCCG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CTGATATCGT CGGTACTGGC GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG TTCGTCCGAT CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG GATGAGTTAG GTGTATGTTT CCTCTTTGCG CCGAAGTATC ACACCGGATT CCGCCACGCG ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AGCTGCATGA CGGCGAAATT AAGAGCTATC AATTGACCGC TGAAGATTTT GGCCTGACTC CCTACCACCA GGAGCAACTG GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC GACGCCGCCC ATGAAGCAGC CGTCGCTGCG AACGTCGCCA TGTTAATGCG CCTGCATGGC CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT TACGACAGAG TTACCGCACT GGCGGCACGA GGGTAA
|
Protein sequence | MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA RLLEQTLAWA QQKLEPANTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G
|
| |