Gene Information |
Locus tag | ECH74115_1895 |
Symbol | trpD |
ID | 6966745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1787823 |
End bp | 1789418 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385829 |
Product | bifunctional glutamine amidotransferase/anthranilate phosphoribosyltransferase |
Protein accession | YP_002270318 |
Protein GI | 209397663 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0512] Anthranilate/para-aminobenzoate synthases component II [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
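The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 1787823
end_bp = 1789418
reported_gene_length = 1596      # bp
reported_protein_length = 531    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which does not
# contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is an exact multiple of three.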
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.000854588 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CTTACAACCT GGCAGATCAG TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA ATTGAACGCC TTGCGACGAT GAGCAATCCG GTGCTGATGC TTTCTCCTGG ACCCGGTGTG CCGAGCGAAG CTGGTTGTAT GCCTGAACTG CTTACCCGCC TGCGCGGTAA GCTGCCTATT ATTGGCATTT GCCTTGGTCA TCAGGCGATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAATATT CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCGGT GCGTCACGAT GCAGATCGCG TTTGCGGATT CCAGTTCCAC CCGGAATCCA TTCTCACCAC CCAGGGCGCT CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAAC CAACACGCTG CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACCCTTA GCCAGCAGGA AAGTCATCAG CTATTTTCAG CGGTGGTACG TGGTGAACTG AAACCGGAAC ACCTGGCGGC GGCGCTGGTG AGCATGAAAA TTCGCGGCGA GCACCCGAAC GAGATCGCCG GAGCAGCAAC CGCGCTACTG GAAAACGCCG CGCCGTTCCC GCGTCCGGAT TATCTGTTTG CCGATATCGT CGGCACCGGC GGTGACGGCA GCAACAGCAT CAATATTTCC ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG CTCGTCGGAT CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG GATGAGTTAG GTGTATGTTT CCTCTTTGCA CCGAAATATC ACACCGGATT TCGCCATGCA ATGCCGGTTC GCCAGCAACT AAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG ATTGCCGAAA CCTTACGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AGCTGCATGA CGGCGAAATT AAGAGCTATC AATTGACCGC TGAAGATTTT GGCCTGACTC CCTACCACCA GGAGCAACTG GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC GACGCCGCCC ATGAAGCAGC CGTCGCGGCG AATGTCGCCA TGTTAATGCG CCTGCATGGC CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT TACGACAGAG TCACCGCACT GGCGGCACGA GGGTAA
|
Protein sequence | MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA RLLEQTLAWA QQKLEPTNTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEHLAAALV SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G
|
| |
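The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short translation sketch. This is a hedged example, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one. To the best of my knowledge, translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the set of permitted start codons, so the standard table is used here as a stand-in. Only the first 30 and last 6 nucleotides of the gene sequence are embedded, and the sequence is taken as shown (already in coding orientation, beginning with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record into 10-letter groups."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 6 nucleotides, copied from the Gene sequence row.
first_30 = "ATGGCTGACA TTCTGCTGCT CGATAATATC"
last_6 = "GGGTAA"

print(translate(first_30))  # MADILLLDNI
print(translate(last_6))    # G*
```

The first ten residues match the start of the Protein sequence row, and the final codon TAA is a stop following the terminal glycine. Applying `gc_fraction` to the full gene sequence is one way to compare against the reported GC content field.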