Gene Information |
Locus tag | EcSMS35_1869 |
Symbol | trpD |
ID | 6144331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1893028 |
End bp | 1894623 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641616745 |
Product | bifunctional glutamine amidotransferase/anthranilate phosphoribosyltransferase |
Protein accession | YP_001743923 |
Protein GI | 170679698 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0512] Anthranilate/para-aminobenzoate synthases component II; [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase; [TIGR01245] anthranilate phosphoribosyltransferase |
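The coordinate and length fields in this record can be cross-checked with a short sketch. The function names are hypothetical. The calculation assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed in the record.
    length = gene_length_bp(1893028, 1894623)
    print(length, protein_length_aa(length))  # prints: 1596 531
```

Under these assumptions the result agrees with the listed Gene Length (1596 bp) and Protein Length (531 aa).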
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000000144467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGCTGACA TTCTGCTGCT CGATAATATC GATTCTTTTA CTTACAACCT GGCAGATCAG TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA ATTGAACGCC TGGCGACGAT GAGCAATCCG GTGCTGATGC TCTCTCCTGG CCCCGGTGTG CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCC TGCGCGGCAA GCTGCCAATT ATTGGCATTT GCCTCGGTCA TCAGGCGATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT GCCGGATTAA CAAACCCGCT GCCAGTGGCG CGTTATCACT CGCTGGTTGG CAGTAATATT CCGGCAGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCGGT GCGCCATGAT GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTTACTAC CCAGGGCGCT CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAAC CAACACGCTG CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACCCTTA GTCAGCAGGA AAGCCACCAG CTATTTTCAG CGGTGGTGCG TGGCGAACTG AAACCGGAAC AACTGGCGGC GGCGCTGGTG AGCATGAAAA TTCGCGGCGA GCACCCGAAT GAAATCGCCG GGGCAGCAAC CGCGCTACTG GAAAACGCCG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CCGATATCGT CGGTACTGGC GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGC GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCCGG CTCGTCGGAT CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG GATGAGTTAG GCGTCTGTTT CCTCTTTGCG CCGAAATATC ACACCGGATT CCGCCATGCA ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTTCTGGG GCCATTAATT AACCCGGCGC ATCCTCCGCT GGCGTTAATT GGTGTCTACA GCCCGGAGCT GGTGCTGCCG ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AACTGCATGA CGGCGAAATT AAAAGCTATC AGCTCACCGC AGAAGACTTT GGCCTGACAC CCTACCACCA GGAGCAATTG GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC GACGCCGCCC ATGAAGCAGC CGTCGCGGCG AATGTCGCCA TGTTAATGCG CCTGCATGGC CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT TACGACAGAG TCACCGCACT GGCGGCACGA GGGTAA
Protein sequence | MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA RLLEQTLAWA QQKLEPTNTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G
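The sequences in this record are printed in space-separated blocks of ten characters. The sketch below shows one possible way to join the blocks and compute a GC fraction, for comparison with the GC content field. Both helper names are hypothetical and are not part of any database tooling.

```python
def normalize_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one upper-case string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (ambiguity codes count as non-GC)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in {"G", "C"})
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here only as a small example.
    fragment = normalize_sequence("ATGGCTGACA TTCTGCTGCT")
    print(len(fragment), gc_fraction(fragment))  # prints: 20 0.5
```

Passing the full Gene sequence field through the same two functions gives a whole-gene value that can be compared with the reported 56%.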