Gene ECH74115_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1895 
SymboltrpD 
ID6966745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1787823 
End bp1789418 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content56% 
IMG OID643385829 
Productbifunctional glutamine amidotransferase/anthranilate phosphoribosyltransferase 
Protein accessionYP_002270318 
Protein GI209397663 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0512] Anthranilate/para-aminobenzoate synthases component II
[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000854588 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CTTACAACCT GGCAGATCAG 
TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA
ATTGAACGCC TTGCGACGAT GAGCAATCCG GTGCTGATGC TTTCTCCTGG ACCCGGTGTG
CCGAGCGAAG CTGGTTGTAT GCCTGAACTG CTTACCCGCC TGCGCGGTAA GCTGCCTATT
ATTGGCATTT GCCTTGGTCA TCAGGCGATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG
GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT
GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAATATT
CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCGGT GCGTCACGAT
GCAGATCGCG TTTGCGGATT CCAGTTCCAC CCGGAATCCA TTCTCACCAC CCAGGGCGCT
CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAAC CAACACGCTG
CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACCCTTA GCCAGCAGGA AAGTCATCAG
CTATTTTCAG CGGTGGTACG TGGTGAACTG AAACCGGAAC ACCTGGCGGC GGCGCTGGTG
AGCATGAAAA TTCGCGGCGA GCACCCGAAC GAGATCGCCG GAGCAGCAAC CGCGCTACTG
GAAAACGCCG CGCCGTTCCC GCGTCCGGAT TATCTGTTTG CCGATATCGT CGGCACCGGC
GGTGACGGCA GCAACAGCAT CAATATTTCC ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT
GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG CTCGTCGGAT
CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG
GATGAGTTAG GTGTATGTTT CCTCTTTGCA CCGAAATATC ACACCGGATT TCGCCATGCA
ATGCCGGTTC GCCAGCAACT AAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT
AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG
ATTGCCGAAA CCTTACGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG
ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AGCTGCATGA CGGCGAAATT
AAGAGCTATC AATTGACCGC TGAAGATTTT GGCCTGACTC CCTACCACCA GGAGCAACTG
GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC
GACGCCGCCC ATGAAGCAGC CGTCGCGGCG AATGTCGCCA TGTTAATGCG CCTGCATGGC
CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT
TACGACAGAG TCACCGCACT GGCGGCACGA GGGTAA
 
Protein sequence
MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV 
PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF
AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA
RLLEQTLAWA QQKLEPTNTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEHLAAALV
SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC
GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA
MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG
MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG
DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G