Gene EcSMS35_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1869 
SymboltrpD 
ID6144331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1893028 
End bp1894623 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content56% 
IMG OID641616745 
Productbifunctional glutamine amidotransferase/anthranilate phosphoribosyltransferase 
Protein accessionYP_001743923 
Protein GI170679698 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0512] Anthranilate/para-aminobenzoate synthases component II
[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000144467 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGACA TTCTGCTGCT CGATAATATC GATTCTTTTA CTTACAACCT GGCAGATCAG 
TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA
ATTGAACGCC TGGCGACGAT GAGCAATCCG GTGCTGATGC TCTCTCCTGG CCCCGGTGTG
CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCC TGCGCGGCAA GCTGCCAATT
ATTGGCATTT GCCTCGGTCA TCAGGCGATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG
GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT
GCCGGATTAA CAAACCCGCT GCCAGTGGCG CGTTATCACT CGCTGGTTGG CAGTAATATT
CCGGCAGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCGGT GCGCCATGAT
GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTTACTAC CCAGGGCGCT
CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAAC CAACACGCTG
CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACCCTTA GTCAGCAGGA AAGCCACCAG
CTATTTTCAG CGGTGGTGCG TGGCGAACTG AAACCGGAAC AACTGGCGGC GGCGCTGGTG
AGCATGAAAA TTCGCGGCGA GCACCCGAAT GAAATCGCCG GGGCAGCAAC CGCGCTACTG
GAAAACGCCG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CCGATATCGT CGGTACTGGC
GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGC
GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCCGG CTCGTCGGAT
CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG
GATGAGTTAG GCGTCTGTTT CCTCTTTGCG CCGAAATATC ACACCGGATT CCGCCATGCA
ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTTCTGGG GCCATTAATT
AACCCGGCGC ATCCTCCGCT GGCGTTAATT GGTGTCTACA GCCCGGAGCT GGTGCTGCCG
ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG
ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AACTGCATGA CGGCGAAATT
AAAAGCTATC AGCTCACCGC AGAAGACTTT GGCCTGACAC CCTACCACCA GGAGCAATTG
GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC
GACGCCGCCC ATGAAGCAGC CGTCGCGGCG AATGTCGCCA TGTTAATGCG CCTGCATGGC
CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT
TACGACAGAG TCACCGCACT GGCGGCACGA GGGTAA
 
Protein sequence
MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV 
PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF
AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA
RLLEQTLAWA QQKLEPTNTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV
SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC
GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA
MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG
MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG
DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G