Gene ECD_01237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01237 
SymboltrpD 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1305381 
End bp1306976 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content56% 
IMG OID 
Productbifunctional indole-3-glycerol-phosphate synthase/anthranilate phosphoribosyltransferase 
Protein accessionACT43127 
Protein GI253977457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CGTACAACCT GGCAGATCAG 
TTGCGCAGCA ATGGTCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAGACCTTA
ATTGAACGCC TGGCGACGAT GAGCAATCCG GTACTGATGC TTTCTCCTGG CCCCGGTGTG
CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCT TGCGTGGCAA GCTGCCCATT
ATTGGCATTT GCCTCGGACA TCAGGCAATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG
GCGGGCGAAA TTCTTCACGG TAAAGCGTCG AGCATTGAAC ATGACGGTCA GGCGATGTTT
GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAACATT
CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCAGT ACGTCACGAT
GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTCACCAC CCAGGGCGCT
CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAGC CAACACGCTG
CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACGCTTA GCCAACAAGA AAGCCACCAG
CTGTTTTCAG CGGTGGTGCG TGGCGAGCTG AAGCCGGAAC AACTGGCGGC GGCGCTGGTG
AGCATGAAAA TTCGCGGTGA GCACCCGAAC GAGATCGCCG GAGCAGCAAC CGCGCTACTG
GAAAACGCCG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CTGATATCGT CGGTACTGGC
GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT
GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG TTCGTCCGAT
CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG
GATGAGTTAG GTGTATGTTT CCTCTTTGCG CCGAAGTATC ACACCGGATT CCGCCACGCG
ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT
AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG
ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG
ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AGCTGCATGA CGGCGAAATT
AAGAGCTATC AATTGACCGC TGAAGATTTT GGCCTGACTC CCTACCACCA GGAGCAACTG
GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GCTTGTTACA AGGTAAAGGC
GACGCCGCCC ATGAAGCAGC CGTCGCTGCG AACGTCGCCA TGTTAATGCG CCTGCATGGC
CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT
TACGACAGAG TTACCGCACT GGCGGCACGA GGGTAA
 
Protein sequence
MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV 
PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF
AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA
RLLEQTLAWA QQKLEPANTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV
SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC
GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA
MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG
MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG
DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G