Gene EcHS_A1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1371 
SymboltrpC 
ID5592556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1365484 
End bp1366845 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content53% 
IMG OID640920526 
Productbifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase 
Protein accessionYP_001458085 
Protein GI157160767 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0134] Indole-3-glycerol phosphate synthase
[COG0135] Phosphoribosylanthranilate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCAAA CCGTTTTAGC GAAAATCGTC GCAGACAAGG CGATTTGGGT AGAAGCCCGC 
AAACAGCAGC AACCGCTGGC CAGTTTTCAG AATGAGGTTC AGCCGAGCAC GCGACATTTT
TATGATGCGC TACAGGGTGC GCGCACGGCG TTTATTCTGG AGTGCAAGAA AGCGTCGCCG
TCAAAAGGCG TGATCCGTGA TGATTTCGAT CCAGCACGCA TTGCCGCCAT TTATAAACAT
TACGCTTCGG CAATTTCGGT GCTGACTGAT GAGAAATATT TTCAGGGGAG CTTTGATTTC
CTCCCCATCG TCAGCCAAAT CGCCCCGCAG CCGATTTTAT GTAAAGACTT CATTATCGAC
CCTTACCAGA TCTATCTGGC GCGCTATTAC CAGGCCGATG CCTGCTTATT AATGCTTTCA
GTACTGGATG ACGACCAATA TCGCCAGCTT GCCGCCGTCG CTCACAGTCT GGAGATGGGG
GTGCTGACCG AAGTCAGTAA TGAAGAGGAA CAGGAGCGCG CCATTGCATT GGGAGCAAAG
GTCGTTGGCA TCAACAACCG CGATCTGCGT GATTTGTCGA TTGATCTCAA CCGTACCCGC
GAGCTTGCGC CGAAACTGGG GCACAACGTG ACGGTAATCA GCGAATCCGG CATCAATACT
TACGCTCAGG TGCGCGAGTT AAGCCACTTC GCTAACGGTT TTCTGATTGG TTCGGCGTTG
ATGGCCCATG ACGATTTGCA CGCCGCCGTG CGCCGGGTGT TGCTGGGTGA GAATAAAGTA
TGCGGCCTGA CGCGTGGGCA AGATGCTAAA GCAGCTTATG ACGCGGGCGC GATTTACGGT
GGGTTGATTT TTGTTGCGAC ATCACCGCGT TGCGTCAACG TTGAACAGGC GCAGGAAGTG
ATGGCTGCGG CACCGTTGCA GTATGTTGGC GTGTTCCGCA ATCACGATAT TGCCGATGTG
GTGGACAAAG CTAAGGTGTT ATCGCTGGCG GCAGTGCAAC TGCATGGTAA TGAAGAACAG
CTGTATATCG ATACGCTGCG TGAAGCTCTG CCAGCACATG TTGCCATCTG GAAAGCATTA
AGCGTCGGTG AAACCCTGCC CGCCCGCGAG TTTCAGCACG TTGATAAATA TGTTTTAGAC
AACGGCCAGG GTGGAAGCGG GCAACGTTTT GACTGGTCAC TATTAAATGG TCAATCGCTT
GGCAACGTTC TGCTGGCGGG GGGCTTAGGC GCAGATAACT GCGTGGAAGC GGCACAAACC
GGCTGCGCCG GACTTGATTT TAATTCTGCT GTAGAGTCGC AACCGGGCAT CAAAGACGCA
CGTCTTTTGG CCTCGGTTTT CCAGACGCTG CGCGCATATT AA
 
Protein sequence
MMQTVLAKIV ADKAIWVEAR KQQQPLASFQ NEVQPSTRHF YDALQGARTA FILECKKASP 
SKGVIRDDFD PARIAAIYKH YASAISVLTD EKYFQGSFDF LPIVSQIAPQ PILCKDFIID
PYQIYLARYY QADACLLMLS VLDDDQYRQL AAVAHSLEMG VLTEVSNEEE QERAIALGAK
VVGINNRDLR DLSIDLNRTR ELAPKLGHNV TVISESGINT YAQVRELSHF ANGFLIGSAL
MAHDDLHAAV RRVLLGENKV CGLTRGQDAK AAYDAGAIYG GLIFVATSPR CVNVEQAQEV
MAAAPLQYVG VFRNHDIADV VDKAKVLSLA AVQLHGNEEQ LYIDTLREAL PAHVAIWKAL
SVGETLPARE FQHVDKYVLD NGQGGSGQRF DWSLLNGQSL GNVLLAGGLG ADNCVEAAQT
GCAGLDFNSA VESQPGIKDA RLLASVFQTL RAY