Gene SbBS512_E1487 details


Gene Information

Locus tag: SbBS512_E1487
Symbol: trpC
ID: 6270562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1355296
End bp: 1356657
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 53%
IMG OID: 641725587
Product: bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase
Protein accession: YP_001880093
Protein GI: 187731266
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0134] Indole-3-glycerol phosphate synthase
        [COG0135] Phosphoribosylanthranilate isomerase
TIGRFAM ID:
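
The Start bp, End bp, Gene Length and Protein Length fields of this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1355296, 1356657)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both values match the Gene Length (1362 bp) and Protein Length (453 aa) fields of the record.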


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCAAA CCGTTTTAGC GAAAATCGTC GCAGACAAGG CGATTTGGGT AGAAACCCGC 
AAACAGCAGC AACCGCTGGC CAGTTTTCAG AATGAGGTTC AGCCGAGCAC GCGACATTTT
TATGATGCGC TACAGGGTGC ACGCACGGCG TTTATTCTGG AGTGCAAGAA AGCGTCGCCG
TCAAAAGGCG TGATCCGTGA TGATTTCGAT CCGGCACGCA TTGCCGCCAT TTATAAACAT
TACGCTTCGG CAATTTCGGT GCTGACTGAT GAGAAATATT TTCAGGGGAG CTTTGATTTC
CTCCCCATCG TCAGCCAAAT CGCCCCGCAG CCGATTTTAT GTAAAGACTT CATTATCGAC
CCTTACCAGA TCTATCTGGC GCGCTATTAC CAGGCCGATG CCTGCTTATT AATGCTTTCA
GTACTGGATG ACGACCAATA TCGCCAGCTT GCAGCCGTCG CTCACAGTCT GGAGATGGGG
GGGCTGACCG AAGTCAGTAA TGAAGAGGAA CTGGAGCGCG CCATTGCATT AGGGGCAAAG
GTCGTTGGCA TCAACAACCG CGATCTGCGT GATTTGTCGA TTGATCTCAA CCGTACCCGC
GAGCTTGCGC CGAAACTGGG GCACAACGTA ACGGTAATCA GCGAATCCGG CATCAATACT
TACGCTCAGG TGCGCGAGTT AAGCCACTTC GCTAACGGTT TTCTGATTGG TTCGGCGTTG
ATGGCCCATG ACGATTTGCA CGCCGCCGTG CGCCGGGTGT TGCTGGGTGA GAATAAAGTA
TGTGGCCTGA CGCGTGGGCA AGATGCTAAA GCAGCTTATG ACGCGGGCGC GATTTACGGT
GGGTTGATTT TTGTCGCGAC ATCACCGCGT TGCGTCAACG TTGAACAGGC GCAGGAAGTG
ATGGCTGCGG CACCGTTGCA GTATGTTGGC GTGTTCCGCA ATCACGATAT TGCCGATGTG
GTGGACAAAG CTAAGGTGTT ATCGCTGGCG GCAGTGCAAC TGCATGGTAA TGAAGATCAG
CTGTATATCG ATACGCTGCG TGAAGCTCTG CCAGCACATG TTGCCATCTG GAAAGCATTA
AGCGTCGGTG AAACCCTGCC CGCCCGCGAG TTTCAGCACG TTGATAAATA TGTTTTAGAC
AACGGCCAGG GTGGAAGCGG GCAACGTTTT GACTGGTCAC TATTAAATGG TCAATCGCTT
GGCAACGTTC TGCTGGCGGG GGGCTTAGGC GCAGATAACT GCGTGGAAGC GGCACAAACC
GGCTGCGCCG GACTTGATTT TAATTCTGCT GTAGAGTCGC AACCGGGCAT CAAAGACGCA
CGTCTTTTGG CCTCGGTTTT CCAGACGCTG CGCGCATATT AA
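
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction. It assumes that GC content means the count of G and C bases divided by the total number of bases; the record does not say how its 53% figure was computed or rounded. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short example input.
first_line = "ATGATGCAAA CCGTTTTAGC GAAAATCGTC GCAGACAAGG CGATTTGGGT AGAAACCCGC"
print(len(clean_sequence(first_line)))  # 60
print(round(100 * gc_fraction(first_line)))
```

Passing the full 1362 bp sequence to `gc_fraction` gives the value to compare against the GC content field.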
 
Protein sequence
MMQTVLAKIV ADKAIWVETR KQQQPLASFQ NEVQPSTRHF YDALQGARTA FILECKKASP 
SKGVIRDDFD PARIAAIYKH YASAISVLTD EKYFQGSFDF LPIVSQIAPQ PILCKDFIID
PYQIYLARYY QADACLLMLS VLDDDQYRQL AAVAHSLEMG GLTEVSNEEE LERAIALGAK
VVGINNRDLR DLSIDLNRTR ELAPKLGHNV TVISESGINT YAQVRELSHF ANGFLIGSAL
MAHDDLHAAV RRVLLGENKV CGLTRGQDAK AAYDAGAIYG GLIFVATSPR CVNVEQAQEV
MAAAPLQYVG VFRNHDIADV VDKAKVLSLA AVQLHGNEDQ LYIDTLREAL PAHVAIWKAL
SVGETLPARE FQHVDKYVLD NGQGGSGQRF DWSLLNGQSL GNVLLAGGLG ADNCVEAAQT
GCAGLDFNSA VESQPGIKDA RLLASVFQTL RAY
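
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The record specifies translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it in the permitted start codons. The following is a minimal sketch that relies on this, assuming the input is in frame and starts with ATG, as this gene does. It does not handle alternative start codons or ambiguous bases, and `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence from this record.
print(translate("ATGATGCAAA CCGTTTTAGC GAAAATCGTC"))  # MMQTVLAKIV
print(translate("CGCGCATATT AA"))                     # RAY
```

The two outputs match the first ten and last three residues of the protein sequence listed in the record.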