Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2365 |
Symbol | |
ID | 6066108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2606842 |
End bp | 2608200 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641601768 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_001725327 |
Protein GI | 170020373 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.979048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000345337 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA CAGCAGCAAC CGCTGGCCAG TTTTCAGAAT GAGGTTCAGC CGAGCACGCG ACATTTTTAT GATGCGCTAC AGGGTGCGCG CACGGCGTTT ATTCTGGAGT GCAAGAAAGC GTCGCCGTCA AAAGGCGTGA TCCGTGATGA TTTCGATCCA GCACGCATTG CCGCCATTTA TAAACATTAC GCTTCGGCAA TTTCGGTGCT GACTGATGAG AAATATTTTC AGGGGAGCTT TGATTTCCTC CCCATCGTCA GCCAAATCGC CCCGCAGCCG ATTTTATGTA AAGACTTCAT TATCGACCCT TACCAGATCT ATCTGGCGCG CTATTACCAG GCCGATGCCT GCTTATTAAT GCTTTCAGTA CTGGATGACG ACCAATATCG CCAGCTTGCC GCCGTCGCTC ACAGTCTGGA GATGGGGGTG CTGACCGAAG TCAGTAATGA AGAGGAACAG GAGCGCGCCA TTGCATTGGG AGCAAAGGTC GTTGGCATCA ACAACCGCGA TCTGCGTGAT TTGTCGATTG ATCTCAACCG TACCCGCGAG CTTGCGCCGA AACTGGGGCA CAACGTGACG GTAATCAGCG AATCCGGCAT CAATACTTAC GCTCAGGTGC GCGAGTTAAG CCACTTCGCT AACGGTTTTC TGATTGGTTC GGCGTTGATG GCCCATGACG ATTTGCACGC CGCCGTGCGC CGGGTGTTGC TGGGTGAGAA TAAAGTATGT GGCCTGACGC GTGGGCAAGA TGCTAAAGCA GCTTATGACG CGGGCGCGAT TTACGGTGGG TTGATTTTTG TTGCGACATC ACCGCGTTGC GTCAACGTTG AACAGGCGCA GGAAGTGATG GCTGCGGCAC CGTTGCAGTA TGTTGGCGTG TTCCGCAATC ACGATATTGC CGATGTGGTG GACAAAGCTA AGGTGTTATC GCTGGCGGCA GTGCAACTGC ATGGTAATGA AGAACAGCTG TATATCGATA CGCTGCGTGA AGCTCTGCCA GCACATGTTG CCATCTGGAA AGCATTAAGC GTCGGTGAAA CCCTGCCCGC CCGCGAGTTT CAGCACGTTG ATAAATATGT TTTAGACAAC GGCCAGGGTG GAAGCGGGCA ACGTTTTGAC TGGTCACTAT TAAATGGTCA ATCGCTTGGC AACGTTCTGC TGGCGGGGGG CTTAGGCGCA GATAACTGCG TGGAAGCGGC ACAAACCGGC TGCGCCGGAC TTGATTTTAA TTCTGCTGTA GAGTCGCAAC CGGGCATCAA AGACGCACGT CTTTTGGCCT CGGTTTTCCA GACGCTGCGC GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVEARK QQQPLASFQN EVQPSTRHFY DALQGARTAF ILECKKASPS KGVIRDDFDP ARIAAIYKHY ASAISVLTDE KYFQGSFDFL PIVSQIAPQP ILCKDFIIDP YQIYLARYYQ ADACLLMLSV LDDDQYRQLA AVAHSLEMGV LTEVSNEEEQ ERAIALGAKV VGINNRDLRD LSIDLNRTRE LAPKLGHNVT VISESGINTY AQVRELSHFA NGFLIGSALM AHDDLHAAVR RVLLGENKVC GLTRGQDAKA AYDAGAIYGG LIFVATSPRC VNVEQAQEVM AAAPLQYVGV FRNHDIADVV DKAKVLSLAA VQLHGNEEQL YIDTLREALP AHVAIWKALS VGETLPAREF QHVDKYVLDN GQGGSGQRFD WSLLNGQSLG NVLLAGGLGA DNCVEAAQTG CAGLDFNSAV ESQPGIKDAR LLASVFQTLR AY
|
| |