Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_2206 |
Symbol | |
ID | 5112898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 2397512 |
End bp | 2398870 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640492393 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_001176932 |
Protein GI | 146311858 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0123051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCG TTTTAGCGAA AATCGTTGCC GATAAGGCTA TCTGGGTTGA AGCCCGCAAA GAACAGCAAC CGCTGGCCAG TTTTCAGAAT GACGTTGTGC CGAGCAGTCG CCGTTTTTAT GACGCCCTTC AGGGCGCTCG CACCGCCTTT ATTCTGGAGT GCAAAAAAGC CTCCCCGTCT AAAGGTGTGA TCCGCGACGA TTTTGACCCT GCGCGTATCG CTGGGGTTTA CAAACACCAC GCATCGGCCA TTTCGGTGCT GACGGATGAG AAATATTTCC AGGGCAGTTT TGATTTCCTG CCGATTGTGA GCCAAATCGC GCCGCAGCCG ATTTTGTGCA AAGACTTCAT TATCGACGCG TACCAAATTT GGTTGGCACG TTTTTATCAG GCCGACGCCT GCCTGCTGAT GCTGTCCGTT CTGGATGACG AACAGTATCG CCAGCTCGCT GCCGTGGCGC ACAGTCTGAA TATGGGCGTG CTGACCGAAG TGAGCAACGA AGAAGAGCTT GAGCGCGCGA TTGCGCTTGA CGCCAAAGTC GTTGGCATCA ACAACCGCGA CCTGCGCGAT CTCTCCATCG ACCTGAACCG TACGCGTGAG CTTGCTCCGC GTCTGGGTGC CGGTGTGACG GTGATCAGCG AATCAGGCAT TAATACTTAC GCTCAGGTTC GCGAGTTGAG TCATTTCGCA AACGGCTTCC TGATCGGCTC AGCCATGATG GCCTACGATG ATTTAAACGT CGCGGTGCGC CGCGTTCTGC TGGGTGAAAA CAAAGTCTGC GGCCTGACTC GCGGCCAGGA TGCTGCGGCG GCGCTTGAAG CGGGCGCAAT CTATGGCGGA TTGATATTTG TCGACGGTTC GCCGCGCACT ATTTCAGAAA ATCAGGCGCG TGAAGTGATA GCAGCCGCAC CACTCAGCTA CGTCGGCGTG TTCCGCGATG CGCCTGTAAA TGACGTCGTG GCAAAAGCCG AAAGCCTGTC CCTCGCCGCC GTTCAACTGC ACGGCAGCGA AGACCAGCAT TATATCGACG CGCTGCGCCA GGCATTGCCT TCGCAGATCC AGATCTGGAA AGCGCAGAGC GTGAGCGACA CCTTGCCAGC CCGCGATTTG AATCACGTTG ATAAGTATGT GCTTGATAAC GGCCAGGGCG GCACGGGCCA GCGTTTCGAC TGGTCGCTGC TGAACGGTGA AAAGCTGGAT AACGTCCTGC TCGCGGGTGG GTTAAGCCCC GACAATTGCG TAGAGGCCGC GAAAGCGGGC TGCGCAGGTC TCGATTTCAA TTCAGGCGTA GAGTCCCAAC CGGGAATCAA AGATGCCAGC AAACTGGCCT CGGTATTCAA GACGCTGCGT GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVEARK EQQPLASFQN DVVPSSRRFY DALQGARTAF ILECKKASPS KGVIRDDFDP ARIAGVYKHH ASAISVLTDE KYFQGSFDFL PIVSQIAPQP ILCKDFIIDA YQIWLARFYQ ADACLLMLSV LDDEQYRQLA AVAHSLNMGV LTEVSNEEEL ERAIALDAKV VGINNRDLRD LSIDLNRTRE LAPRLGAGVT VISESGINTY AQVRELSHFA NGFLIGSAMM AYDDLNVAVR RVLLGENKVC GLTRGQDAAA ALEAGAIYGG LIFVDGSPRT ISENQAREVI AAAPLSYVGV FRDAPVNDVV AKAESLSLAA VQLHGSEDQH YIDALRQALP SQIQIWKAQS VSDTLPARDL NHVDKYVLDN GQGGTGQRFD WSLLNGEKLD NVLLAGGLSP DNCVEAAKAG CAGLDFNSGV ESQPGIKDAS KLASVFKTLR AY
|
| |