Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1603 |
Symbol | |
ID | 6872020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1545538 |
End bp | 1546896 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642784749 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_002215417 |
Protein GI | 198243276 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.913758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.0208372 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA CAGCAACAGC CGCTGGCCAG CTTTCAAAAT GAGATCCAGC CAAGTACACG CCATTTTTAT GATGCGCTCC AGGGCGCGCG TACCGCCTTT ATTCTGGAGT GTAAGAAAGC ATCGCCATCA AAAGGCGTGA TTCGCGATGA TTTCGATCCG GCGCGTATTG CCAATATTTA TCAACATTAC GCCTCGGCAA TCTCGGTGCT CACCGACGAA AAATATTTTC AGGGTAGCTT CGATTTTCTG CCGGTCGTTA GCCAAAGCGC ACCGCAGCCG ATTCTGTGTA AGGATTTTAT TATCGATCCC TATCAGATCT ACCTTGCCCG TTACTATCAG GCCGATGCCT GTTTACTGAT GCTCTCGGTT CTGGATGACG AACAGTATCG CCAACTCTCC GCCGTCGCGC ACAGTCTGAA AATGGGCGTG CTCACGGAAG TCAGTAATGA CGAAGAACGG GAGCGCGCGA TAGCGTTAGG CGCAAAAGTG GTAGGTATCA ACAATCGCGA TCTGCGCGAT CTGTCGATTG ATTTGAATCG CACCCGCCAG CTGGCGCCAA AACTGGGCCA CGGCGTGACT GTCATCAGCG AGTCCGGGAT TAACACCTAT GGTCAGGTAC GCGAACTGAG CCACTTCGCC AACGGTTTTT TAATTGGCTC GGCGTTAATG GCGCATGACG ATCTTAACGC CGCCGTCCGT CGCGTGCTGC TTGGCGAAAA TAAAGTCTGC GGCCTGACCC GCGCCCAGGA CGCTAAAGCG GCCTGTGACG CTGGCGCAAT ATATGGCGGG TTGATTTTTG TGCCCTCATC TCCACGCGCG GTGAGCGTTG AGCAGGCGCG AGAAGTGATA AGCGGCGCGC CATTGCAGTA TGTCGGCGTT TTCCAGAACG CTGATATCGC CGATGTTTGC CAGAAAGCCG CCGTCCTGTC GCTTTCTGCC GTACAGCTAC ATGGCAGCGA AGACCAGGCG TATGTCAACG CGCTGCGCGA GGCGTTGCCG CGCAATGTGC AAATCTGGAA GGCGCTGAGC GTTAGCAATG CCCTTCCCGC ACGCGATTAT CACCATGTCG ATAAATACAT TTTCGACAAT GGGCAAGGCG GCAGCGGGCA GCGCTTCGAC TGGTCACTGC TACAGGGGCA ACCGCTGGAT GATGTGTTAC TGGCGGGCGG GCTGGCGGCC GATAACTGCG TCCAGGCGGC GCAAGTCGGC TGTGCCGGTC TCGATTTTAA TTCAGGTGTG GAGTCACAGC CGGGCATCAA AGATGCTCGT CTTCTGGCCT CGGTTTTTCA GACACTGCGC GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVEARK QQQPLASFQN EIQPSTRHFY DALQGARTAF ILECKKASPS KGVIRDDFDP ARIANIYQHY ASAISVLTDE KYFQGSFDFL PVVSQSAPQP ILCKDFIIDP YQIYLARYYQ ADACLLMLSV LDDEQYRQLS AVAHSLKMGV LTEVSNDEER ERAIALGAKV VGINNRDLRD LSIDLNRTRQ LAPKLGHGVT VISESGINTY GQVRELSHFA NGFLIGSALM AHDDLNAAVR RVLLGENKVC GLTRAQDAKA ACDAGAIYGG LIFVPSSPRA VSVEQAREVI SGAPLQYVGV FQNADIADVC QKAAVLSLSA VQLHGSEDQA YVNALREALP RNVQIWKALS VSNALPARDY HHVDKYIFDN GQGGSGQRFD WSLLQGQPLD DVLLAGGLAA DNCVQAAQVG CAGLDFNSGV ESQPGIKDAR LLASVFQTLR AY
|
| |