Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1852 |
Symbol | |
ID | 6484299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1818060 |
End bp | 1819418 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642737226 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_002040978 |
Protein GI | 194444204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.0214876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA CAGCAACAGC CGCTGGCCAG TTTTCAAAAT GAGATCCAGC CAAGTACACG CCATTTTTAT GATGCGCTCC AGGGCGCGCG TACCGCCTTT ATTCTGGAGT GTAAGAAAGC ATCGCCATCA AAAGGCGTGA TTCGCGATGA TTTCGATCCG GCGCGTATTG CCAGTATTTA TCAACATTAC GCCTCGGCAA TCTCGGTGCT CACCGACGAA AAATATTTTC AGGGGAGCTT CGATTTTCTG CCGGTCGTTA GCCAAAGCGC GCCGCAGCCG ATTCTGTGTA AGGATTTTAT TATCGATCCC TATCAGATCT ACCTTGCCCG TTACTATCAG GCCGATGCCT GTTTACTGAT GCTCTCGGTT CTGGATGACG AACAGTATCG CCAGCTCTCC GCCGTCGCGC ACAGTCTGAA AATGGGCGTG CTCACGGAGG TCAGTAATGA CGAAGAACGG GAGCGCGCGA TAGCGTTAGG CGCAAAAGTG GTAGGTATCA ACAATCGCGA TCTGCGCGAT CTGTCGATTG ATTTGAATCG CACCCGCCAG CTGGCGCCAA AACTGGGCCA CGGCGTGACT GTCATCAGCG AGTCCGGGAT TAACACCTAT GGTCAGGTAC GCGAACTGAG CCACTTCGCC AACGGTTTTT TAATTGGCTC GGCGTTAATG GCGCATGACG ATCTTAACGC CGCCGTCCGT CGCGTGCTGC TTGGCGAAAA TAAAGTCTGC GGCCTGACCC GCGTCCAGGA CGCTAAAGCG GCCTGTGACG CTGGCGCAAT ATATGGCGGG TTGATTTTTG TGCCCTCATC TCCACGCGCG GTGAGCGTTG AGCAGGCGCG AGAAGTGATA AGCGGCGCGC CATTGCAGTA TGTCGGCGTT TTCCAGAACG CTGATATCGC CGATGTTTGC CAGAAAGCCG CCGTCCTGTC GCTTTCTGCC GTACAGCTAC ATGGCAGCGA AGACCAGGCG TATGTCAACG CGCTGCGCGA GGCGTTGCCG AAACAGGTGC AAATCTGGAA GGCGCTGAGC GTTAGCGATG CCCTTCCCGC ACGCGATTAT CACCATGTCG ATAAATACGT TTTCGACAAT GGGCAAGGCG GCAGCGGGCA GCGCTTCGAC TGGTCACTGC TACAGGGGCA ACCACTGGAT AATGTGTTAC TGGCGGGCGG GCTGGCGGCC GATAACTGCG TCCAGGCGGC GCAAGTCGGC TGTGCCGGTC TCGATTTTAA TTCAGGTGTG GAGTCACAGC CGGGCATCAA AGATGCTCGT CTTCTGGCCT CGGTTTTTCA GACACTGCGC GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVEARK QQQPLASFQN EIQPSTRHFY DALQGARTAF ILECKKASPS KGVIRDDFDP ARIASIYQHY ASAISVLTDE KYFQGSFDFL PVVSQSAPQP ILCKDFIIDP YQIYLARYYQ ADACLLMLSV LDDEQYRQLS AVAHSLKMGV LTEVSNDEER ERAIALGAKV VGINNRDLRD LSIDLNRTRQ LAPKLGHGVT VISESGINTY GQVRELSHFA NGFLIGSALM AHDDLNAAVR RVLLGENKVC GLTRVQDAKA ACDAGAIYGG LIFVPSSPRA VSVEQAREVI SGAPLQYVGV FQNADIADVC QKAAVLSLSA VQLHGSEDQA YVNALREALP KQVQIWKALS VSDALPARDY HHVDKYVFDN GQGGSGQRFD WSLLQGQPLD NVLLAGGLAA DNCVQAAQVG CAGLDFNSGV ESQPGIKDAR LLASVFQTLR AY
|
| |