Gene Information |
Locus tag | SeAg_B2998 |
Symbol | |
ID | 6796813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 2928382 |
End bp | 2930043 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642777165 |
Product | invasion protein regulator |
Protein accession | YP_002147774 |
Protein GI | 197250952 |
COG category | [K] Transcription |
COG ID | [COG3710] DNA-binding winged-HTH domains |
TIGRFAM ID | |
|
|
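The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2928382
end_bp = 2930043

# Inclusive span of the CDS in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons in the CDS, including the stop codon.
codons = gene_length // 3

# Assumption: the reported protein length excludes the stop codon.
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1662 553"
```

Under these assumptions the computed values agree with the listed Gene Length (1662 bp) and Protein Length (553 aa).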
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000362876 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA CTCAACATGG ACGGCTCCCT GCTACGCTCA GAAAAGAAAG TCAATATTCC GCCAAAAGAA TATGCTGTTC TGGTCATCCT GCTCGAAGCC GCCGGCGAGA TTGTGAGTAA AAACACCTTA CTGGACCAGG TATGGGGCGA CGCGGAAGTT AACGAAGAAT CTCTTACCCG CTGTATTTAT GCCTTACGAC GTATTCTGTC GGAAGATAAA GAGCATCGTT ACATTGAAAC ACTGTACGGA CAGGGCTATC GGTTTAATCG TCCGGTCGTA GTGGTGTCTC CGCCAGCGCC GCAACCTACG ACTCATACAT TGGCGATACT TCCTTTTCAG ATGCAGGATC AGGTTCAATC CGAGAGTTTG CATTACTCTA TCGTGAAGGG ATTATCGCAG TATGCGCCCT TTGGCCTGAG CGTGCTGCCG GTGACCATTA CGAAGAACTG CCGCAGTGTT AAGGATATAC TTGAGCTCAT GGATCAATTA CGCCCCGATT ATTATATCTC CGGGCAGATG ATACCTGATG GTAATGATAA TATTGTACAG ATCGAGATAG TTCGGGTTAA AGGTTATCAC CTGCTGCACC AGGAAAGCAT TAAGTTGATA GAACACCAAC CCGCTTCTCT CTTGCAAAAC AAAATTGCGA ATCTTTTGCT CAGATGTATT CCCGGGCTTC GCTGGGACAC AAAGCAGATT AGCGAGCTAA ATTCGATTGA CAGCACTATG GTTTACTTAC GCGGTAAGCA TGAGTTAAAT CAATACACCC CCTATAGCTT ACAGCAAGCG CTTAAATTGC TGACTCAATG CGTCAACATG TCGCCAAACA GCATTGCGCC TTACTGTGCG CTGGCAGAAT GCTACCTCAG CATGGCGCAA ATGGGGATTT TTGATAAACA AAACGCTATG ATCAAAGCTA AAGAACATGC GATTAAGGCG ACAGAGCTGG ACCACAATAA TCCACAAGCT TTAGGATTAC TGGGGCTAAT TAATACGATT CATTCAGAAT ACATCGTCGG GAGTTTGCTA TTCAAACAAG CTAACTTACT TTCGCCCATT TCTGCAGATA TTAAATATTA TTATGGCTGG AATCTCTTCA TGGCTGGTCA GTTGGAGGAG GCCTTACAAA CGATTAACGA GTGTTTAAAA TTGGACCCAA CGCGCGCAGC CGCAGGGATC ACTAAGCTGT GGATTACCTA TTATCATACC GGTATTGATG ATGCTATACG TTTAGGCGAT GAATTACGCT CACAACACTT GCAGGATAAT CCAATATTAT TAAGTATGCA GGTTATGTTC CTTTCGCTTA AAGGTAAACA TGAACTGGCA CGAAAATTAA CTAAAGAAAT ATCCACGCAG GAAATAACAG GGCTTATTGC TGTTAATCTT CTTTATGCTG AATACTGTCA GAATAGTGAG CGTGCCTTAC CGACGATAAG AGAGTTTCTG GAAAGCGAAC AGCGTATTGA TAATAATCCG GGATTATTAC CGTTAGTGCT GGTTGCCCAC GGCGAAGCTA TTGCCGAGAA AATGTGGAAT AAATTTAAAA ACGAAGACAA TATTTGGTTC AAAAGATGGA AACAGGATCC CCGCTTGATT AAATTACGGT AA
|
Protein sequence | MPHFNPVPVS NKKFVFDDFI LNMDGSLLRS EKKVNIPPKE YAVLVILLEA AGEIVSKNTL LDQVWGDAEV NEESLTRCIY ALRRILSEDK EHRYIETLYG QGYRFNRPVV VVSPPAPQPT THTLAILPFQ MQDQVQSESL HYSIVKGLSQ YAPFGLSVLP VTITKNCRSV KDILELMDQL RPDYYISGQM IPDGNDNIVQ IEIVRVKGYH LLHQESIKLI EHQPASLLQN KIANLLLRCI PGLRWDTKQI SELNSIDSTM VYLRGKHELN QYTPYSLQQA LKLLTQCVNM SPNSIAPYCA LAECYLSMAQ MGIFDKQNAM IKAKEHAIKA TELDHNNPQA LGLLGLINTI HSEYIVGSLL FKQANLLSPI SADIKYYYGW NLFMAGQLEE ALQTINECLK LDPTRAAAGI TKLWITYYHT GIDDAIRLGD ELRSQHLQDN PILLSMQVMF LSLKGKHELA RKLTKEISTQ EITGLIAVNL LYAEYCQNSE RALPTIREFL ESEQRIDNNP GLLPLVLVAH GEAIAEKMWN KFKNEDNIWF KRWKQDPRLI KLR
|
| |
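The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for cleaning such a string and computing its GC fraction is given below, so that the listed GC content can be compared against the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first block of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustrative input: the first two blocks of the gene sequence above.
example = clean_sequence("ATGCCACATT TTAATCCTGT")
print(len(example), example[:3])

# GC fraction of the first block only (4 of 10 bases are G or C).
print(gc_fraction(clean_sequence("ATGCCACATT")))  # prints 0.4
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare with the reported GC content; the same cleaning step also applies to the Protein sequence field before any length check.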