Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0393 |
Symbol | |
ID | 6794497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 388098 |
End bp | 391067 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642774682 |
Product | type III restriction-modification system StyLTI enzyme res |
Protein accession | YP_002145338 |
Protein GI | 197248044 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00360009 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TACTGGAAGA ACTTCCCCAT CAGGAACAGG CGTTAGCGGC GATTCTGGCG AGTTTCACCG GTATCGATCA CGCGCAGGCC GATCATAATC ACTATGCTAA TCCGCTGATT AAGGAACGTT ACGACGATAA GGCCAATATT GACGTTAAAA TGGAGACCGG GACGGGCAAA ACCTATGTCT ATACCCGGTT GATGTATGAA CTGCATCAGA ACTACGGGCT CTTCAAATTT GTGCTGGTGG TGCCGACACC AGCCATTAAA GAAGGCGCGC GGAACTTTAT CACCAGCGAT TACGCCAGAC AGCATTTTTC ACAGTTCTAC GAAAATACGC GGATGGAACT TTGCACCATT AACGCCGGTG ATTTTAAAGT AAAGTCGGGG CGTAAAAATT TTCCGGCCCA GCTATTAAGT TTTACTGATG CCAGCCGTCG TGATAGCCAT ACGATTCAGG TTTTGCTGAT CAATGCGCAA ATGCTCAATT CCGCCAGTAT GACGCGAGAC GATTACGATC AAACGCTACT GGGTGGGCTG ACGTCGCCTG TTAAAGGGCT GCAAATGACG CGACCGGTGG TCATTATTGA TGAACCGCAT CGTTTTGCGC GAGATAACAA ATTTTATCGA GCGATTCAGG CTATTCAGCC GCAAATGATC GTCCGCTTTG GCGCTACCTT CCCGGATATT GTCGAAGGTA AGGGTAAAAA TAAATGTGTA CGTAAAGATT ACTATCGCCG GCAACCGCAG TTTGATCTCA ACGCGGTGGA CAGTTTTAAC GATGGTTTGG TGAAAGGTAT TGATATTTAT TACCCAAATC TCCCCGAAGA ACAGGCCAAC AATCGTTATA TCGTTGACAG CGTCACGGCA AAGAAATTAA TCCTCCGACG GGGGAGCAAA ATTGCCGAGG TTGGCGTGGG CGAAAATCTC GCCGATGTCG ATGCAGGATT TGAAGGCAGT ATCGAATATG CCGGCAGTAA AATGTTGTCG AACGATCTGG AGCTGGAGGC AGGAATGGCG CTGGTGCCAG GAACCTTTGG CGCGAGCTAC CAGGAACTGA TTATTCAGGA CGCCATTGAT AAGCATTTTG ATACCGAGCA GGCAAATTTC TTACGCAGCA ATGAGCCAGA AAATAATGCC CCGCGTATTA AGACCTTAAG CCTTTTCTTT ATTGACAGTA TTAAAAGCTA TCGTGATGAC GAAGGCTGGT TGAAAGTCAC TTTTGAGCGT TTGCTGAAAA AGAAACTGAC GCAACTGATT GACGATTATC AGCGCAAGAC CCTGCCGCGA GAAGTGGAGT ATCTGTCGTT TCTGCAGGCC ACGCTCGCCA GCCTGCACTC GGATAACCAA AACGTCCACG CTGGTTACTT TGGCGAAGAC CGCGGAAGCG GCGATGAGGC GATCCAGGCT GAAGTAGATG ATATTCTGAA AAATAAAGAG AAGTTGCTCA GTTTTTCAGA CCATCACGGC AACTGGGAAA CGCGCCGCTT TCTGTTTTCA AAATGGACGC TTCGCGAAGG CTGGGATAAC CCGAATGTTT TTGTCATTGC TAAATTACGT TCTTCCGGTA GCGAGTCGAG CAAAATTCAG GAAGTGGGGC GCGGCCTGCG GCTACCGGTA GATGAAAACG GCCATCGCGT TCATCAGGAA GAGTGGCCGT CCCGACTGTC GTTTCTGATT GGTTATGATG AAAAAGCGTT TGCCAGTATG CTGGTTGATG AGATTAATCG CGACAGCAAA GTTCAGCTTA ACGAGCAGAA GCTGGATGAG GCGATGATCA CTCTCATCGT CACCGAGCGG CAAAAAGTCG ATCCTGCGTT TACGGAGCTT CGTTTGCTGG AAGATCTGGA TGATAAAAAA CTGATCAACC GGAGTAATGA GTTTAAACCC AGCGTCACGC TTAACGGGGA AACCAAAAGT GGTTTTGCGT GGCTACTGGA GTTCTACCCT GAGCTGACGC AGGCGCGGGT GCGAGCGGAT CGCATTCGTG ACAATAAGCC CGCCTCCCGG CTGCGAGTCA GGTTACGCAA AGAGAATTGG GAACAACTTA GCAGTATCTG GGAGCAGTTT TCCCGCCGTT ATATGCTGCA ATTCGAGCGT AGCGGCGCTT CTCTGGAACA GATTGCCGCC GAGGTGCTGC GCGATCCGGC GCTGTATATA CGCCAGAAGC CAAGCCAGGT GCAACAACGG CTGGTATCGA ATGAAGATAA TGGCCGTTTT GAAGTGGCGC AGCGGGAAGG CGAATTAGCC GCCAGCGAAT TTATGGCGGG CATGAAATAT GGCCATTTTC TTAAGCAACT GGCGTTACGC ACCAGTCTGC CGGTTAACGT CCTGCACCCG GTGTTAATGG CGATGCTGCG TGATGTTTTG CACGGAGATT CACGCTATTT AAGCGAGATC TCGTTGGACA ATATGACCCG CGCATTACAG ACGCGGATTA ATGCGCATTT TGCGCAGCGC CACGATTATC TGCCTCTCGA CTTCCAGGCT TCAACGTCGG TATTTGATTC CACGGCACGG CAGTTCAGAG AGGAGATTAG CGCTGAAATT GTGGGAAAAA ATGTGGATGA GAATGCGATA GACGATCCCC GTTCTCTCTA TCAAATACCG CCGTTGCGTT ATGACAGCGT CGATCCAGAA TTGCCGCTAT TAAAATACGA TTATCCGCAA CAGGTTTCTG TGTTTGGCAA ACTGCCTAAG CGGGCCATTC AGATCCCCAA ATATACGGGG GGCTCTACTA CGCCGGATTT TGTGTACCGT ATTGAGCGTC AGGACGCCGA CAGCGTTTAT TTACTGGTTG AAACTAAAGC AGAAAATATG CGCGTAGGCG ATCAGGTTAT TCTTGATGCG CAACGTAAAT TCTTCGATAT GCTGCGTCGG CAAAATATCA ATGTCGAGTT TGCGGATGCG ACCAGCGCGC CGGCGGTATT TTCTACGATC AATGGCTTGA TTGAGGGGAA GGCAAACTAA
|
Protein sequence | MNILLEELPH QEQALAAILA SFTGIDHAQA DHNHYANPLI KERYDDKANI DVKMETGTGK TYVYTRLMYE LHQNYGLFKF VLVVPTPAIK EGARNFITSD YARQHFSQFY ENTRMELCTI NAGDFKVKSG RKNFPAQLLS FTDASRRDSH TIQVLLINAQ MLNSASMTRD DYDQTLLGGL TSPVKGLQMT RPVVIIDEPH RFARDNKFYR AIQAIQPQMI VRFGATFPDI VEGKGKNKCV RKDYYRRQPQ FDLNAVDSFN DGLVKGIDIY YPNLPEEQAN NRYIVDSVTA KKLILRRGSK IAEVGVGENL ADVDAGFEGS IEYAGSKMLS NDLELEAGMA LVPGTFGASY QELIIQDAID KHFDTEQANF LRSNEPENNA PRIKTLSLFF IDSIKSYRDD EGWLKVTFER LLKKKLTQLI DDYQRKTLPR EVEYLSFLQA TLASLHSDNQ NVHAGYFGED RGSGDEAIQA EVDDILKNKE KLLSFSDHHG NWETRRFLFS KWTLREGWDN PNVFVIAKLR SSGSESSKIQ EVGRGLRLPV DENGHRVHQE EWPSRLSFLI GYDEKAFASM LVDEINRDSK VQLNEQKLDE AMITLIVTER QKVDPAFTEL RLLEDLDDKK LINRSNEFKP SVTLNGETKS GFAWLLEFYP ELTQARVRAD RIRDNKPASR LRVRLRKENW EQLSSIWEQF SRRYMLQFER SGASLEQIAA EVLRDPALYI RQKPSQVQQR LVSNEDNGRF EVAQREGELA ASEFMAGMKY GHFLKQLALR TSLPVNVLHP VLMAMLRDVL HGDSRYLSEI SLDNMTRALQ TRINAHFAQR HDYLPLDFQA STSVFDSTAR QFREEISAEI VGKNVDENAI DDPRSLYQIP PLRYDSVDPE LPLLKYDYPQ QVSVFGKLPK RAIQIPKYTG GSTTPDFVYR IERQDADSVY LLVETKAENM RVGDQVILDA QRKFFDMLRR QNINVEFADA TSAPAVFSTI NGLIEGKAN
|
| |