Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0453 |
Symbol | |
ID | 6489338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 452428 |
End bp | 455397 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642740722 |
Product | type III restriction-modification system StyLTI enzyme res |
Protein accession | YP_002044389 |
Protein GI | 194451936 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.985709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 1.2237e-22 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATATTT TACTGGAAGA ACTTCCCCAT CAGGAACAGG CGTTAGCGGC GATTCTGGCG AGTTTCACCG GTATCGATCA CGCGCAGGCC GATCATAATC ACTATGCTAA TCCGCTGATT AAGGAACGTT ACGACGATAA GGCCAATATT GACGTTAAAA TGGAGACCGG GACGGGCAAA ACCTATGTCT ATACCCGGTT GATGTATGAA CTGCATCAGA AGTATGGCCT CTTCAAATTT GTGCTGGTGG TGCCGACGCC AGCCATTAAA GAAGGCGCGC GGAACTTTAT CACCAGCGAT TACGCCAGAC AGCATTTTTC ACAGTTCTAC GAAAATACGC GGATGGAACT TTGCACCATC AACGCCGGTG ATTTTAAAGT AAAGTCGGGG CGTAAAAATT TTCCGGCCCA GCTATTAAGT TTTACTGATG CCAGCCGTCG TGATAGCCAT ACGATTCAGG TTTTGCTGAT CAATGCGCAA ATGCTCAATT CCGCCAGTAT GACGCGAGAC GATTACGATC AAACGCTCCT GGGCGGGCTG ACGTCGCCTG TTAAAGGGCT GCAAATGACG CGACCGGTGG TCATTATTGA TGAACCGCAT CGTTTTGCGC GAGATAACAA ATTTTATCGA GCGATTCAGG CTATTCAGCC GCAAATGATC GTCCGCTTTG GCGCTACCTT CCCGGATATT GTCGAAGGTA AGGGTAAAAA TAAATGTGTA CGTAAAGATT ACTATCGCCG GCAACCGCAG TTTGATCTCA ACGCGGTGGA CAGTTTTAAC GATGGTTTGG TTAAAGGTAT TGATATTTAT TACCCGAATC TCCCCGAAGA ACAGGCCAAC AATCGTTATA TCGTTGACAG CGTCACGGCA AAGAAATTAA TCCTCCGACG GGGGAGCAAA ATTGCCGAGG TTGGCGTGGG CGAAAATCTC GCCGATGTCG ATGCTGGATT TGAGGGCAGT ATTGAATATG CCGGCAGTAA AATGTTGTCG AACGATCTGG AGTTGGAGGC AGGGATGGCG CTGGTGCCAG GAACCTTTGG CGCGAGCTAC CAGGAACTGA TTATTCAGGA AGCCATTGAT AAGCATTTTG ATACCGAGCA GGCAAATTTC TTACGCAGCA ATGAGCCAGA AAATAATGCC CCGCGTATTA AGACCTTAAG CCTTTTCTTT ATTGACAGTA TTAAAAGCTA TCGTGATGAC GAAGGTTGGT TGAAACTGAA GTTTGAGTGT TTACTGAAAA AGAAACTGAC GCAACTGATT GACGATTATC AGCGCAAGAC CCTGCCGCGA GAAGTGGAGT ATCTGTCGTT TCTGCAGGCC ACGCTCGCCA GCCTGCACTC GGATAACCAA AACGTCCACG CTGGTTACTT TGGCGAAGAC CGCGGAAGCG GCGATGAGGC GATCCAGGCT GAGGTAGATG ATATTCTGAA AAATAAAGAG AAGTTGCTCA GTTTTTCAGA CCATCACGGC AACTGGGAAA CGCGCCGCTT TCTGTTTTCA AAATGGACGC TTCGCGAAGG CTGGGATAAC CCGAATGTTT TTGTCATTGC TAAATTACGT TCTTCCGGTA GCGAGTCGAG CAAAATTCAG GAAGTGGGGC GCGGCCTGCG GCTACCGGTA GATGAAAACG GCCATCGCGT TCATCAGGAA GAGTGGCCGT CCCGACTGTC GTTTCTGATT GGTTATGATG AAAAAGCGTT TGCCAGTATG CTGGTTGATG AGATTAATCG CGACAGCAAA GTTCAGCTTA ACGAGCAGAA GCTGGATGAG GCGATGATCA CACTCATCGT CACCGAGCGG CAAAAAGTCG ATCCTGCGTT TACGGAGCTT CGTTTGCTGG AAGATCTGGA TGATAAAAAA CTGATCAACC GGAGTAATGA GTTTAAACCC AGCGTCACGC TTAACGGGGA AACCAAAAGT GGCTTTGCGT GGCTACTGGA GTTCTACCCT GAGCTGACGC AGGCGCGGGT GCGAGCGGAT CGCATTCGTG ACAATAAGCC CGCCTCCCGA CTGCGAGTCA GGTTACGCAA AGAGAATTGG GAACAACTTA GCAGTATCTG GGAGCAGTTT TCCCGCCGTT ATATGCTGCA ATTCGAGCGT AGCGGCGCGT CTCTGGAACA GATTGCCGCC GAGGTGCTGC GCGATCCGGC GCTGTATATA CGCCAGAAGC CAAGCCAGGT GCAACAACGG CTGGTATCGA ATGAAGATAA TGGCCGTTTT GAAGTGGCGC AGCGGGAAGG CGAATTAGCC GCCAGCGAAT TTATGGCGGG CATGAAATAT GGCCATTTTC TTAAGCAACT GGCGTTACGC ACCAGTCTGC CGGTTAACGT CCTGCACCCG GTGTTAATGG CGATGCTGCG TGATGTTTTG CACGGAGATT CACGCTATTT AAGCGAGATC TCGTTGGACA ATATGACCCG CGCATTACAG GCGCGGATTA ATGCGCATTT TGCGCAGCGC CACGATTATC TGCCTCTCGA TTTTCAGGCT TCAACGTCGG TATTTGATTC CACGGCACGG CAGTTCAGAG AGGAGATTAG CGCTGAAATT GTGGGGAAAA ATGTGGACGA GAATGCGATA GACGATCCCC GTTCTCTCTA TCAAATACCG CCGTTGCGTT ATGACAGCGT CGATCCAGAA TTGCCGCTAT TAAAATACGA TTATCCGCAA CAGGTTTCTG TGTTTGGCAA ACTGCCTAAG CGGGCCATTC AGATCCCCAA ATATACGGGG GGCTCTACTA CGCCGGATTT TGTGTACCGT ATTGAGCGTC AGGACGCCGA CAGCGTTTAT TTACTGGTTG AAACTAAAGC AGAAAATATG CGCGTAGGCG ATCAGGTTAT TCTTGATGCG CAACGTAAAT TCTTCGATAT GCTGCGTCGG CAAAATATCA ATGTCGAGTT TGCGGAAGCG ACCAGCGCGC CGGCGGTATT TTCTACGATC AATGGCTTGA TTGAGGGGAA GGCAAACTAA
|
Protein sequence | MNILLEELPH QEQALAAILA SFTGIDHAQA DHNHYANPLI KERYDDKANI DVKMETGTGK TYVYTRLMYE LHQKYGLFKF VLVVPTPAIK EGARNFITSD YARQHFSQFY ENTRMELCTI NAGDFKVKSG RKNFPAQLLS FTDASRRDSH TIQVLLINAQ MLNSASMTRD DYDQTLLGGL TSPVKGLQMT RPVVIIDEPH RFARDNKFYR AIQAIQPQMI VRFGATFPDI VEGKGKNKCV RKDYYRRQPQ FDLNAVDSFN DGLVKGIDIY YPNLPEEQAN NRYIVDSVTA KKLILRRGSK IAEVGVGENL ADVDAGFEGS IEYAGSKMLS NDLELEAGMA LVPGTFGASY QELIIQEAID KHFDTEQANF LRSNEPENNA PRIKTLSLFF IDSIKSYRDD EGWLKLKFEC LLKKKLTQLI DDYQRKTLPR EVEYLSFLQA TLASLHSDNQ NVHAGYFGED RGSGDEAIQA EVDDILKNKE KLLSFSDHHG NWETRRFLFS KWTLREGWDN PNVFVIAKLR SSGSESSKIQ EVGRGLRLPV DENGHRVHQE EWPSRLSFLI GYDEKAFASM LVDEINRDSK VQLNEQKLDE AMITLIVTER QKVDPAFTEL RLLEDLDDKK LINRSNEFKP SVTLNGETKS GFAWLLEFYP ELTQARVRAD RIRDNKPASR LRVRLRKENW EQLSSIWEQF SRRYMLQFER SGASLEQIAA EVLRDPALYI RQKPSQVQQR LVSNEDNGRF EVAQREGELA ASEFMAGMKY GHFLKQLALR TSLPVNVLHP VLMAMLRDVL HGDSRYLSEI SLDNMTRALQ ARINAHFAQR HDYLPLDFQA STSVFDSTAR QFREEISAEI VGKNVDENAI DDPRSLYQIP PLRYDSVDPE LPLLKYDYPQ QVSVFGKLPK RAIQIPKYTG GSTTPDFVYR IERQDADSVY LLVETKAENM RVGDQVILDA QRKFFDMLRR QNINVEFAEA TSAPAVFSTI NGLIEGKAN
|
| |