Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1086 |
Symbol | |
ID | 6484652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1093263 |
End bp | 1096613 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642736490 |
Product | host specificity protein J |
Protein accession | YP_002040249 |
Protein GI | 194446675 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAG GTGGAGGGAA GGGGCATACA CCACGTGAGG CGAAGGACGA TCTGAAGTCC ACACAACAAC TGAGCGTGAT TGATGCCCTC AGTGAGGGAC CGATAGTCGG CCCGGTGAAC GGTCTGCAGA GCGTGCTGAT TAATAACACG CCGGTGGTGG ACGCGGACGG TAACAGTAAT ATTCACGGCG TGACCGTGGT ATATCAGGTG GGGGAGACAC CACAGGCACC GCTGGAAGGT TTTGAGGCTT CCGGCGCGGA AACGGTGCTG GGTGTGGAAG TGAAACACGA TAATCCCGTT ACCCGTACTG TTGTCTCAGA GAATGTCGAC CGGCTACGCT TCACCTTTGG TGTACAGATG CTGCAGGAGA CCACGGACAA GGGGGACCGT AACCCGTCCT CCGTGAATCT GCTGATACAG TTTCAGCGTA GCGGGATCTG GAACACAGAA TTTGATATCA CTATTAACGG CAAGATCACA ACACAATATC TGGCATCGGT AGTGGCTGAT AATTTACCGC CGCGCCCGTT CAGTGTCCGC ATGGTCAGGG TGACACCGGA CAGCACCACC GACAGGCTTC AGAACAAAAC GCTGTGGTCG TCGTATACGG AAATCATCGA TATCCGGCAG GGTTATCCTG GCACAGCGGT TGCCGGTCTG CTGGTGGATG CGGAACAGTT CGGCAGCCAG CAGGTCACGC GTAACTACCA CCTGCGCGGA CGTATTTTTC AGGTCCCCTC AAACTATGAC CCGGATACCC GCACATATAC CGGCCTGTGG GACGGGGCGT TTAAACCGGC GTACACGAAT AACCCGGCGT GGTGCACGAT GGATAAACTG ACCCACCCCC GTTACGGGCT GGGCAGGCGT ATCGGGGGGG CGGATGTGGA TAAATGGGCG CTGTACGCCA TCGCGCAGTA CTGCGATCAA CCGGTGCCGG ACGGATTTGG CGGCACGGAA CCCCGCATGA CGCTTAATGC GTATATTACC ACCCAGCGTA AGGCGTATGA CGTTCTGGCG GATTTCTGCT CGGTGATGCG TTGTATGCCG GTATGGAATG GCCGCAAAAT GACCTTCATC CAGGACCGCC CCTCCGATAA AGCATGGACC TACACCAACG GTAACGTGGT GGGCGGGCGC TTTAAATACA GCTTCAGTGC CCTGAAAGAC CGCCATAACG CGGTAGAAGT GAGATACACC GATCCGCTGA ATGGCTGGCA AACCTCCACG GAGCTGGTGG AAGACCATGC CTCACAGGCC CGTTATGGAC GCAATCTGCT GAAAATGGAC GCGTTCGGCT GTACCTCACG TGGACAGGCG CACCGGACGG GGTTGTGGGT GATGATGACG GAGCTGCTGG AAACGCAGAC CGTGGATTTT TCTGTCGGTG CGGAAGGTCT GCGTCATACA CCGGGCGATA TTATTGAGGT CTGCGACAAC GATTACGCCG GGGCGTCGGT CGGTGGGCGT ATCACTGACC TGGATATTTC CACCCGCACG CTGACGCTTG ACCGGGAAAT AACACTACCG GAAAGCGGCG CCACCACGCT GAATATTGTC GGGCCTGACG GTAAGCCGTT CAGTACGGAG ATTCAGTCGC AGCCCGCACC GGATCGGGTG GTAACGAAAG TCCTGCCGGA AACCGTGCAG CCATACAGTA TCTGGGGGCT GAAACTGCCC TCCCTGAAGC GCCGCCTTTT CCGTTGCGTG CGTATTAAGG AGAATGACGA CGGCACATAC GCTATCACTG CCTTGCAGCA CGTTCCGGAA AAAGAGTCCA TCGTGGACAA CGGGGCGCAC TTTGACCCGT TACCGGGGAC CACCAACAGC ATTATTCCGC CCGCTGTGCA GCATCTGACA GTCAGCACGG ATAACGACAG TACCCTGTAT CAGGCCAAAG CGAAGTGGGG CACGCCGCGG GTGGTAAAAG ATGTGCGTTT TGTGGTGAGG CTGACCACAG GCAGTGGGAA CGAGGGCGAT CCGGTTCGTC TGGTGACAAC GGCGACGACC AGCGAAACGG AGTACGCCTT CCACGAACTG CCACTGGGTG ACTACACGCT GACAGTCAGG GCAATAAACG GTTACGGGCA GCAGGGTGAA CCGGCGTCCG TGGCATTCAG TATTCAGGCA CCGGAAGCGC CATCCACGAT TGAGATGACG CCGGGTTATT TTCAGATAAC GGTGACGCCG CACCAGACTG TCTACGATGC CAGTGTGCAG TATGAGTTCT GGTACTCCGC CACGCAACTG GCGACTGCCG CCGATATTCA GTCAAAAGCA CAGTATCTGG GCGTCGGGTC ATTCTGGATA AAGGATGGAC TGAAACCACT GCATGATGCC TGGTTTTACG TGCGCAGTGT AAATCTGGCT GGAAAATCAG TATTTGCGGA AGCATCCGGA CGTCCGGGGG ATGACGCGAA AGGGTATCTG GATTTTTTTA AGGGACTGAT TACGGAGACG TATCTTGGTA CGGAGTTGCT GAAAAAAATT GACCTGACGG AGAATAACGC CAGCAAACTG CAACAATTTT CGAAGGAGTG GCAGGACGCT AACGATAAAT GGAGCGCCAC GTGGGGCGTC AAAATAGAGC AGACCAAAGA CGGCAAATAT TATGTGGCCG GACTTGGACT GAGCATGGAA GACACGCCTG ACGGGAAGAT AAGCCAGTTC CTGGTGGCGG CGGATCGCAT TGCTTATATT AACCCGGCAA ACGGAAACGA GACGCCCGGA TTCGTCATGC AGGGCGACCA GATAATCATG AATGAGGCGT TCCTGAAATA CCTGAGCGCG CCGACCATTA CCAGTGGCGG GAATCCTCCG GCATTTTCCC TGACGCCGGA TGGAAAGCTG ACTGCGAAAA ATGCGGATAT CAGCGGCCAT ATCAACGCTG TATCTGGCTC GTTTACGGGA GAAATCAATG CCACCTCCGG TAAGTTTTCT GGCGTGATAG AAGCAAGAGA GTTTGTCGGT GATATCTGCG GCTCAAAAGT CATGCAGGGC GTGAGCATCA GGGCGACGAA CGACGAACGC AGCACCTCAA CACGGTATAC CGACAGCGCC ACCTATCAGA TAGGGAAAAC CATCACGGTG ATGGCTAACT GTGAGCGTAA CGGTGGCACC GGTGCCATCA CCGTCACGAT AAATATTAAC GGCCAGGTGA AAACGGCGGA GGTTATGCCG TATACCGCAG GGATTCCGGC CATGTATCAG ACCGTCGTCT TTTCGGTCTA CACCACTTCA CCTGTCGTGG ATATCAGCGT CTCTCTGAGG GTCCGTGGGC AGTACACCAC GTCTGCTTCC GTCTGGCCGC TGGTGATGGT TTCCCGGTCG GGGAGCAACT TCACAAACTG A
|
Protein sequence | MSKGGGKGHT PREAKDDLKS TQQLSVIDAL SEGPIVGPVN GLQSVLINNT PVVDADGNSN IHGVTVVYQV GETPQAPLEG FEASGAETVL GVEVKHDNPV TRTVVSENVD RLRFTFGVQM LQETTDKGDR NPSSVNLLIQ FQRSGIWNTE FDITINGKIT TQYLASVVAD NLPPRPFSVR MVRVTPDSTT DRLQNKTLWS SYTEIIDIRQ GYPGTAVAGL LVDAEQFGSQ QVTRNYHLRG RIFQVPSNYD PDTRTYTGLW DGAFKPAYTN NPAWCTMDKL THPRYGLGRR IGGADVDKWA LYAIAQYCDQ PVPDGFGGTE PRMTLNAYIT TQRKAYDVLA DFCSVMRCMP VWNGRKMTFI QDRPSDKAWT YTNGNVVGGR FKYSFSALKD RHNAVEVRYT DPLNGWQTST ELVEDHASQA RYGRNLLKMD AFGCTSRGQA HRTGLWVMMT ELLETQTVDF SVGAEGLRHT PGDIIEVCDN DYAGASVGGR ITDLDISTRT LTLDREITLP ESGATTLNIV GPDGKPFSTE IQSQPAPDRV VTKVLPETVQ PYSIWGLKLP SLKRRLFRCV RIKENDDGTY AITALQHVPE KESIVDNGAH FDPLPGTTNS IIPPAVQHLT VSTDNDSTLY QAKAKWGTPR VVKDVRFVVR LTTGSGNEGD PVRLVTTATT SETEYAFHEL PLGDYTLTVR AINGYGQQGE PASVAFSIQA PEAPSTIEMT PGYFQITVTP HQTVYDASVQ YEFWYSATQL ATAADIQSKA QYLGVGSFWI KDGLKPLHDA WFYVRSVNLA GKSVFAEASG RPGDDAKGYL DFFKGLITET YLGTELLKKI DLTENNASKL QQFSKEWQDA NDKWSATWGV KIEQTKDGKY YVAGLGLSME DTPDGKISQF LVAADRIAYI NPANGNETPG FVMQGDQIIM NEAFLKYLSA PTITSGGNPP AFSLTPDGKL TAKNADISGH INAVSGSFTG EINATSGKFS GVIEAREFVG DICGSKVMQG VSIRATNDER STSTRYTDSA TYQIGKTITV MANCERNGGT GAITVTININ GQVKTAEVMP YTAGIPAMYQ TVVFSVYTTS PVVDISVSLR VRGQYTTSAS VWPLVMVSRS GSNFTN
|
| |