Gene Information |
Locus tag | SNSL254_A1079 |
Symbol | |
ID | 6485093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1086396 |
End bp | 1089482 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642736484 |
Product | gifsy-1 prophage VmtH |
Protein accession | YP_002040243 |
Protein GI | 194442929 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
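
The coordinates and lengths in the table above are mutually consistent, which the short Python sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, although the reported values agree with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1086396
END_BP = 1089482
GENE_LENGTH_BP = 3087
PROTEIN_LENGTH_AA = 1028


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, the interval length equals the reported Gene Length, and the codon count minus the stop codon equals the reported Protein Length.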
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.497302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.0993737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAGCCAGA AAGTCGGTGA TATCGTCATC AACATGGATG TTGATACAGC TAAAGTTGCC GCCGGTCTTC AGACTGCCAG TAACGGGCTG GGGAAGCTGG TGGACAGCAG TGATCTCGTT GAAAAACGCA TCAAGCGATG TATGGAGTCC AGCGCCAGAA GTGTGGCGGC ATCGGCAAAA AGTATCAGTG CCGCTATGGC GCAATCACAG GTTGCCACAC GCACACAGAG TGACGCTATG GCACAACTGG CGCGTGAGGC GAACGAGGCC AGAGAAAGGG CTGTCGACCT GAATCAGAAG TTAAGGGCGG AAGCTGCGCA GGCAGCGGTG GTTGCACAGG CTCAGGATGC AGCCGCAGCG GCATTTTACC GTCAGATTGA CAGTGTAAAA CAGTTAAGCG GTGGTCTGCA GGAGTTACAG CGTATCCAGG CGCAGGTACG ACAGGCGAAA GGACGCGGAG ATATTTCACA GGGCGATTAT CTGGCGCTGG TGTCTGAAGC TGCTGCAAAG ACACGCGAAC TTACCGATGC GGAGGCGCTG GCCACGCAGA AAAAAGCACA GTTTATACGT CGACTGAAAG AGCAGACGGC GGTACAGGGC CTCTCCCGTA CTGAGTTGCT GCGGGTGAAG GCGGCTGAAC TGGGGGTTAG CAGTGCCGCC GATGTCTATA TCCGCAAACT GGATACCGCA ACAAAATCCA CTCATGCACT AGGACTGAAA TCAGCAATGG CGCGCCGCGA GATAGGCGTA CTGATTGGTG AACTGGCACG GGGAAATTTT GGCGCCCTTC GCGGTTCCGG TATCACGCTG GCCAACCGGG CCGGGTGGAT TGAGCAACTG ATGTCGCCGA AGGGCATGAT GCTCGGCGGG CTGGTTGGCG GTGTGGCTGC GGCGGTTTAC GGACTGGGTA AGGCGTACTA TGAGGGGGCG AAAGAAAGTG AGGAGTTCAA TAAACAGCTT ATTCTGACCG GAAGTTACGC CGGAAAAACC ACAGGCAAGC TTAATGAAAT GGCGAAGTCG CTCGCCGGAA ATGGCGTCAC GCAGCACGAC GCGGCAGGAG TACTGACCCA GGTGGTCGGT AGCGGAGCGT TTACCGGGCA GGCAGTGGCA ATGGTATCCC GTACCGCGGC CAGAATGCAG GAAAACGTGG GGCAGTCAGT GGATGAAACC ATCCGCCAGT TTAAACGGCT GCAGGATGAT CCGGTGAACG CGGCGAAAGA ACTGGACAGG GCACTGCATT TTCTGACTGC CACCCAGCTT GAACAAATCA GGGTGCTCGG TGAACAGGGA AGAACGGCTG ATGCTGCGAA AATTGCCATG TCCGCGTATT CGGAAGAGAT GAATAAACGG GCGGGTGACG TACATGATAA TCTGGGCTGG ATCGAGAAGG CATGGAATGC CGTGGGTGAT GCGGCGAAGT GGGCGTGGGA TCGGATGCTG GATATCGGGC GGGAAGATAC GCTCGATGAA AAAATCGCGA CACTGCAGGA AAAAATCGCG CGCGCCAGAA AAACGCCATG GACAGTGTCT TCCTCCCAGA CTGAATATGA TCAGCAGCAA CTGAACGAAC TTCAGGAACA GAAACGCCAG AAGGACCTGC TGGATGCGAA GGCGCAGGCA GAGCGTAATT ATCAGAAGAC TCAGAAGCGC CGGAACGAGC AGAACGCCGC GCTGAACCGG GATAATGAAA CTGAATCCCT GCGGCATCAA CGGGAGGTGG CGCGCATTAC CGCCATGCAG TATGCCGATG CAGCGGTACG CAATGCCGCG CTGGAGCGTG AAAACGAACG CCATAAAAAA 
GCAATGGCAC GGCAGAAGGA AAAGCCAAAG GCTTACTACA ACGACGAGGC CGGGCGACTG CTTTTGCAGT ACAGCCAGCA ACAGGCGCAG ACTGAAGGGC TGATTGCCGC CGCGAAGCTT TCCACGACCG AAAAAATGAC GGAAGCGCAT AAGCAGCTTT TGTCATTTCA GCAGCGCATC GCTGATTTGT CCGGTAAAAA ACTGACGGCG GATGAACAAA GCGTACTGGC ACATAAGGAT GAAATAGCGC TTGCGCTACA GAAGCTGGAT ATCTCACAAC AGGATTTGCA ACACCAGAAT GCCTTTAATG AACTGAAGAA AAAGACGCTC ACATTAACCA GCCAGCTCGC TGACGAAGAA TCCCGCGTCA GGCAGCAGCA CGCACTGGCG CTGGCCACAA TGGGTATGGG CGATCAGCAA CGTGGCCGGT ACGAAGAGCA TCTGAAAATT CAACAGCACT ACCAGGAACA ACTGGAGCAG CTTAAGCGCG ACAGCAAGGC AAAAGGGACA TACGGTTCTG ACGAATACCG TCAGGCGGAG CAGGAACTTC AGGCCAGTCT CGATCGCCGA CTGGCTGAGT GGGCGGATTA TAACGCGAAA GTGGATGCTG CGCAGGGAGA CTGGACGCAG GGCGCGTCGC GGGCGCTGGA TAACTTTCTG GCGCAGGGGG GCAACGTGGC AGGCATGACG GAGAACGTTT TCACAAACGC ATTTAACGGC ATGGCGGACA GTATCGCGAA TTTTGCCGTG ACCGGAAAGG GCAGTTTCCG GAGCCTGACG GTCTCCATCC TGGCTGACCT GGCAAAAATG GAGGCACGTA TTGCGGCTTC TAAACTGTTG GGTTCAGTAC TGGGTATGTT CGTCTTTGGC GCATCAGCAG GCGGAAGTAC ACCATCCGGG GCATACAGTT CAGCGGCGCT GTCGGTCATT CCAAATGCGG ACGGCGGCGT GTACCGCTCA GCAGGACTCA GTCAGTACAG CGGCAGTATT GTTAACAGAC CGACGTTCTT TGCATTTGCC AGAGGGGCGG CAGTAATGGG AGAGGCCGGT CCGGAGGCTA TACTGCCGCT TCGTCGCGGT ACTGACGGTA AGCTGGGGGT TGTGGCAGCA GGTTCCGGAG GGATGGCGAT GTTTGCACCG CAGTATCATA TTGCAATCAG CAACACGGGG CCGGAACTGA CGCCGCAGGC GCTGAAGGCG GTTTATGATC TGGGTAAAAA GGCGGCGGCT GATTTCGTGC AGCAGCAGGG GCGTGACGGC GGCAGGCTGA GCGGGGCATA TCGATGA
Protein sequence | MSQKVGDIVI NMDVDTAKVA AGLQTASNGL GKLVDSSDLV EKRIKRCMES SARSVAASAK SISAAMAQSQ VATRTQSDAM AQLAREANEA RERAVDLNQK LRAEAAQAAV VAQAQDAAAA AFYRQIDSVK QLSGGLQELQ RIQAQVRQAK GRGDISQGDY LALVSEAAAK TRELTDAEAL ATQKKAQFIR RLKEQTAVQG LSRTELLRVK AAELGVSSAA DVYIRKLDTA TKSTHALGLK SAMARREIGV LIGELARGNF GALRGSGITL ANRAGWIEQL MSPKGMMLGG LVGGVAAAVY GLGKAYYEGA KESEEFNKQL ILTGSYAGKT TGKLNEMAKS LAGNGVTQHD AAGVLTQVVG SGAFTGQAVA MVSRTAARMQ ENVGQSVDET IRQFKRLQDD PVNAAKELDR ALHFLTATQL EQIRVLGEQG RTADAAKIAM SAYSEEMNKR AGDVHDNLGW IEKAWNAVGD AAKWAWDRML DIGREDTLDE KIATLQEKIA RARKTPWTVS SSQTEYDQQQ LNELQEQKRQ KDLLDAKAQA ERNYQKTQKR RNEQNAALNR DNETESLRHQ REVARITAMQ YADAAVRNAA LERENERHKK AMARQKEKPK AYYNDEAGRL LLQYSQQQAQ TEGLIAAAKL STTEKMTEAH KQLLSFQQRI ADLSGKKLTA DEQSVLAHKD EIALALQKLD ISQQDLQHQN AFNELKKKTL TLTSQLADEE SRVRQQHALA LATMGMGDQQ RGRYEEHLKI QQHYQEQLEQ LKRDSKAKGT YGSDEYRQAE QELQASLDRR LAEWADYNAK VDAAQGDWTQ GASRALDNFL AQGGNVAGMT ENVFTNAFNG MADSIANFAV TGKGSFRSLT VSILADLAKM EARIAASKLL GSVLGMFVFG ASAGGSTPSG AYSSAALSVI PNADGGVYRS AGLSQYSGSI VNRPTFFAFA RGAAVMGEAG PEAILPLRRG TDGKLGVVAA GSGGMAMFAP QYHIAISNTG PELTPQALKA VYDLGKKAAA DFVQQQGRDG GRLSGAYR
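
The gene sequence begins with TTG, whereas the protein sequence begins with M. This is expected under translation table 11, which allows TTG as an alternative initiation codon; an initiation codon is read as methionine regardless of its usual meaning. The Python sketch below illustrates this on the first 30 nucleotides of the gene sequence only. The helper name `translate_cds` is hypothetical, the start codon set is taken from the NCBI listing for table 11 as best understood here, and the code is a minimal illustration rather than the method used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence, spacing as in the record.
FIRST_30_NT = "TTGAGCCAGA AAGTCGGTGA TATCGTCATC"
print(translate_cds(FIRST_30_NT))  # MSQKVGDIVI
```

The output matches the first ten residues of the protein sequence above. With the default meaning of TTG (leucine), the first residue would instead be L.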