Gene SNSL254_A1079 details


Gene Information

Locus tag: SNSL254_A1079
Symbol:
ID: 6485093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1086396
End bp: 1089482
Gene Length: 3087 bp
Protein Length: 1028 aa
Translation table: 11
GC content: 56%
IMG OID: 642736484
Product: gifsy-1 prophage VmtH
Protein accession: YP_002040243
Protein GI: 194442929
COG category: [S] Function unknown
COG ID: [COG5281] Phage-related minor tail protein
TIGRFAM ID: [TIGR01541] phage tail tape measure protein, lambda family
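
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, that the last codon is a stop codon.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1086396, 1089482)
print(length)                     # 3087
print(protein_length_aa(length))  # 1028
```

Both results match the Gene Length and Protein Length fields of this record.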


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.497302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0993737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCCAGA AAGTCGGTGA TATCGTCATC AACATGGATG TTGATACAGC TAAAGTTGCC 
GCCGGTCTTC AGACTGCCAG TAACGGGCTG GGGAAGCTGG TGGACAGCAG TGATCTCGTT
GAAAAACGCA TCAAGCGATG TATGGAGTCC AGCGCCAGAA GTGTGGCGGC ATCGGCAAAA
AGTATCAGTG CCGCTATGGC GCAATCACAG GTTGCCACAC GCACACAGAG TGACGCTATG
GCACAACTGG CGCGTGAGGC GAACGAGGCC AGAGAAAGGG CTGTCGACCT GAATCAGAAG
TTAAGGGCGG AAGCTGCGCA GGCAGCGGTG GTTGCACAGG CTCAGGATGC AGCCGCAGCG
GCATTTTACC GTCAGATTGA CAGTGTAAAA CAGTTAAGCG GTGGTCTGCA GGAGTTACAG
CGTATCCAGG CGCAGGTACG ACAGGCGAAA GGACGCGGAG ATATTTCACA GGGCGATTAT
CTGGCGCTGG TGTCTGAAGC TGCTGCAAAG ACACGCGAAC TTACCGATGC GGAGGCGCTG
GCCACGCAGA AAAAAGCACA GTTTATACGT CGACTGAAAG AGCAGACGGC GGTACAGGGC
CTCTCCCGTA CTGAGTTGCT GCGGGTGAAG GCGGCTGAAC TGGGGGTTAG CAGTGCCGCC
GATGTCTATA TCCGCAAACT GGATACCGCA ACAAAATCCA CTCATGCACT AGGACTGAAA
TCAGCAATGG CGCGCCGCGA GATAGGCGTA CTGATTGGTG AACTGGCACG GGGAAATTTT
GGCGCCCTTC GCGGTTCCGG TATCACGCTG GCCAACCGGG CCGGGTGGAT TGAGCAACTG
ATGTCGCCGA AGGGCATGAT GCTCGGCGGG CTGGTTGGCG GTGTGGCTGC GGCGGTTTAC
GGACTGGGTA AGGCGTACTA TGAGGGGGCG AAAGAAAGTG AGGAGTTCAA TAAACAGCTT
ATTCTGACCG GAAGTTACGC CGGAAAAACC ACAGGCAAGC TTAATGAAAT GGCGAAGTCG
CTCGCCGGAA ATGGCGTCAC GCAGCACGAC GCGGCAGGAG TACTGACCCA GGTGGTCGGT
AGCGGAGCGT TTACCGGGCA GGCAGTGGCA ATGGTATCCC GTACCGCGGC CAGAATGCAG
GAAAACGTGG GGCAGTCAGT GGATGAAACC ATCCGCCAGT TTAAACGGCT GCAGGATGAT
CCGGTGAACG CGGCGAAAGA ACTGGACAGG GCACTGCATT TTCTGACTGC CACCCAGCTT
GAACAAATCA GGGTGCTCGG TGAACAGGGA AGAACGGCTG ATGCTGCGAA AATTGCCATG
TCCGCGTATT CGGAAGAGAT GAATAAACGG GCGGGTGACG TACATGATAA TCTGGGCTGG
ATCGAGAAGG CATGGAATGC CGTGGGTGAT GCGGCGAAGT GGGCGTGGGA TCGGATGCTG
GATATCGGGC GGGAAGATAC GCTCGATGAA AAAATCGCGA CACTGCAGGA AAAAATCGCG
CGCGCCAGAA AAACGCCATG GACAGTGTCT TCCTCCCAGA CTGAATATGA TCAGCAGCAA
CTGAACGAAC TTCAGGAACA GAAACGCCAG AAGGACCTGC TGGATGCGAA GGCGCAGGCA
GAGCGTAATT ATCAGAAGAC TCAGAAGCGC CGGAACGAGC AGAACGCCGC GCTGAACCGG
GATAATGAAA CTGAATCCCT GCGGCATCAA CGGGAGGTGG CGCGCATTAC CGCCATGCAG
TATGCCGATG CAGCGGTACG CAATGCCGCG CTGGAGCGTG AAAACGAACG CCATAAAAAA
GCAATGGCAC GGCAGAAGGA AAAGCCAAAG GCTTACTACA ACGACGAGGC CGGGCGACTG
CTTTTGCAGT ACAGCCAGCA ACAGGCGCAG ACTGAAGGGC TGATTGCCGC CGCGAAGCTT
TCCACGACCG AAAAAATGAC GGAAGCGCAT AAGCAGCTTT TGTCATTTCA GCAGCGCATC
GCTGATTTGT CCGGTAAAAA ACTGACGGCG GATGAACAAA GCGTACTGGC ACATAAGGAT
GAAATAGCGC TTGCGCTACA GAAGCTGGAT ATCTCACAAC AGGATTTGCA ACACCAGAAT
GCCTTTAATG AACTGAAGAA AAAGACGCTC ACATTAACCA GCCAGCTCGC TGACGAAGAA
TCCCGCGTCA GGCAGCAGCA CGCACTGGCG CTGGCCACAA TGGGTATGGG CGATCAGCAA
CGTGGCCGGT ACGAAGAGCA TCTGAAAATT CAACAGCACT ACCAGGAACA ACTGGAGCAG
CTTAAGCGCG ACAGCAAGGC AAAAGGGACA TACGGTTCTG ACGAATACCG TCAGGCGGAG
CAGGAACTTC AGGCCAGTCT CGATCGCCGA CTGGCTGAGT GGGCGGATTA TAACGCGAAA
GTGGATGCTG CGCAGGGAGA CTGGACGCAG GGCGCGTCGC GGGCGCTGGA TAACTTTCTG
GCGCAGGGGG GCAACGTGGC AGGCATGACG GAGAACGTTT TCACAAACGC ATTTAACGGC
ATGGCGGACA GTATCGCGAA TTTTGCCGTG ACCGGAAAGG GCAGTTTCCG GAGCCTGACG
GTCTCCATCC TGGCTGACCT GGCAAAAATG GAGGCACGTA TTGCGGCTTC TAAACTGTTG
GGTTCAGTAC TGGGTATGTT CGTCTTTGGC GCATCAGCAG GCGGAAGTAC ACCATCCGGG
GCATACAGTT CAGCGGCGCT GTCGGTCATT CCAAATGCGG ACGGCGGCGT GTACCGCTCA
GCAGGACTCA GTCAGTACAG CGGCAGTATT GTTAACAGAC CGACGTTCTT TGCATTTGCC
AGAGGGGCGG CAGTAATGGG AGAGGCCGGT CCGGAGGCTA TACTGCCGCT TCGTCGCGGT
ACTGACGGTA AGCTGGGGGT TGTGGCAGCA GGTTCCGGAG GGATGGCGAT GTTTGCACCG
CAGTATCATA TTGCAATCAG CAACACGGGG CCGGAACTGA CGCCGCAGGC GCTGAAGGCG
GTTTATGATC TGGGTAAAAA GGCGGCGGCT GATTTCGTGC AGCAGCAGGG GCGTGACGGC
GGCAGGCTGA GCGGGGCATA TCGATGA
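
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before computing statistics such as the GC content reported for this gene. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_percent`); it assumes the input contains only the letters A, C, G, T and whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence: six blocks of ten bases.
first_line = "TTGAGCCAGA AAGTCGGTGA TATCGTCATC AACATGGATG TTGATACAGC TAAAGTTGCC"
print(len(clean_sequence(first_line)))  # 60

# Toy input: 4 of the 8 bases are G or C.
print(gc_percent("ATGC GCTA"))  # 50.0
```

Applying `gc_percent` to the full sequence text should give a value that can be compared with the GC content field of the record.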
 
Protein sequence
MSQKVGDIVI NMDVDTAKVA AGLQTASNGL GKLVDSSDLV EKRIKRCMES SARSVAASAK 
SISAAMAQSQ VATRTQSDAM AQLAREANEA RERAVDLNQK LRAEAAQAAV VAQAQDAAAA
AFYRQIDSVK QLSGGLQELQ RIQAQVRQAK GRGDISQGDY LALVSEAAAK TRELTDAEAL
ATQKKAQFIR RLKEQTAVQG LSRTELLRVK AAELGVSSAA DVYIRKLDTA TKSTHALGLK
SAMARREIGV LIGELARGNF GALRGSGITL ANRAGWIEQL MSPKGMMLGG LVGGVAAAVY
GLGKAYYEGA KESEEFNKQL ILTGSYAGKT TGKLNEMAKS LAGNGVTQHD AAGVLTQVVG
SGAFTGQAVA MVSRTAARMQ ENVGQSVDET IRQFKRLQDD PVNAAKELDR ALHFLTATQL
EQIRVLGEQG RTADAAKIAM SAYSEEMNKR AGDVHDNLGW IEKAWNAVGD AAKWAWDRML
DIGREDTLDE KIATLQEKIA RARKTPWTVS SSQTEYDQQQ LNELQEQKRQ KDLLDAKAQA
ERNYQKTQKR RNEQNAALNR DNETESLRHQ REVARITAMQ YADAAVRNAA LERENERHKK
AMARQKEKPK AYYNDEAGRL LLQYSQQQAQ TEGLIAAAKL STTEKMTEAH KQLLSFQQRI
ADLSGKKLTA DEQSVLAHKD EIALALQKLD ISQQDLQHQN AFNELKKKTL TLTSQLADEE
SRVRQQHALA LATMGMGDQQ RGRYEEHLKI QQHYQEQLEQ LKRDSKAKGT YGSDEYRQAE
QELQASLDRR LAEWADYNAK VDAAQGDWTQ GASRALDNFL AQGGNVAGMT ENVFTNAFNG
MADSIANFAV TGKGSFRSLT VSILADLAKM EARIAASKLL GSVLGMFVFG ASAGGSTPSG
AYSSAALSVI PNADGGVYRS AGLSQYSGSI VNRPTFFAFA RGAAVMGEAG PEAILPLRRG
TDGKLGVVAA GSGGMAMFAP QYHIAISNTG PELTPQALKA VYDLGKKAAA DFVQQQGRDG
GRLSGAYR
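
The gene sequence starts with TTG while the protein sequence starts with M: under translation table 11 (the bacterial, archaeal and plant plastid code), TTG can act as a start codon, and an initiator codon is translated as methionine. The sketch below illustrates this under stated assumptions: the name `translate_cds` is hypothetical, and only ATG, GTG and TTG are treated as start codons, which is a simplification of the full set that table 11 allows.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; the amino-acid
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate_cds("TTGAGCCAGA AAGTCGGTGA TATCGTCATC"))  # MSQKVGDIVI
```

The output matches the first ten residues of the protein sequence in this record.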