Gene SNSL254_A2796 details

Gene Information

Locus tag: SNSL254_A2796
Symbol:
ID: 6483987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2737465
End bp: 2740827
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 57%
IMG OID: 642738120
Product: host specificity protein
Protein accession: YP_002041854
Protein GI: 194443329
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
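
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2737465
end_bp = 2740827
gene_length_bp = 3363
protein_length_aa = 1120

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS has one codon per residue plus one stop codon.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop equals Protein Length
```

Under these assumptions all three checks print True for this record.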


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAGG GCGGCGGAAA AGGGCATACG CCCCGCGAGG CACCGGATAA CCTTAAATCC 
ACGCAGCTGC TGAGCGTCAT CGATGCCATC AGCGAGGGAC CGATAGAAGG CCCGGTGAAC
GGTCTGCAAA GTGTTCTGGT AAACCAGACG CCGGTGGTGG ACCGCGACGG TAACACGAAT
ATCCACGGCG TGAAGGTGGT ATACCGCGTC GGTGAGCAGG AACAGACCCC GCTGGAGGGA
TTTGAATCGT CCGGCGCCGA GACGGTGCTT GGTGTACAGG TCAAATACGA CAATCCGGTG
ACCAGAACCA TCACGGCTGC AAATATTGAC CGCCTGCGTT TTACGTTCGG CGTGCAGTCA
CTGGTGGAGG CCAACAGCAA GGGCGACCGC AATCCGACAT CGGTCAGGCT GCAAATCCAT
CTTGAGCGCT ATGGTCAGTG GGTGGTGGAA AAAGAGATTA CGATTACCGG GAAAACAACC
ACACAGTATC TGGCCTCGGT GATAGTGGAT AATCTCCCTC CCCGGCCATT TGGTATCCGG
ATGGTACGTG TGACGGCAGA CAGTACCACT GACCAGTTAC AGAACAACAC GGTCTGGTCG
TCGTATACCG AGATTATTGA TGTCCGGCAG CGCTATCCCA ACACCGCCGT AATTGGCCTG
CAGGTGGAGT CTGAGCAGTT CGGCAGCCAG CAGGTGACGC GAAATTACCA TTTTTTCGGG
CGGATTATTC ATGTGCCGTC GAATTACGAT CCGGTAGCGC GAACCTACAG CGGCATCTGG
GACGGCACGT TCAAGCCTGC ATACAGCAAT AATCCGGCGT GGTGTCTCTG GGATGTGCTG
ACACATCCCC GTTATGGCAT GGGACAGCGA ATCGGCGCGG CGGACGTGGA CAGGTGGGCG
CTGTATGCAA TAGGCCAGTA CTGCGACCAG ATGGTCCCTG ACGGATTCGG CGGGACAGAG
CCGCGTATGA CCTTTAATGC GTATCTGGCA CAGCAGCGTA AGGCGTGGGA TGTGCTGACC
GACTTCTGCT CCGCCATGCG TTGTATGCCG GTGTGGAACG GGCAGATGAT GACCTTCGTG
CAGGACAGGC CCTCGGATAC AGTCTGGACC TATACCCGCA GCAATGTGGT AATGCCGGAT
GAGGGCACAC CGTTCCGTTA CAGCTTCAGT GCGCGGAAGG ACCGCCATAA TGCGGTAGAG
GTGAACTGGA TCGACCCTGA TAATGGCTGG CAGACATCCA CGGAACTGGT GGAAGACACG
GTCGCCATCA GTCACTACGG ACGCAATCTG GTAAAAATGG ATGCGTTTGG CTGTACCAGT
CGCGGGCAGG CGCACCGCGC CGGGCTGTGG CTGATAAAAA CGGAGCTGCT GGAAACCCAG
ACGGTTGATT TTAGTGTGGG TGCGGAGGGG CTGCGCCACG TTCCCGGTGA TGTGATTGAG
GTTTGCGACG AGGATTATGC CGGCATCAGC CTGGGCGGGC GGATTCTGTC CGTTGACCGC
GCCCGCCGCA TTCTGACCCT TGACAGGGAG ATTACCCTGC CGTCGTCCGG CACCACGCTG
ATAAGCCTGA TGGATGGCGA AGGCTTGCCG GTCAGCGTGG ACGTGCAGTC TGTTACCGAC
GGTGTGCAGG TTCAGGTCAG CCGGATACCG GACGGCGTGG CGGAATACAG CGTCTGGGGG
CTGAAACTGC CGTCGCTGCG CCAGCGTCTC TTCCGGTGTG TGGCTGTCCG GGAAAACGAC
GACGGAACGT ATGCCATCAC CGCCGTACAG CATGTGCCGG AAAAAGAGTC CATCGTGGAC
AACGGGGCAT CATTCGATCC GCAACCCGGA ACGATTCACG GCACCGTTCC CCCGGCGATA
CAGCATCTGA CCACAGAAAT TCTGGCGGAG GAGGGACAGT ATCAGGTACT GGCGCGCTGG
GACACACCGC GAGTCGTTAA GGGCGCCTCG TTTTCTTTGC GCCTGAACGT GGCGGCGGAA
GATGGCAGTG ACCGGCTGGT AAGCAGCGCA GGAACGCCGG ATACGCAGTA CCGGTTCCGG
GGGCTGACGC CGGGGCGCTA CACCCTGTCC GTCAGGGCGG TGAACAGCCA GGGACAACAG
GGAGACCCGG CCAGCACACA GTTCAGCATC TCCGCGCCGG CGGCACCATC ATTTATCGAA
CTCACCCCTG GCTATTTCCA GATTACAGCC ACACCGCGTC AGGCGGTATA CGACCCGACG
GTGCAGTATG AGTTCTGGTT TTCAGACGCG CAGATTACGG ATATCCATCA GGTGGAAAAC
GCCGCACGAT ATCTGGGAAC GGCGCTGTAC TGGATAGCGG CCAGTGTGAA TATCAGGCCC
GGCAGGGATT ACTATTTTTA TATCCGGGCG GTAAATCAGG TCGGTAAATC CGCATTCGTG
GAGGCGACCG GGCAGGCCAG CAACGATGCC GCAGGCTATC TGGATTTTTT CAAAGGGCAG
ATAACTGAAA GTCACCTGGG TAAGGAACTG CTGGAGAAGG TGGAACTGAC GGAGGATAAC
GCCAGCAAAC TGCAGCAGTT TTCGAAGGAG TGGCAGGATG CTAACGATAA ATGGAACGCC
ATGTGGGGCG TCAAAATAGA GCAGACCAAA GACGGCAAAT ATTATGTGGC CGGACTTGGA
CTGAGCATGG AAGACATGCC TGACGGGAAG ATAAGCCAGT TCCTGGTGGC GGCGGATCGC
ATTGCTTATA TTAACCCGGC AAACGGAAAC GAGACGCCCG GATTCGTCAT GCAGGGCGAC
CAGATAATCA TGAATGAGGC GTTCCTGAAA TATCTGAGCG CGCCGACCAT TACCAGTGGC
GGGAATCCTC CGGCATTTTC CCTGACGCCG GATGGAAAGC TGACTGCGAA AAATGCGGAT
ATCAGCGGGC ATATCAACGC TGTATCTGGC TCGTTTACGG GAGAAATCAA TGCCACCTCC
GGTAAGTTTT CTGGCGTGAT AGAAGCAAGA GAGTTTGTCG GTGATATCTG CGGCTCAAAA
GTCATGCAGG GCGTGAGCAT CAGGGAGACG AACGACGAAC GCAGCACCTC AACACGGTAT
ACCGACAGCG CCACCTATCA GATAGGGAAA ACCATCACGG TGATGGCTAA CTGTGAGCGT
AACGGTGGCT CCGGTGCCAT CACCGTCACG ATAAATATTA ACGGCCAGGT GAAAACGGCG
GAGGTTATCC CGTATACCGC AGGGCTTCCG GCCATGTATC AGACCGTTGT CTTTTCGGTC
TACACCACTT CACCTGTCGT GGATATCAGC GTTTCTCTGA GGGTTCGTGG GCAGTACACC
ACGTCTGCTT CCGTCTGGCC GCTGGTGATG GTTTCCCGGT CGGGGAACAA CTTCACAAAC
TGA
 
Protein sequence
MGKGGGKGHT PREAPDNLKS TQLLSVIDAI SEGPIEGPVN GLQSVLVNQT PVVDRDGNTN 
IHGVKVVYRV GEQEQTPLEG FESSGAETVL GVQVKYDNPV TRTITAANID RLRFTFGVQS
LVEANSKGDR NPTSVRLQIH LERYGQWVVE KEITITGKTT TQYLASVIVD NLPPRPFGIR
MVRVTADSTT DQLQNNTVWS SYTEIIDVRQ RYPNTAVIGL QVESEQFGSQ QVTRNYHFFG
RIIHVPSNYD PVARTYSGIW DGTFKPAYSN NPAWCLWDVL THPRYGMGQR IGAADVDRWA
LYAIGQYCDQ MVPDGFGGTE PRMTFNAYLA QQRKAWDVLT DFCSAMRCMP VWNGQMMTFV
QDRPSDTVWT YTRSNVVMPD EGTPFRYSFS ARKDRHNAVE VNWIDPDNGW QTSTELVEDT
VAISHYGRNL VKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE
VCDEDYAGIS LGGRILSVDR ARRILTLDRE ITLPSSGTTL ISLMDGEGLP VSVDVQSVTD
GVQVQVSRIP DGVAEYSVWG LKLPSLRQRL FRCVAVREND DGTYAITAVQ HVPEKESIVD
NGASFDPQPG TIHGTVPPAI QHLTTEILAE EGQYQVLARW DTPRVVKGAS FSLRLNVAAE
DGSDRLVSSA GTPDTQYRFR GLTPGRYTLS VRAVNSQGQQ GDPASTQFSI SAPAAPSFIE
LTPGYFQITA TPRQAVYDPT VQYEFWFSDA QITDIHQVEN AARYLGTALY WIAASVNIRP
GRDYYFYIRA VNQVGKSAFV EATGQASNDA AGYLDFFKGQ ITESHLGKEL LEKVELTEDN
ASKLQQFSKE WQDANDKWNA MWGVKIEQTK DGKYYVAGLG LSMEDMPDGK ISQFLVAADR
IAYINPANGN ETPGFVMQGD QIIMNEAFLK YLSAPTITSG GNPPAFSLTP DGKLTAKNAD
ISGHINAVSG SFTGEINATS GKFSGVIEAR EFVGDICGSK VMQGVSIRET NDERSTSTRY
TDSATYQIGK TITVMANCER NGGSGAITVT ININGQVKTA EVIPYTAGLP AMYQTVVFSV
YTTSPVVDIS VSLRVRGQYT TSASVWPLVM VSRSGNNFTN
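
The protein sequence can be spot-checked against the gene sequence by translating codons directly. This is a minimal sketch and not the tool that produced the record: it assumes the elongation codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons), and the helper name `translate` is hypothetical.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Assumption: translation table 11 uses these same assignments for
# elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_bases = "ATGGGAAAGG GCGGCGGAAA AGGGCATACG"
# Final 63 bases of the gene sequence, including the TGA stop codon.
last_bases = (
    "ACGTCTGCTT CCGTCTGGCC GCTGGTGATG GTTTCCCGGT CGGGGAACAA CTTCACAAAC TGA"
)

print(translate(first_bases))  # MGKGGGKGHT
print(translate(last_bases))   # TSASVWPLVMVSRSGNNFTN
```

Both outputs match the corresponding ends of the protein sequence listed above. Each 60-base line of the gene sequence holds exactly 20 codons, so any full line can be passed to the function in frame.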