Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2796 |
Symbol | |
ID | 6483987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2737465 |
End bp | 2740827 |
Gene Length | 3363 bp |
Protein Length | 1120 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642738120 |
Product | host specificity protein |
Protein accession | YP_002041854 |
Protein GI | 194443329 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAGG GCGGCGGAAA AGGGCATACG CCCCGCGAGG CACCGGATAA CCTTAAATCC ACGCAGCTGC TGAGCGTCAT CGATGCCATC AGCGAGGGAC CGATAGAAGG CCCGGTGAAC GGTCTGCAAA GTGTTCTGGT AAACCAGACG CCGGTGGTGG ACCGCGACGG TAACACGAAT ATCCACGGCG TGAAGGTGGT ATACCGCGTC GGTGAGCAGG AACAGACCCC GCTGGAGGGA TTTGAATCGT CCGGCGCCGA GACGGTGCTT GGTGTACAGG TCAAATACGA CAATCCGGTG ACCAGAACCA TCACGGCTGC AAATATTGAC CGCCTGCGTT TTACGTTCGG CGTGCAGTCA CTGGTGGAGG CCAACAGCAA GGGCGACCGC AATCCGACAT CGGTCAGGCT GCAAATCCAT CTTGAGCGCT ATGGTCAGTG GGTGGTGGAA AAAGAGATTA CGATTACCGG GAAAACAACC ACACAGTATC TGGCCTCGGT GATAGTGGAT AATCTCCCTC CCCGGCCATT TGGTATCCGG ATGGTACGTG TGACGGCAGA CAGTACCACT GACCAGTTAC AGAACAACAC GGTCTGGTCG TCGTATACCG AGATTATTGA TGTCCGGCAG CGCTATCCCA ACACCGCCGT AATTGGCCTG CAGGTGGAGT CTGAGCAGTT CGGCAGCCAG CAGGTGACGC GAAATTACCA TTTTTTCGGG CGGATTATTC ATGTGCCGTC GAATTACGAT CCGGTAGCGC GAACCTACAG CGGCATCTGG GACGGCACGT TCAAGCCTGC ATACAGCAAT AATCCGGCGT GGTGTCTCTG GGATGTGCTG ACACATCCCC GTTATGGCAT GGGACAGCGA ATCGGCGCGG CGGACGTGGA CAGGTGGGCG CTGTATGCAA TAGGCCAGTA CTGCGACCAG ATGGTCCCTG ACGGATTCGG CGGGACAGAG CCGCGTATGA CCTTTAATGC GTATCTGGCA CAGCAGCGTA AGGCGTGGGA TGTGCTGACC GACTTCTGCT CCGCCATGCG TTGTATGCCG GTGTGGAACG GGCAGATGAT GACCTTCGTG CAGGACAGGC CCTCGGATAC AGTCTGGACC TATACCCGCA GCAATGTGGT AATGCCGGAT GAGGGCACAC CGTTCCGTTA CAGCTTCAGT GCGCGGAAGG ACCGCCATAA TGCGGTAGAG GTGAACTGGA TCGACCCTGA TAATGGCTGG CAGACATCCA CGGAACTGGT GGAAGACACG GTCGCCATCA GTCACTACGG ACGCAATCTG GTAAAAATGG ATGCGTTTGG CTGTACCAGT CGCGGGCAGG CGCACCGCGC CGGGCTGTGG CTGATAAAAA CGGAGCTGCT GGAAACCCAG ACGGTTGATT TTAGTGTGGG TGCGGAGGGG CTGCGCCACG TTCCCGGTGA TGTGATTGAG GTTTGCGACG AGGATTATGC CGGCATCAGC CTGGGCGGGC GGATTCTGTC CGTTGACCGC GCCCGCCGCA TTCTGACCCT TGACAGGGAG ATTACCCTGC CGTCGTCCGG CACCACGCTG ATAAGCCTGA TGGATGGCGA AGGCTTGCCG GTCAGCGTGG ACGTGCAGTC TGTTACCGAC GGTGTGCAGG TTCAGGTCAG CCGGATACCG GACGGCGTGG CGGAATACAG CGTCTGGGGG CTGAAACTGC CGTCGCTGCG CCAGCGTCTC TTCCGGTGTG TGGCTGTCCG GGAAAACGAC GACGGAACGT ATGCCATCAC CGCCGTACAG CATGTGCCGG AAAAAGAGTC CATCGTGGAC AACGGGGCAT CATTCGATCC GCAACCCGGA ACGATTCACG GCACCGTTCC CCCGGCGATA CAGCATCTGA CCACAGAAAT TCTGGCGGAG GAGGGACAGT ATCAGGTACT GGCGCGCTGG GACACACCGC GAGTCGTTAA GGGCGCCTCG TTTTCTTTGC GCCTGAACGT GGCGGCGGAA GATGGCAGTG ACCGGCTGGT AAGCAGCGCA GGAACGCCGG ATACGCAGTA CCGGTTCCGG GGGCTGACGC CGGGGCGCTA CACCCTGTCC GTCAGGGCGG TGAACAGCCA GGGACAACAG GGAGACCCGG CCAGCACACA GTTCAGCATC TCCGCGCCGG CGGCACCATC ATTTATCGAA CTCACCCCTG GCTATTTCCA GATTACAGCC ACACCGCGTC AGGCGGTATA CGACCCGACG GTGCAGTATG AGTTCTGGTT TTCAGACGCG CAGATTACGG ATATCCATCA GGTGGAAAAC GCCGCACGAT ATCTGGGAAC GGCGCTGTAC TGGATAGCGG CCAGTGTGAA TATCAGGCCC GGCAGGGATT ACTATTTTTA TATCCGGGCG GTAAATCAGG TCGGTAAATC CGCATTCGTG GAGGCGACCG GGCAGGCCAG CAACGATGCC GCAGGCTATC TGGATTTTTT CAAAGGGCAG ATAACTGAAA GTCACCTGGG TAAGGAACTG CTGGAGAAGG TGGAACTGAC GGAGGATAAC GCCAGCAAAC TGCAGCAGTT TTCGAAGGAG TGGCAGGATG CTAACGATAA ATGGAACGCC ATGTGGGGCG TCAAAATAGA GCAGACCAAA GACGGCAAAT ATTATGTGGC CGGACTTGGA CTGAGCATGG AAGACATGCC TGACGGGAAG ATAAGCCAGT TCCTGGTGGC GGCGGATCGC ATTGCTTATA TTAACCCGGC AAACGGAAAC GAGACGCCCG GATTCGTCAT GCAGGGCGAC CAGATAATCA TGAATGAGGC GTTCCTGAAA TATCTGAGCG CGCCGACCAT TACCAGTGGC GGGAATCCTC CGGCATTTTC CCTGACGCCG GATGGAAAGC TGACTGCGAA AAATGCGGAT ATCAGCGGGC ATATCAACGC TGTATCTGGC TCGTTTACGG GAGAAATCAA TGCCACCTCC GGTAAGTTTT CTGGCGTGAT AGAAGCAAGA GAGTTTGTCG GTGATATCTG CGGCTCAAAA GTCATGCAGG GCGTGAGCAT CAGGGAGACG AACGACGAAC GCAGCACCTC AACACGGTAT ACCGACAGCG CCACCTATCA GATAGGGAAA ACCATCACGG TGATGGCTAA CTGTGAGCGT AACGGTGGCT CCGGTGCCAT CACCGTCACG ATAAATATTA ACGGCCAGGT GAAAACGGCG GAGGTTATCC CGTATACCGC AGGGCTTCCG GCCATGTATC AGACCGTTGT CTTTTCGGTC TACACCACTT CACCTGTCGT GGATATCAGC GTTTCTCTGA GGGTTCGTGG GCAGTACACC ACGTCTGCTT CCGTCTGGCC GCTGGTGATG GTTTCCCGGT CGGGGAACAA CTTCACAAAC TGA
|
Protein sequence | MGKGGGKGHT PREAPDNLKS TQLLSVIDAI SEGPIEGPVN GLQSVLVNQT PVVDRDGNTN IHGVKVVYRV GEQEQTPLEG FESSGAETVL GVQVKYDNPV TRTITAANID RLRFTFGVQS LVEANSKGDR NPTSVRLQIH LERYGQWVVE KEITITGKTT TQYLASVIVD NLPPRPFGIR MVRVTADSTT DQLQNNTVWS SYTEIIDVRQ RYPNTAVIGL QVESEQFGSQ QVTRNYHFFG RIIHVPSNYD PVARTYSGIW DGTFKPAYSN NPAWCLWDVL THPRYGMGQR IGAADVDRWA LYAIGQYCDQ MVPDGFGGTE PRMTFNAYLA QQRKAWDVLT DFCSAMRCMP VWNGQMMTFV QDRPSDTVWT YTRSNVVMPD EGTPFRYSFS ARKDRHNAVE VNWIDPDNGW QTSTELVEDT VAISHYGRNL VKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE VCDEDYAGIS LGGRILSVDR ARRILTLDRE ITLPSSGTTL ISLMDGEGLP VSVDVQSVTD GVQVQVSRIP DGVAEYSVWG LKLPSLRQRL FRCVAVREND DGTYAITAVQ HVPEKESIVD NGASFDPQPG TIHGTVPPAI QHLTTEILAE EGQYQVLARW DTPRVVKGAS FSLRLNVAAE DGSDRLVSSA GTPDTQYRFR GLTPGRYTLS VRAVNSQGQQ GDPASTQFSI SAPAAPSFIE LTPGYFQITA TPRQAVYDPT VQYEFWFSDA QITDIHQVEN AARYLGTALY WIAASVNIRP GRDYYFYIRA VNQVGKSAFV EATGQASNDA AGYLDFFKGQ ITESHLGKEL LEKVELTEDN ASKLQQFSKE WQDANDKWNA MWGVKIEQTK DGKYYVAGLG LSMEDMPDGK ISQFLVAADR IAYINPANGN ETPGFVMQGD QIIMNEAFLK YLSAPTITSG GNPPAFSLTP DGKLTAKNAD ISGHINAVSG SFTGEINATS GKFSGVIEAR EFVGDICGSK VMQGVSIRET NDERSTSTRY TDSATYQIGK TITVMANCER NGGSGAITVT ININGQVKTA EVIPYTAGLP AMYQTVVFSV YTTSPVVDIS VSLRVRGQYT TSASVWPLVM VSRSGNNFTN
|
| |