Gene SNSL254_A4550 details


Gene Information

Locus tag: SNSL254_A4550
Symbol:
ID: 6484581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4421699
End bp: 4424053
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 58%
IMG OID: 642739776
Product: putative phage tail protein
Protein accession: YP_002043458
Protein GI: 194445074
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
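
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Sanity check of the record's length fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
# Assumption: Protein Length does not count the stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a complete CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4421699, 4424053)
print(length)                     # 2355
print(protein_length_aa(length))  # 784
```

Both values agree with the Gene Length (2355 bp) and Protein Length (784 aa) fields in the record.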


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.423372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.711096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCAACG ACATTATTAC TCAGCTTCAG GCGCGTAATG AGACGTTGAC GCAGGCAATA 
GCCCGTTACG GCTCACTCAA CGCCAGCACG CTGCACACGC TCAGCTTTGA GCAAACAAAA
ATCACCCGGC TTACGCAACA GCTCGCTAAC TCCGCCCTTC GCCGGGAAGA GAACGATAAA
CAGCGCGCTG GGTTGCTGGA AAAAACACAA ACGTTCGCCG GACAGCTCGG CAAGCTCCTG
AACGTTGAGA CTCCCGACTG GAAGCTGCCT TACGAATTTC ACGGCAACAT GGTCGATATG
GCGGCGAAAG GCGGCATGAA TAACACCGCG CGGGACGCCC TGAGCCTGAA TATCCGCGAC
TGGAGCCTTG ATTTCAATCA GGATCAAAAA GATCTGCAAA GCGCCGCCGC CACGATGATC
GAAGGCGGCG TCAGCGCATT GCAGGATCTT AGCCGCTACA TGCCCGATAT CGCCAAAGCC
GCAACCGCCT CCCGCGACAG CGCGCAAAGC TGGGCGCAGG CGGCTCTGGC CACTCGCGAC
AAACTGAACA TCGCCCCTGA CGGCTTCCGT TTTGCGCAAA ATATGCTGTA CAGCGTGGCA
AAAAGCGGCG GCGGCTCCGT TGCAGAACAA ACCCAGTGGA TTAACGCCTT TGCCGGAAAA
ACCGGCGCTC AGGGGAAAGA AGGCATTGCG GAACTGACCG CAACGATGCA AATCGCCATG
AAAAACGCCC CTGACGCAGG CGCGGCGGCA GCGAATTTCG ACCATTTCCT GAAATCCGCC
TTCTCGAAAG AGACGGACAG TTGGTTTGCC CGTCAGGGCG TGGATCTTCA GGGATCGCTG
CTGGAGCATC AGCAAAACGG GATCGGCGTG ACGGAAGCGA TGACCCACAT TGTGCAGATG
CAACTGGAGA AAATGAACCC GCAGATCCTC GACACCTTCA GGCAAACCAT GAAGATTGAG
GATCTTTCCG CGCGCGGCGA CGCGCTACAG GCCATGACGG AGAAATTTAA CCTCGGCGCG
ATGTTCGGCG ATGCGCAAAC GCGGGATTTT CTCGCCCCGA TGCTGGCGAA TATGGACGAA
TATCGCCAGC TAAAAGCCTC CGCAATGCAG GCGGCGGGGC AACATGTTAT TGATGATGAC
TTCGCCGCGA AAATGACATC GCCCGGAGAA CAGACCAAAG CGTTACAACT TTCACTTAAC
GATCTGTGGC TGACCGTCGG CCTGGCGCTG ATGCCCGCCA TTGGCGAACT GGCGCAAAGC
ATCACGCCGC TGGTACGGCA GTTCAGCGCC TGGCTGCGGG AAAATCCGGC GCTGGTGCAA
GGGATCGCCA AAATGGTTGG CGTTATCTGG CTGTTCAACG GAGCGCTGAA TATTCTCAGG
CTGGGAGCAA ACCTCATTGC GTCACCGTTT ATTCGCCTGA TCGATATCTT CCTGAAGGTC
AAAGCCGGTC TGGCGCTGGG CGGCGGCAGT CGCGCGCTGT CGGTTCTGAA ATCGTTTGGC
AACGGTGCGA AAAGCCTGAC GATGCTGCTG GGAAACGGCC TGATAAAAGG GCTACGGCTG
GTCGGCCAGA CATTTATCTG GCTGGGTCGG GCGCTGCTGA TGAACCCTGT TGGCCTGACC
ATCACCGCTA TCGCAGGCGC CGCCTATTTA CTTTATCGCT ACTGGGAACC GATTTCCGGT
TTCTTTGCCG GAGTCTGGGA GCGCATCAAA ACCGCCTTTG ACGGAGGCAT TGCAGGCGTC
ACGCGTCTGA TTCTCGACTG GTCGCCGCTG GGGCTGTTTT ACCGCGCCTT CGCCAGCGTA
CTGGACTGGT TTGGCATTGA ACTCCCCGCC AGCTTCAGCG AATTTGGCGG CAATATTCTG
GATAGCCTGA TCAACGGCAT TCTGAATGCG CTTCCTTTCC TGAACGGGGC GATTGAGAAG
ATAAAAGCGC TGATCCCCGA CTGGGCGAAA AGCGCGCTGG GCATCAGCGC TGAAATGCCG
TCTGTCGCCG CCGCCGTCCC CGGTATTGCC GGAACAATGG TCGCGCAACA GGCCAGCGCG
CCGCTGGTAT CGGGAGCGAA AGCGGTGACA ACCTCGGCCA AAACGATGGC CTCGCCGCAG
CCTATGAAGA TGAAAAGCGC CGCCACGCCG CCGACGCCAG CCGCGCTTCC CGGCAAATCC
GGCGGGAAAC CTTATACGCT GCCCTCCCGC ACGCAAAGCA ACGTGCAGGT ACACTTTTCC
CCGCAGGTTA CCGTGCAGGG AAGCGGCGCG AATGCCGCCA AAGATATCAA CAACGTGCTG
TCGCTGAGCA AACGCGAGCT GGAGAGAATG ATTAACGATG TCATGGCGCA ACAACGGCGC
CGGGAGTACG CGTAA
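
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and how the source database rounds the value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative inputs (not taken from the record):
print(gc_percent("ATGC"))         # 50.0
print(gc_percent("GGCC GGCC"))    # 100.0
```

Passing the full gene sequence to the function gives a value that can be compared with the GC content reported in the record.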
 
Protein sequence
MANDIITQLQ ARNETLTQAI ARYGSLNAST LHTLSFEQTK ITRLTQQLAN SALRREENDK 
QRAGLLEKTQ TFAGQLGKLL NVETPDWKLP YEFHGNMVDM AAKGGMNNTA RDALSLNIRD
WSLDFNQDQK DLQSAAATMI EGGVSALQDL SRYMPDIAKA ATASRDSAQS WAQAALATRD
KLNIAPDGFR FAQNMLYSVA KSGGGSVAEQ TQWINAFAGK TGAQGKEGIA ELTATMQIAM
KNAPDAGAAA ANFDHFLKSA FSKETDSWFA RQGVDLQGSL LEHQQNGIGV TEAMTHIVQM
QLEKMNPQIL DTFRQTMKIE DLSARGDALQ AMTEKFNLGA MFGDAQTRDF LAPMLANMDE
YRQLKASAMQ AAGQHVIDDD FAAKMTSPGE QTKALQLSLN DLWLTVGLAL MPAIGELAQS
ITPLVRQFSA WLRENPALVQ GIAKMVGVIW LFNGALNILR LGANLIASPF IRLIDIFLKV
KAGLALGGGS RALSVLKSFG NGAKSLTMLL GNGLIKGLRL VGQTFIWLGR ALLMNPVGLT
ITAIAGAAYL LYRYWEPISG FFAGVWERIK TAFDGGIAGV TRLILDWSPL GLFYRAFASV
LDWFGIELPA SFSEFGGNIL DSLINGILNA LPFLNGAIEK IKALIPDWAK SALGISAEMP
SVAAAVPGIA GTMVAQQASA PLVSGAKAVT TSAKTMASPQ PMKMKSAATP PTPAALPGKS
GGKPYTLPSR TQSNVQVHFS PQVTVQGSGA NAAKDINNVL SLSKRELERM INDVMAQQRR
REYA
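
The protein sequence can be derived from the gene sequence with translation table 11. Note that the gene begins with TTG, which encodes leucine internally but is read as methionine when it serves as the start codon. The following is a minimal sketch, assuming the standard NCBI codon ordering for table 11 and its usual set of alternative start codons; `translate` and its `is_cds_start` parameter are hypothetical names chosen for illustration.

```python
# Codon table built from the NCBI-style amino acid string (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: start codons recognised under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residue = "M"  # an initiating codon is always read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above:
print(translate("TTGGCCAACG ACATTATTAC TCAGCTTCAG"))       # MANDIITQLQ
# Last 15 bases, which are not a CDS start:
print(translate("CGGGAGTACG CGTAA", is_cds_start=False))   # REYA
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.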