Gene SNSL254_A2815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2815 
Symbol 
ID6484719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2755619 
End bp2757550 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content51% 
IMG OID642738138 
Productphage terminase large subunit 
Protein accessionYP_002041872 
Protein GI194445357 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.326238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCCG GAGAGCGCAG GGCGAATAAT GCCAACAGAG CCATAACTAA CGGGCTGATA 
GCGCTTCATA TTCCCGTACC GCTTACCACC GTGCAGTGGG CTGATGAGTA TTACTATCTG
CCAAAAGAGT CCTCCTACAC CCCCGGCAAA TGGGAAACGC TGCCGTTTCA GGTAGCGATA
ATGAACGCGA TGGGGTATGA ACTGATCCGC GTTGTAAACC TCATTAAGTC TGCCCGCGTG
GGCTATACCA AAATGTTGCT GGGGGTGGAA GGCTATTTCA TAGAGCACAA GTCGCGCAAC
AGCCTGCTGT TCCAGCCGAC CGACTCATCC GCTGAGGATT TTATGAAATC CCACGTGGAG
CCGACTATCA GGGATGTTCC TGTATTGCTG GAGCTGGCCC CCTGGTTCGG GCGTAAACAT
CGTGATAACA CGCTTACCCT GAAACGCTTT TCTTCCGGTG TCGGGTTCTG GTGCCTCGGC
GGTGCAGCAG CCAAAAACTA CCGTGAAAAA TCGGTGGATG TGGTCTGCTA TGACGAATTG
TCATCTTTTG AGCCGGATGT CGAGAAAGAA GGTTCGCCGA CGCTGCTGGG GGATAAACGT
ATTGAAGGTT CTGTCTGGCC TAAATCCATT CGGGGCTCCA CACCAAAAGT CAAAGGGTCA
TGCCAGATTG AAAAGGCGGC AAATGAATCG GCGCATTTTA TGCGTTTTTA TGTACCGTGT
CCGCATTGTG GCGAAGAACA GTACCTTAAA TTCGGTGATG GCAGTACGCC GTTCGGTCTG
AAATGGGAGA AAAGCAAGCC GGAGACGGTG TATTACCTTT GTGAACATAA TGGATGCGTG
ATCCGTCAAT CGGAACTTGA TCAGAAAGCA GGCCGCTGGA TTTGCGATAA CACAGGCATG
TGGACACGCG ATGGACTGGC TTATTTCAGC GCGTCCGGTG AGGAGGTTCC GCCGCCACGA
TCCATTACCT TTCATATCTG GACGGCTTAC AGTCCCTTTA CCACCTGGAT ACAGATTATT
TATGACTGGC TGGATGCGCT GAAAGATCCA AATGGTGTGA AAACCTTTAT AAACACCACG
TTGGGCGAGC CTTATGAAGA GGCGGTGGCC GAAAAACTCA GCCATGAGCT TTTGCTGGAA
AAAGTGATTC ATTATGCGGC GCCGGTTCCG GAGCGGGTGG TGTATCTGAC CGCTGGTATC
GACTCCCAGC GTAACCGTTA TGAAATGTAT GTCTGGGGCT GGGCGCCGGG CGAAGAGGCT
TTCCTTATTG ATAAGCAAAT TATCATGGGA CGGCATGATG ATGAAGATAC CCTGCAGCGT
GTGGATGCCG TCATTAATAA AAAATATCGT CATGCTGACG GGACGGATAT TTCCATTTCC
CGTATCTGCT GGGATATCGG CGGTATCGAT GCAGAAATCG TCTATAAACG CTCAAAAAAA
CACGGCATTT TCCGCGTGCT GCCTGTCAAA GGGGCCTCCG TTTACGGAAA ACCCGTTATT
ACCATGCCTA AAAAACGCAA CCAGAGCGGG GTATTCCTGT GCGAAATCGG TACTGATACT
GCCAAAGAAA TGCTTTACGC CAGAATGGGG GCGGTTACTG CGCCTGCCGA CGAAGCCACG
CCTTATGCGA TCCGCTTTCC GGATAATCCG GATGTTTTTA CGGAGGTGGA AGCGAAGCAA
CTGGTAGCCG AAGAGCTGGT GGAGAAACTG GTTAACGGAA AATTCCGGCT GTTATGGGAT
GCCAAAGGAC GTCGTAACGA AGCGCTGGAT TGTCTTGTCT ATGCCAGTGC AGCGTTACGG
GTGTCTGTGC AGCGCTGGCA ACTGGATCTG GAGGCGCTGG CGACATCAAG GAAAAGCGAA
GAGCAGGATA CCCCGACACT TGAACAACTG GCCGCAATGC TGGCAGGAGG AGTTAATGGC
AACAATCACT GA
 
Protein sequence
MISGERRANN ANRAITNGLI ALHIPVPLTT VQWADEYYYL PKESSYTPGK WETLPFQVAI 
MNAMGYELIR VVNLIKSARV GYTKMLLGVE GYFIEHKSRN SLLFQPTDSS AEDFMKSHVE
PTIRDVPVLL ELAPWFGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL
SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKVKGS CQIEKAANES AHFMRFYVPC
PHCGEEQYLK FGDGSTPFGL KWEKSKPETV YYLCEHNGCV IRQSELDQKA GRWICDNTGM
WTRDGLAYFS ASGEEVPPPR SITFHIWTAY SPFTTWIQII YDWLDALKDP NGVKTFINTT
LGEPYEEAVA EKLSHELLLE KVIHYAAPVP ERVVYLTAGI DSQRNRYEMY VWGWAPGEEA
FLIDKQIIMG RHDDEDTLQR VDAVINKKYR HADGTDISIS RICWDIGGID AEIVYKRSKK
HGIFRVLPVK GASVYGKPVI TMPKKRNQSG VFLCEIGTDT AKEMLYARMG AVTAPADEAT
PYAIRFPDNP DVFTEVEAKQ LVAEELVEKL VNGKFRLLWD AKGRRNEALD CLVYASAALR
VSVQRWQLDL EALATSRKSE EQDTPTLEQL AAMLAGGVNG NNH