Gene Information |
Locus tag | SNSL254_A2815 |
Symbol | |
ID | 6484719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2755619 |
End bp | 2757550 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642738138 |
Product | phage terminase large subunit |
Protein accession | YP_002041872 |
Protein GI | 194445357 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
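The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2755619, 2757550)
print(length)                     # → 1932
print(protein_length_aa(length))  # → 643
```

Both values agree with the Gene Length and Protein Length rows of this record.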
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.326238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTCCG GAGAGCGCAG GGCGAATAAT GCCAACAGAG CCATAACTAA CGGGCTGATA GCGCTTCATA TTCCCGTACC GCTTACCACC GTGCAGTGGG CTGATGAGTA TTACTATCTG CCAAAAGAGT CCTCCTACAC CCCCGGCAAA TGGGAAACGC TGCCGTTTCA GGTAGCGATA ATGAACGCGA TGGGGTATGA ACTGATCCGC GTTGTAAACC TCATTAAGTC TGCCCGCGTG GGCTATACCA AAATGTTGCT GGGGGTGGAA GGCTATTTCA TAGAGCACAA GTCGCGCAAC AGCCTGCTGT TCCAGCCGAC CGACTCATCC GCTGAGGATT TTATGAAATC CCACGTGGAG CCGACTATCA GGGATGTTCC TGTATTGCTG GAGCTGGCCC CCTGGTTCGG GCGTAAACAT CGTGATAACA CGCTTACCCT GAAACGCTTT TCTTCCGGTG TCGGGTTCTG GTGCCTCGGC GGTGCAGCAG CCAAAAACTA CCGTGAAAAA TCGGTGGATG TGGTCTGCTA TGACGAATTG TCATCTTTTG AGCCGGATGT CGAGAAAGAA GGTTCGCCGA CGCTGCTGGG GGATAAACGT ATTGAAGGTT CTGTCTGGCC TAAATCCATT CGGGGCTCCA CACCAAAAGT CAAAGGGTCA TGCCAGATTG AAAAGGCGGC AAATGAATCG GCGCATTTTA TGCGTTTTTA TGTACCGTGT CCGCATTGTG GCGAAGAACA GTACCTTAAA TTCGGTGATG GCAGTACGCC GTTCGGTCTG AAATGGGAGA AAAGCAAGCC GGAGACGGTG TATTACCTTT GTGAACATAA TGGATGCGTG ATCCGTCAAT CGGAACTTGA TCAGAAAGCA GGCCGCTGGA TTTGCGATAA CACAGGCATG TGGACACGCG ATGGACTGGC TTATTTCAGC GCGTCCGGTG AGGAGGTTCC GCCGCCACGA TCCATTACCT TTCATATCTG GACGGCTTAC AGTCCCTTTA CCACCTGGAT ACAGATTATT TATGACTGGC TGGATGCGCT GAAAGATCCA AATGGTGTGA AAACCTTTAT AAACACCACG TTGGGCGAGC CTTATGAAGA GGCGGTGGCC GAAAAACTCA GCCATGAGCT TTTGCTGGAA AAAGTGATTC ATTATGCGGC GCCGGTTCCG GAGCGGGTGG TGTATCTGAC CGCTGGTATC GACTCCCAGC GTAACCGTTA TGAAATGTAT GTCTGGGGCT GGGCGCCGGG CGAAGAGGCT TTCCTTATTG ATAAGCAAAT TATCATGGGA CGGCATGATG ATGAAGATAC CCTGCAGCGT GTGGATGCCG TCATTAATAA AAAATATCGT CATGCTGACG GGACGGATAT TTCCATTTCC CGTATCTGCT GGGATATCGG CGGTATCGAT GCAGAAATCG TCTATAAACG CTCAAAAAAA CACGGCATTT TCCGCGTGCT GCCTGTCAAA GGGGCCTCCG TTTACGGAAA ACCCGTTATT ACCATGCCTA AAAAACGCAA CCAGAGCGGG GTATTCCTGT GCGAAATCGG TACTGATACT GCCAAAGAAA TGCTTTACGC CAGAATGGGG GCGGTTACTG CGCCTGCCGA CGAAGCCACG CCTTATGCGA TCCGCTTTCC GGATAATCCG GATGTTTTTA CGGAGGTGGA AGCGAAGCAA CTGGTAGCCG AAGAGCTGGT GGAGAAACTG GTTAACGGAA AATTCCGGCT GTTATGGGAT GCCAAAGGAC GTCGTAACGA AGCGCTGGAT TGTCTTGTCT ATGCCAGTGC AGCGTTACGG 
GTGTCTGTGC AGCGCTGGCA ACTGGATCTG GAGGCGCTGG CGACATCAAG GAAAAGCGAA GAGCAGGATA CCCCGACACT TGAACAACTG GCCGCAATGC TGGCAGGAGG AGTTAATGGC AACAATCACT GA
|
Protein sequence | MISGERRANN ANRAITNGLI ALHIPVPLTT VQWADEYYYL PKESSYTPGK WETLPFQVAI MNAMGYELIR VVNLIKSARV GYTKMLLGVE GYFIEHKSRN SLLFQPTDSS AEDFMKSHVE PTIRDVPVLL ELAPWFGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKVKGS CQIEKAANES AHFMRFYVPC PHCGEEQYLK FGDGSTPFGL KWEKSKPETV YYLCEHNGCV IRQSELDQKA GRWICDNTGM WTRDGLAYFS ASGEEVPPPR SITFHIWTAY SPFTTWIQII YDWLDALKDP NGVKTFINTT LGEPYEEAVA EKLSHELLLE KVIHYAAPVP ERVVYLTAGI DSQRNRYEMY VWGWAPGEEA FLIDKQIIMG RHDDEDTLQR VDAVINKKYR HADGTDISIS RICWDIGGID AEIVYKRSKK HGIFRVLPVK GASVYGKPVI TMPKKRNQSG VFLCEIGTDT AKEMLYARMG AVTAPADEAT PYAIRFPDNP DVFTEVEAKQ LVAEELVEKL VNGKFRLLWD AKGRRNEALD CLVYASAALR VSVQRWQLDL EALATSRKSE EQDTPTLEQL AAMLAGGVNG NNH
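The relationship between the two sequences can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied despite the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names `clean` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping spaces and line breaks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last groups of the gene sequence shown above.
print(translate("ATGATTTCCG GAGAGCGCAG GGCGAATAAT"))  # → MISGERRANN
print(translate("AACAATCACT GA"))                     # → NNH
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record.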