Gene Information |
Locus tag | SeD_A2269 |
Symbol | |
ID | 6875310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2164192 |
End bp | 2165433 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642785367 |
Product | phage portal protein, HK97 family |
Protein accession | YP_002216029 |
Protein GI | 198246051 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
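The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2164192
end_bp = 2165433
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1242
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 413
```

Under these assumptions the computed values agree with the listed gene length of 1242 bp and protein length of 413 aa.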
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0000000000275865 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTCTTTT CGGGATTATT TCAACGAAAA AGTGACGCGC CGGTGACCAC GCCAGCAGAG CTGGCGGATG CTATCGGGCT GTCATACGAC ACCTATACCG GAAAGCAGAT CAGCAGTCAG CGGGCCATGC GACTGACGGC GGTTTTTTCC TGCGTCAGAG TGCTGGCAGA GTCGGTCGGG ATGTTGCCCT GCAACCTGTA TCACCTGAAC GGCAGCCTGA AACAGAGAGC CACTGGCGAA CGTCTGCATA AGCTGATCTC CACGCATCCC AATAGCTATA TGACGCCGCA GGAGTTCTGG GAGCTGGTGG TCACCTGTCT GTGCCTGCGG GGCAACTTTT ATGCCTACAA AGTGAAAGCA TTTGGCGAAG TGGCTGAACT GCTGCCCGTC GATCCCGGTT GCGTGGTGCC GAAGCTTAAC AGTAGCTGGG AGCCGGTCTA TCAGGTCACA TTCCCGGATG GCTCCACGGA TGTACTGAGC CAGGAGGATA TCTGGCATGT GCGCACGCTG ACGCTGGACG GACTGGTGGG GCTGAATCCC ATCGCCTATG CCCGCGAGGC AATATCGCTG GCGGCAGCGA CCGAAGAGCA CGGGGCCAGA CTGTTCAGCA ATGGCGCGGT GACGTCGGGT GTGTTGCGTA CAGAGCAGAC GCTGTCGGAT CAGGCTTATG AGCGCCTGAA GAAAGATTTT GAGGAGCGTC ACACCGGGCT TGGTAATGCT CACCGCCCGA TGATCCTTGA GATGGGGCTG GACTGGAAGT CGATGGCGCT GAACGCCGAG GACAGCCAGT TCCTGGAAAC CCGCAAGTTT CAGCTTGAAG AAATCTGTCG TCTGTTCCGG GTGCCATTGC ACATGGTGCA GAACACCGAT CGCGCCACCT TCAACAATAT TGAAGAGCTG GGGCTGGGAT TTATCAACTA TTCACTGGTG CCGTATCTGA CCCGCATCGA ACAGCGGATC AACACCGGAC TGGTACGAAA AAGTAAGCAG GGCGTTTTTT ACGCCAAATT TAACGCGGGG GCGTTACTGC GTGGGGATAT GAAGTCCCGT TTTGAAGCCT ATGCCACCGG GATCAACTGG GGGATTTACT CTCCCAATGA CTGCCGCGAC CTGGAAGATA TGAATCCGCG TCCCGGTGGT GATGTCTATC TCACACCGAT GAACATGACC ACGAAACCCT CCGATGGCAG TAAAGCCGGT AAGCAGAAGG ATAACGCCAA TGCAGACGAA ACAACGTCTT GA
|
Protein sequence | MFFSGLFQRK SDAPVTTPAE LADAIGLSYD TYTGKQISSQ RAMRLTAVFS CVRVLAESVG MLPCNLYHLN GSLKQRATGE RLHKLISTHP NSYMTPQEFW ELVVTCLCLR GNFYAYKVKA FGEVAELLPV DPGCVVPKLN SSWEPVYQVT FPDGSTDVLS QEDIWHVRTL TLDGLVGLNP IAYAREAISL AAATEEHGAR LFSNGAVTSG VLRTEQTLSD QAYERLKKDF EERHTGLGNA HRPMILEMGL DWKSMALNAE DSQFLETRKF QLEEICRLFR VPLHMVQNTD RATFNNIEEL GLGFINYSLV PYLTRIEQRI NTGLVRKSKQ GVFYAKFNAG ALLRGDMKSR FEAYATGINW GIYSPNDCRD LEDMNPRPGG DVYLTPMNMT TKPSDGSKAG KQKDNANADE TTS
|
| |
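The gene sequence above begins with GTG while the protein sequence begins with M, which is what a bacterial translation table produces when GTG serves as the initiation codon. The sketch below illustrates this on the first 30 and last 15 nucleotides of the listed sequence. The helper names `translate_cds` and `gc_fraction` are hypothetical, and the start codon set is an assumption covering only the common bacterial initiators, not the complete list for translation table 11. The codon-to-amino-acid mapping is the standard one.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna, start_codons=("ATG", "GTG", "TTG")):
    """Translate a coding sequence; a recognised first codon is read as M.

    Assumption: start_codons holds only the common bacterial initiators.
    """
    seq = "".join(dna.split()).upper()  # drop the spacing used in the record
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in start_codons:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nucleotides of the gene sequence above.
head = "GTGTTCTTTT CGGGATTATT TCAACGAAAA"
print(translate_cds(head))  # MFFSGLFQRK

# Last 15 nucleotides of the gene sequence above, ending in the TGA stop.
tail = "GAAACAACGTCTTGA"
print(translate_cds(tail))  # ETTS
```

Both fragments match the corresponding ends of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.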