Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2256 |
Symbol | |
ID | 6873006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2155979 |
End bp | 2157907 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642785356 |
Product | tail protein |
Protein accession | YP_002216018 |
Protein GI | 198244985 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.000173389 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG CTGAAAGGGA TGCAGCGGGA ACTGAAGGGA TTTCAGAAAG AAATGGCCGG GCTGGCGATC GGTGCTGCTG CTGCCGGGAC CGCTGTTCTT GGGGCGCTGG CGCTGCCCGT GAATGCTGCG ATCGGCTTTG AGTCAAAAAT GGCTGACATC CGGAAGGTGG TTGACGGCCT GGATGATAAA AAAGCATTCG CGCAGATGAG TGACGATATC CTGACGCTGT CCACACAGTT ACCGATGGCG GCGGAGGGAA TTGCAGAGAT CGTGGCGGCG GGCGGGCAGG CAGGCATTGC CCGCGGCGAT TTGATGCAGT TTGCGAACGA CGCAGTGAAA ATGGGTGTGG CGTTTGATAC CACTGCCGAA GAGTCCGGTC AGATGATGGC GCAGTGGCGG ACAGCGTTCA GACTGACGCA GGAAGACGTG GTTGTCCTGG CCGATAAAAT CAACTATCTG GGGAATACCG GCCCGGCAAA TGCGAAGAAA ATTTCTGATA TCGTGACGCG GATTGGTCCG CTTGGCGGTG TTGCCGGAGT GGCATCCGGC GAAATTGCCG CGATGGGCGC CACCATTGCC GGGATGGGGG TTGAATCGGA GATAGCCTCC ACCGGCATCA AAAACTTCAT GCTGTCGTTA ACCGCAGGTA ATTCGGCAAC CAAAGCCCAG AAACAGGCTA TGGCTTTCCT GAAGCTGAAT CCCCGGAAAC TCGCTGAGGA TATGCAAAAG GATTCGCGCG GGGCCATGCT GAAGGTGCTG GACTCGCTCG CGAAAGTGCC AAAAGCTAAA CAGGCCGCCG TCATGAATGC GCTGTTTGGC AAGGAGTCAC TTAGCGCGAT TGCCCCGCTG CTGACCAACC TGGATTTGTT ACGCACCAAT TTTGATCGTG TGGCTGATGC CCAGGAATAT GGCGGCTCGA TGCAGAAGGA ATACGCATCC CGCGCGGCCA CAACAGAAAA CCAGCTGGTT CTGCTGAAAA ACAGCGTCAA TGCGATTTCG GTAACGCTGG GCGATACCTT CCTGCCCGCC ATTAACGAAG CTGCAGAAGC GGTCATGCCT TACCTGGAGC AGCTCCGGAC ATTCGTTCGC GCGAATCCTG AACTGGTTCA GTCTGCGGCG AAGTTCGGCG CGGCGCTGCT GGCTGTTGGC GTATCCATTG GCAGCCTGTC CCGGGCTGTC AAAATCCTGA ACAGTGTCAT TAATCTCTCT CCGGCGAAAG TCGCCATTGC GGCGCTGGTG GCCGGCGCTA TGCTGATCAT TGAGAACTGG GACGATGTTG CTCCGGTGAT TAAGGCGGTA TGGCAGGAGG TCGATAACGT TGCGCAGGAG ATGGGCGGAT GGGAGACGGT GATTGAAGGG GTTGGTCTGG TTATGGCTGG TTCTTTTACC GTCAGGACCA TTGGTGCCCT GCAGCAGTCC GTCCTGCTGG CCGGACGGCT TTCCGGTCTG CTGGGTAAAA TTGGCCGGAT GGGGGCCATG ACGCTGACAA TTGGCGTGGC GGTGTCACTC TTTAAAGAGC TTAAGGATCT GGAGCAGGGG GCAAAGGATG CGGGTATGGA TGCTGGCGCA TTCGCTGTAC AGAAGCTGCA AACGAAGGAG CGTGAACGCG GGTATAACGG TTTTATTCCC AGACTCAAAG AGCTTCTTGG TATGGACACC CCGATTCCGC AGGGGCGTTA TCAACCTTAT GTGCCACTGA CCCGGCGTTC TGGCGTACTC GAGCGAGCTG TCCCGCCATC AACGCAGCGC AGCGAACTCA AAGTGACATT TGAGAATGCA CCACAAGGTA TGCGTGTGAC TGATATACCG AAATCCGGTA ATCAATTGAT GAACATCAGC CATGATGTGG GTTACTCACC CTTTCGTACA TCACGATAA
|
Protein sequence | MADSFQLKAI ITAVDQLSGP LKGMQRELKG FQKEMAGLAI GAAAAGTAVL GALALPVNAA IGFESKMADI RKVVDGLDDK KAFAQMSDDI LTLSTQLPMA AEGIAEIVAA GGQAGIARGD LMQFANDAVK MGVAFDTTAE ESGQMMAQWR TAFRLTQEDV VVLADKINYL GNTGPANAKK ISDIVTRIGP LGGVAGVASG EIAAMGATIA GMGVESEIAS TGIKNFMLSL TAGNSATKAQ KQAMAFLKLN PRKLAEDMQK DSRGAMLKVL DSLAKVPKAK QAAVMNALFG KESLSAIAPL LTNLDLLRTN FDRVADAQEY GGSMQKEYAS RAATTENQLV LLKNSVNAIS VTLGDTFLPA INEAAEAVMP YLEQLRTFVR ANPELVQSAA KFGAALLAVG VSIGSLSRAV KILNSVINLS PAKVAIAALV AGAMLIIENW DDVAPVIKAV WQEVDNVAQE MGGWETVIEG VGLVMAGSFT VRTIGALQQS VLLAGRLSGL LGKIGRMGAM TLTIGVAVSL FKELKDLEQG AKDAGMDAGA FAVQKLQTKE RERGYNGFIP RLKELLGMDT PIPQGRYQPY VPLTRRSGVL ERAVPPSTQR SELKVTFENA PQGMRVTDIP KSGNQLMNIS HDVGYSPFRT SR
|
| |