Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4475 |
Symbol | |
ID | 6519163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 4341875 |
End bp | 4343800 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642749426 |
Product | tail protein |
Protein accession | YP_002117162 |
Protein GI | 194734652 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00323148 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG CTGAAAGGGA TGCAGCGGGA ACTGAAGGGA TTTCAGAAAG AAATGGCCGG GCTGGCGATC GGTGCTGCTG CTGCCGGGAC CGCTGTTCTT GGGGCGCTGG CGCTGCCCGT GAATGCTGCG ATCGGCTTTG AGTCAAAAAT GGCTGACATC CGGAAGGTGG TTGACGGCCT GGATGATAAA AAAGCATTCG CGCAGATGAG TGACGATATC CTGACGCTGT CCACACAGTT ACCGATGGCG GCGGAGGGAA TTGCAGAGAT CGTGGCGGCG GGCGGGCAGG CAGGCATTGC CCGCGGCGAT TTGATGCAGT TTGCGAACGA CGCAGTGAAA ATGGGTGTGG CGTTTGATAC CACTGCCGAA GAGTCCGGTC AGATGATGGC GCAGTGGCGG ACAGCGTTCA AACTGACGCA GGAAGACGTG GTTGTCCTGG CCGATAAAAT CAACTATCTG GGGAATACCG GCCCGGCAAA TGCGAAGAAA ATTTCTGATA TCGTGACGCG GATTGGTCCG CTTGGCGGTG TTGCCGGAGT GGCATCCGGC GAAATTGCCG CGATGGGCGC CACCATTGCC GGGATGGGGG TTGAATCGGA GATAGCATCC ACCGGCATCA AAAACTTTAT GTTGTCCCTT ACGGCGGGCA AATCGGCAAC GAAGTCGCAG AAGCGGGCAA TGGCCTTTCT GAAACTGAAT CCGGCGCAAC TGGCCGCAGA TATGCAGAAG GATTCGCGCG CGGCGATGCT GAAAGTGCTG GACTCACTGG CGAAGGTGCC GAAAGCAAAA CAGGCATCCG TCATGAATGC CCTGTTCGGG AAAGAGTCTT TAGGGGCGAT AGCGCCACTG CTGACTAATC TTGATTTACT GCGCACCAAT TTTAATCGTG TTGCGGATGC CCAGGAATAT GGCGGCTCGA TGCAGAAGGA ATATGCATCA CGCGCAGCCA CAACAGAAAA CCAGCTGGTT CTGCTGAAAA ACAGCGTCAA TGCGATTTCA GTGACGCTGG GTGATACTTT TCTGCCCGCC ATTAACGAAG CCGCAGAAGC GGTCATGCCT TACCTGGAGC AGCTCCGGAC ATTCGTTCGC GCGAATCCTG AACTGGTTCA GTCTGCGGCG AAGTTCGGTG CGGCGCTGCT GGCTGTTGGC GTATCCATCG GCAGCCTGTC CCGGGCTGTC AAAATCCTGA ACAGCGTCAT TAATCTCTCT CCGGCGAAAG TCGCCATTGC GGCGCTGGTG GCCGGCGCTA TGCTGATCAT TGAGAACTGG GACGATGTTG CTCCGGTGAT TAAGGCGGTA TGGCAGGAGG TCGATAACGT TGCGCAGGAG ATGGGCGGAT GGGAGACGGT GATTGAAGGG GTTGGTCTGG TTATGGCTGG TTCTTTTACC GTCAGGACCA TTGGTGCCCT GCAGCAGTCC GTCCTGCTGG CCGGACGGCT TTCCGGTCTG CTGGGTAAAA TTGGCCGGAT GGGGGCCATG ACGCTGACAA TTGGCGTGGC GGTGTCACTC TTTAAAGAGC TTAAGGATCT GGAGCAAGGG GCGAAGGATG CGGGTATGGA TGCTGGCGCA TTCGCTGTAC AGAAGCTGCA AACGAAGGAG CGTGAACGCG GGTATAACGG TTTTATTCCC AGACTCAAAG AGCTTCTTGG TATGGACACC CCGATTCCGC AGGGGCGTTA TCAACCTTAT GTGCCACTGA CCCGGCGTTC TGGCGTACTC GAGCGAGCTG TCCCGCCATC AACGCAGCGC AGTGAACTCA AAGTGACATT TGAGAATGCA CCACAGGGCA TGAGGGTGCT GGACATACCG AAAACGGGAA ATCCTTTAAT GAACATTACC CATGATGTAG GGTATTCTCC CTTCAGTAAT AAATAA
|
Protein sequence | MADSFQLKAI ITAVDQLSGP LKGMQRELKG FQKEMAGLAI GAAAAGTAVL GALALPVNAA IGFESKMADI RKVVDGLDDK KAFAQMSDDI LTLSTQLPMA AEGIAEIVAA GGQAGIARGD LMQFANDAVK MGVAFDTTAE ESGQMMAQWR TAFKLTQEDV VVLADKINYL GNTGPANAKK ISDIVTRIGP LGGVAGVASG EIAAMGATIA GMGVESEIAS TGIKNFMLSL TAGKSATKSQ KRAMAFLKLN PAQLAADMQK DSRAAMLKVL DSLAKVPKAK QASVMNALFG KESLGAIAPL LTNLDLLRTN FNRVADAQEY GGSMQKEYAS RAATTENQLV LLKNSVNAIS VTLGDTFLPA INEAAEAVMP YLEQLRTFVR ANPELVQSAA KFGAALLAVG VSIGSLSRAV KILNSVINLS PAKVAIAALV AGAMLIIENW DDVAPVIKAV WQEVDNVAQE MGGWETVIEG VGLVMAGSFT VRTIGALQQS VLLAGRLSGL LGKIGRMGAM TLTIGVAVSL FKELKDLEQG AKDAGMDAGA FAVQKLQTKE RERGYNGFIP RLKELLGMDT PIPQGRYQPY VPLTRRSGVL ERAVPPSTQR SELKVTFENA PQGMRVLDIP KTGNPLMNIT HDVGYSPFSN K
|
| |