Gene Information |
Locus tag | SeAg_B2844 |
Symbol | |
ID | 6797165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 2789047 |
End bp | 2791854 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642777017 |
Product | phage tail tape measure protein |
Protein accession | YP_002147631 |
Protein GI | 197249129 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
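The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but it is the only reading that reproduces the listed 2808 bp) and that the final codon of the CDS is a stop codon. The function name `feature_lengths` is hypothetical and used only for illustration.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # both end points are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the last codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(2789047, 2791854)
print(gene_len, protein_len)  # 2808 935
```

Because the gene is on the minus strand, the coding sequence runs from End bp down to Start bp on the replicon; the length arithmetic is unaffected by orientation.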
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.581537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA GCTCACCCGC CCATTTCGAT CCGCGCAGGC CAGCTCAAAA GAGCTGGCTA CAGCGATTCA GCAAAGCCGC GCCCGTTTAA AAGAATTAGA TGCTCAGGCG GGCCGCATTG ACGGTTTCCG CAAGGCCAGC GCGCAGCTGG CAGTCACCGG TAACAGCCTG AAAGCCGCAC GCGAAGAAGC TGCAAAACTT GCCACGCAAT TCTCTGCCAC CAACCGCCCG ACGGCGGCGC AGGCACGGTT GCTTGAGCAG GCAAAAAACC GCGTTACGGA GTTACAGAGC AAATATAACG GTCTACGTCA GTCGGTGCAG CGCCAGCGTC TTGCGCTCAA TGAAGCCGGG CTGGACACGA AAAAGCTCAG TAGTGCGCAG CGGGAACTGC GGCAGAATGC CGACGAAACC CGGCAGGCCC TGGACCGGCA GCAGAAATCC CTTAAACGCC TGGGCGAACA GCAGGCGCGA ATGAACGCCG TCCGCGATCA GTATTCACGC CGTCTTGAGG TGCGGGATCG CATCGCCGGG GCAGGGGCTA CCACTACGGC TGCGGGGGTG GCAATGGGCG CACCTGTTGT GGCGGCAGTT AAGAGCTACG CCAGCATGGA AGATGCCATG AAAGGCGTGG CAAAGCAGGT AAACAGGCTG CGGGACGATA ATGGCAACCG CACAAAACAG TTTTATGACA TGCAGGATGC CATCAAGGCC GCCAGCGAAC AGCTGCCGAT GGAGAACGGC GCTATAGACT ATGCCGCGCT GGTTGAAGGT GGTGCTCGCA TGGGGGTGAC CAATCAGGAC GATCCTTACG AAGAGCAGAA ACGTGACCTG CTGGCTTTTG CATCCACGGC GGCAAAAGCG GCAACGGCCT TTGAGCTGCC CGCAGATGAA CTGGCAGAAG GGCTGGGGAA AATCGCGCAG CTCTATAAAG TTCCGACGCG CAATATTGAA CAACTGGGCG ATGCGCTGAA CTACCTGGAC GATAACGCCA TGTCAAAGGG TGGGGACATT ATCAACGTCC TGCAGCGTAT GGGGGGCGTG GCTGACCGCC TTGACTTCCG AAAGGCCGCG GCGCTGGGTT CAACATTCCT TTCTCTTGGG GCTGCCCCGG AAATCGCCGC CAGCGCCTCT AATGCCATGG TGCGTGAACT GTCCATTGCC ACCATGCAAA GTAAACGATT TTTTGAAGGC ATGAATCTGT TGCAACTCAA TCCGGCGGAG ATTGAAAAGC AGATGACCAC CGATGCCATG GGCACAATTC AGCGGGTTCT GGAGAAGGTC AACAATCTGC CGCAGGATAA ACGCCTGTCA GCCATGACAA TGATTTTTGG CAAAGAGTTT GGCGATGATG CGGCAAAGCT GGCTAACAAC CTGCCGGAGC TGCAGCGCCA GCTGAAACTC ACATCAGGCA GTGGTGCTAA TGGCTCCATG CAGAAAGAAT CCGACATTAA CAAGGATTCA TTGTCTGCGC AGTGGTTGCT GGTTAAGACG GGCGCGCAAA ACGCTTTCAG CAGCCTGGGG GAAACGCTGC GCCAGCCGCT GATGGATATT ATGGGCATGG TTAAGCGCGT GACCGGGGCG TTGCGTCGCT GGGTTGAGCA GAATCCCGTG CTGGCTGGCA CGCTGATGAA AGTGGCGGCA GCTACGGCAG CCATTACTGT TGGGTTGGGG GCGCTGGCAG TGGCGGTGGC TGCTGTGCTG GGACCGCTGG CGGTTATCCG GTTTGGCCTG TCCATGCTGT CAGTTAAAGC GTTACCTTCT 
GCAGCCGCCG CTGCCACACG TACAGGTAGC GTGCTGCGTC TGTTGATCTC TGGTCCGCTG GCTTTGCTGC GCGTGGCATT ATTTGCTGTT GGTAGCCTGC TGGGGGCGCT GCTCAGTCCT GTAGGGCTGG TTGTGGCTGC ACTGGCAGGC GTGGCGCTGG TTATCTGGAA ATACTGGCAG CCCATTAGTG CATTTCTGGG GGGCGTGGTG GAAGGGTTCA GAGCCGCTGC TGCGCCCATC AGCGCCGCCT TTGAGCCGCT CAGACCCGTG TTTCAGTGGA TTGGTGACAG GGTGCAGGCC TTGTGGGGCT GGTTCAATGA TTTACTTACC CCGGTTAAAT CCACTTCCGA AGAACTGAAC AGCGCAGCTG CAATGGGGCG TCGGTTTGGT GAGGCGCTGG CGGAAGGTCT GAATATGGTG ATGCACCCAC TTGAGTCACT TAAATCCGGT GTGTCATGGC TGCTGGAAAA GCTCGGTATT GTCAGTAAGG AGGCGGCAAA GGCGAAACTA CCTGCGCAGG TTACGCAGCA GCAGTCCGCC ACAGTGAACA GTGACGGCAA AGTGGTGCTG CCGCCAGGCG GGTTCCCGGC TTACGCGGGG ATGTACGACA CGGGCGGGAT CATTCCACGC GGGCAGTTTG GCATTGTCGG AGAAAATGGC CCTGAAATTG TGAACGGACC GGCAAATGTT ACCAGCAGGC GGCGTACTGC TGCGCTGGCC TCTGTCGTTG CTGGCGTGAT GGGGGGAGCT GCGACACCTG CAGAAGCGGC TCCGCTTCAT CCGTTCAGTT TGCCTGCGAG GGCATACCAG CCCCCGCTTG CTAAGGCAGA TAGCCCGCCG CCGGTTATTC GTTATGAGAT AAATGCGCCC ATTCATATTG TCGCTCAGCC TGGGCAGAAC GCGCAGGAAA TTGCCCGTGA AGTGGCACGC CAGCTTGACG AACGGGAGCG CCGGGCCAGG GCAAAAGCAC GCAGCAATTT CAGCGATCAG GGGGGGTATG AATCATGA
|
Protein sequence | MSDNNLRLQV ILNAVDKLTR PFRSAQASSK ELATAIQQSR ARLKELDAQA GRIDGFRKAS AQLAVTGNSL KAAREEAAKL ATQFSATNRP TAAQARLLEQ AKNRVTELQS KYNGLRQSVQ RQRLALNEAG LDTKKLSSAQ RELRQNADET RQALDRQQKS LKRLGEQQAR MNAVRDQYSR RLEVRDRIAG AGATTTAAGV AMGAPVVAAV KSYASMEDAM KGVAKQVNRL RDDNGNRTKQ FYDMQDAIKA ASEQLPMENG AIDYAALVEG GARMGVTNQD DPYEEQKRDL LAFASTAAKA ATAFELPADE LAEGLGKIAQ LYKVPTRNIE QLGDALNYLD DNAMSKGGDI INVLQRMGGV ADRLDFRKAA ALGSTFLSLG AAPEIAASAS NAMVRELSIA TMQSKRFFEG MNLLQLNPAE IEKQMTTDAM GTIQRVLEKV NNLPQDKRLS AMTMIFGKEF GDDAAKLANN LPELQRQLKL TSGSGANGSM QKESDINKDS LSAQWLLVKT GAQNAFSSLG ETLRQPLMDI MGMVKRVTGA LRRWVEQNPV LAGTLMKVAA ATAAITVGLG ALAVAVAAVL GPLAVIRFGL SMLSVKALPS AAAAATRTGS VLRLLISGPL ALLRVALFAV GSLLGALLSP VGLVVAALAG VALVIWKYWQ PISAFLGGVV EGFRAAAAPI SAAFEPLRPV FQWIGDRVQA LWGWFNDLLT PVKSTSEELN SAAAMGRRFG EALAEGLNMV MHPLESLKSG VSWLLEKLGI VSKEAAKAKL PAQVTQQQSA TVNSDGKVVL PPGGFPAYAG MYDTGGIIPR GQFGIVGENG PEIVNGPANV TSRRRTAALA SVVAGVMGGA ATPAEAAPLH PFSLPARAYQ PPLAKADSPP PVIRYEINAP IHIVAQPGQN AQEIAREVAR QLDERERRAR AKARSNFSDQ GGYES
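The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Codon table in the conventional NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                     # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAGTGACA ATAACCTGCG TCTGCAGGTG"))  # MSDNNLRLQV
print(translate("GGGGGGTATG AATCATGA"))               # GGYES
```

The two outputs match the first ten and last five residues of the listed protein sequence.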