Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3027 |
Symbol | |
ID | 6873202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2924924 |
End bp | 2927731 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642786058 |
Product | phage tail tape measure protein |
Protein accession | YP_002216704 |
Protein GI | 198244813 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.000444183 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA GCTCACCCGC CCATTTCGAT CCGCGCAGGC CAGCTCAAAA GAGCTGGCTG CAGCCATTCA GCAAAGCCGC GCCCGTTTAA AAGAATTAGA TGCTCAGGCG GGCCGCATTG ACGGTTTCCG CAAGGCCAGC GCGCAGCTGG CAGTCACCGG TAACAGCCTG AAAGCCGCAC GCGAAGAAGC TGCGAAACTT GCCACGCAAT TCTCTGCCAC CAACCGCCCG ACGGCGGCGC AGGCACGGCT GCTTGAGCAG GCAAAAAACC GCGTTACGGA GTTACAGAGC AAATATAACG GTCTACGTCA GTCGGTGCAG CGCCAGCGTC TTGCGCTCAA TGAAGCCGGA CTGGACACGA AAAAGCTCAG TAGTGCGCAG CGGGAACTGC GGCAGAATGC CGACGAAACC CGGCAGGCCC TGGACCGGCA GCAGAAATCC CTTAAAAGCT TGGGCGAACA GCAGGCGCGA ATGAACGCCG TCCGCGATCA GTATTCGCGC CGTCTTGAGG TGCGGGATCG CATCGCCGGG GCAGGGGCTA CCACTACGGC TGCAGGGGGG GCAATGGGCG CACCTGTTGT GGCGGCAGTT AAGAGCTACG CCAGCATGGA AGATGCCATG AAAGGTGTGG CAAAGCAGGT AAACGGGCTG CGGGACGATA ATGGCAACCG CACAAAACAG TTTTATGACA TGCAGGATGC CATCAAGGCC GCCAGCGAAC AGCTACCGAT GGAGAACGGC GCTATAGACT ATGCCGCGCT GGTTGAAGGT GGTGCTCGCA TGGGGGTGAC CAATCAGGAC GATCCTTACG AAGAGCAGAA ACGTGACCTG CTGGCTTTTG CATCCACGGC GGCAAAAGCG GCAACGGCCT TTGAGCTGCC CGCAGATGAA CTGGCAGAAG GACTGGGGAA AATCGCGCAG CTCTATAAAG TTCCGACGCG CAATATTGAA CAACTGGGCG ATGCGCTGAA CTACTTGGAC GATAACGCCA TGTCAAAGGG TGGGGACATT ATCAACGTCC TGCAGCGTAT GGGGGGCGTG GCTGACCGCC TTGACTTCCG AAAGGCCGCG GCGCTGGGTT CAACATTCCT TTCTCTTGGG GCTGCCCCGG AAATCGCCGC CAGCGCCTCT AATGCCATGG TGCGTGAACT GTCCATTGCT ACCATGCAAA GTAAACGATT TTTTGAAGGC ATGAATCTGT TGCAACTCAA TCCGGCGGAG ATTGAAAAGC AGATGACCAC CGATGCCATG GGCACAATTC AGCGGGTTCT GGAGAAGGTC AACAATCTGC CGCAGGATAA ACGCTTGTCA GCCATGACAA TGATTTTTGG CAAAGAGTTT GGCGATGATG CGGCAAAGCT GGCTAACAAC CTGCCGGAGC TGCAGCGCCA GCTGAAACTC ACATCAGGTA GTGGTGCTAA TGGCTCTATG CAGAAAGAAT CCGACATTAA CAAGGATTCA TTGTCTGCGC AGTGGTTGCT GGTTAAGACG GGCGCGCAGA ACGCTTTCAG CAGCCTGGGG GAAACGTTGC GCCAGCCGCT GATGGATATT ATGGGCATGG TTAAGCGCGT GACCGGGGCG TTGCGTCGCT GGGTTGAACA GAATCCCGTG CTGGCTGGCA CGCTGATGAA AGTGGCGGCA GCTACGGCAG CCATTACTGT TGGGTTGGGG GCGCTGGCAG TGGCGGTGGC TGCGGTACTA GGGCCGCTGG CGGTTATCCG GTTTGGCCTG TCAATGCTGT CAGTTAAAGC GTTACCTTCT GCAGCCGCCG CTGCCACACG TACAGGTAGC GTGCTGCGTT TGTTGATCTC TGGTCCGCTG GCTTTGCTGC GCGTGGCATT ATTTGCTGTT GGTAGCCTGC TGGGTGCGCT GCTCAGTCCT GTAGGGCTGG TTGTGGCTGC ACTGGCAGGC GTGGCGCTGG TTATCTGGAA ATACTGGCAG CCCATTAGTG CATTTCTGGG GGGCGTGGTG GAAGGGTTCA GAGCCGCTGC TGCGCCCATC AGCGCCGCCT TTGAGCCGCT CAGACCCGTG TTTCAGTGGA TTGGTGACAG GGTGCAGGCC TTGTGGGGCT GGTTCAATGA TTTACTTACC CCGGTTAAAT CCACTTCCGA AGAACTGAAC AGCGCAGCTG CAATGGGGCG TCGGTTTGGT GAGGCGCTGG CGGAAGGTCT GAATATGGTG ATGCACCCAC TTGAGTCACT TAAATCCGGT GTGTCATGGC TGCTGGAAAA GCTCGGTATT GTCAGTAAGG AGGCGGCAAA GGCGAAACTA CCTGCGCAGG TTACGCAGCC GCAGTCCGCC ACAGTGAACA GTGACGGCAA AGTGGTGCTG CCGCCAGGCG GGTTCCCGGC TTACGCGGGG ATGTATGACA CGGGCGGGAT CATTCCACGC GGGCAGTTTG GCATTGTCGG AGAAAATGGC CCTGAAATTG TGAACGGACC GGCAAATGTT ACCAGCAGGC GGCGTACTGC TGCACTGGCC TCTGTCGTTG CTGGCGTGAT GGGGGGAGCT GCGACACCTG CAGAAGCGGC TCCGCTTCAT CCGTTCAGTT TGCCTGCGAG GGCATACCAG CCCCCGCTTG CTAAGGCAGA TAGCCCGCCG CCGGTTATTC GTTATGAGAT AAATGCGCCC ATTCATATTG TCGCTCAGCC TGGGCAGAAC GCGCAGGATA TTGCCCGTGA AGTGGCACGC CAGCTTGACG AACGGGAGCG CCGGGCCAGG GCAAAAGCAC GCAGCAATTT CAGCGATCAG GGGGGGTATG AATCATGA
|
Protein sequence | MSDNNLRLQV ILNAVDKLTR PFRSAQASSK ELAAAIQQSR ARLKELDAQA GRIDGFRKAS AQLAVTGNSL KAAREEAAKL ATQFSATNRP TAAQARLLEQ AKNRVTELQS KYNGLRQSVQ RQRLALNEAG LDTKKLSSAQ RELRQNADET RQALDRQQKS LKSLGEQQAR MNAVRDQYSR RLEVRDRIAG AGATTTAAGG AMGAPVVAAV KSYASMEDAM KGVAKQVNGL RDDNGNRTKQ FYDMQDAIKA ASEQLPMENG AIDYAALVEG GARMGVTNQD DPYEEQKRDL LAFASTAAKA ATAFELPADE LAEGLGKIAQ LYKVPTRNIE QLGDALNYLD DNAMSKGGDI INVLQRMGGV ADRLDFRKAA ALGSTFLSLG AAPEIAASAS NAMVRELSIA TMQSKRFFEG MNLLQLNPAE IEKQMTTDAM GTIQRVLEKV NNLPQDKRLS AMTMIFGKEF GDDAAKLANN LPELQRQLKL TSGSGANGSM QKESDINKDS LSAQWLLVKT GAQNAFSSLG ETLRQPLMDI MGMVKRVTGA LRRWVEQNPV LAGTLMKVAA ATAAITVGLG ALAVAVAAVL GPLAVIRFGL SMLSVKALPS AAAAATRTGS VLRLLISGPL ALLRVALFAV GSLLGALLSP VGLVVAALAG VALVIWKYWQ PISAFLGGVV EGFRAAAAPI SAAFEPLRPV FQWIGDRVQA LWGWFNDLLT PVKSTSEELN SAAAMGRRFG EALAEGLNMV MHPLESLKSG VSWLLEKLGI VSKEAAKAKL PAQVTQPQSA TVNSDGKVVL PPGGFPAYAG MYDTGGIIPR GQFGIVGENG PEIVNGPANV TSRRRTAALA SVVAGVMGGA ATPAEAAPLH PFSLPARAYQ PPLAKADSPP PVIRYEINAP IHIVAQPGQN AQDIAREVAR QLDERERRAR AKARSNFSDQ GGYES
|
| |