Gene Information |
Locus tag | SeD_A2271 |
Symbol | |
ID | 6873019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2165627 |
End bp | 2167360 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642785369 |
Product | putative phage terminase, large subunit |
Protein accession | YP_002216031 |
Protein GI | 198243032 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
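The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are both inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2165627
end_bp = 2167360
reported_gene_length = 1734      # bp
reported_protein_length = 577    # aa


def gene_length(start, end):
    """Feature length, assuming start and end are both inclusive."""
    return end - start + 1


def protein_length(n_bp):
    """Amino acid count of an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(n_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


computed_bp = gene_length(start_bp, end_bp)
computed_aa = protein_length(computed_bp)
print(computed_bp, computed_aa)  # 1734 577
```

Under these assumptions, the computed values agree with the reported Gene Length and Protein Length.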
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.441488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 2.78373e-13 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCGGA AATCATATCC CAACGTTAAC GCCGCTAATC AGTATGCCCG CAACGTTGTG CGGGGAAAAA TCACGGCATG TCAGTATGTC ATTCAGGCCT GCCAGCGTCA CATTGATGAT ATGGCGGCGG AGAAGAGTAA AAGGTTTCGG TACCGCTTTG ATAAAGACAT GGCTGAGAAA GCTGCAAAGT TTATTCAGTT ACTTCCACAT ACAAAAGGTG AATGGGCGTT CAAACGTATG CCGATTACCC TGGAACCGTG GCAACTTTTC ATCGTGTGCT GTGCCTTTGG CTGGGTACAG AAGGGAACAA AGCTTCGTCG TTTTCGTGAG GTCTACACAG AGATACCACG TAAGAACGGG AAATCGGCTA TTTCAGCTGG TGTAGCTCTC TACTGTTTCA CCTGTGATAA CGAATTCGGT GCGGAAGTAT ACTCCGGCGC CACGACTGAA AAACAGGCGT GGGAGGTATT TCGTCCCGCG CGTCTGATGT GTAAGCGCAC ACCACTACTG GTGGAGGCAT TCGGTATAGA GGTGAATGCC TCAAACCTGA ACCGTCCGGA GGATGGTGCC CGCTTCGAGC CGTTGATCGG CAACCCCGGG GACGGGGCAT CACCGCACTG CGCAATAGTT GACGAATACC ACGAACACCC TACGGATGCG CTCTACACAA CAATGCTTAC AGGTATGGGC GCGCGCCGAC AGCCGCTGAT GTGGGCAATA ACCACGGCGG GCTACAACAT CGAGGGGCCG TGTTACGACA AGCGACGCGA AGTGATTGAG ATGCTGAACG GATCGGTGCC GAACAACGAA CTTTTTGGCG TGATTTACAC GGTTGATGAA GGGGATGACT GGACAGATCC AAAAGTGCTG GAGAAAGCAA ACCCGAACAT TGGGGTGTCA GTATACCGTG ACTTCCTTCT CAGTCAGCAA CAGCGTGCTA TTAACAATGC CCGCCATGCG GGTGTGTTCA AAACGAAGCA TCTCAATGTA TGGGTTGCCG CCCGCACAGC ATTCTTTAAT CTGGTTTCCT GGCAAAACTG TGAGGATAAG ACGCTGACGC TGGAACTGTT TGAGGGTCAA CCCTGCGTAC TGGCGTTCGA TCTGGCTCGT AAGCTGGACA TGAACAGCAT GGCGAGGTTA TTTACCCGTG AAATAGACGG GAAAACGCAT TTTTACAGCG TGGCGCCACG TTTCTGGGTG CCGTATGACA CGGTCTACAG TGTTGAGAAA AATGAGGATC GCCGTACTGC GGAACGTTTT CAGAAATGGG TTGAAATGGG CTTTTTGACA GTAACTGATG GTGCGGAGGT GGATTACCGC TACATCCTTG AAGAGGCCAA AGCTGCGAAC AAACTGAACC CGGTCAGCGA ATCCCCCATT GATCCATTTG GTGCCACCGG GCTTTCACAT GATCTAGCTG ATGAAAACCT GAATCCCGTC ACTATCATCC AGAATTACAC CAACATGTCC GATCCGATGA AAGAACTGGA AGCGGCGATT GAATCGGGTC GCTTTCATCA TGACGGCAAT CCCATCATGA CCTGGTGTAT CGGCAACGTG GTCGGCAAAA CCATTCCGGG TAACGATGAC GTGGTGAAGC CTATTAAGGA GCAGGCGGAA AATAAAATCG ATGGTGCAGT TGCACTGATT ATGGCGGTTG GCAGAGCCAT GCTGTACGAG AAAGAAGACA CGCTGTCTGA CCACATTGAG TCCTACGGGA TCCGCTCGCT TTAA
|
Protein sequence | MSRKSYPNVN AANQYARNVV RGKITACQYV IQACQRHIDD MAAEKSKRFR YRFDKDMAEK AAKFIQLLPH TKGEWAFKRM PITLEPWQLF IVCCAFGWVQ KGTKLRRFRE VYTEIPRKNG KSAISAGVAL YCFTCDNEFG AEVYSGATTE KQAWEVFRPA RLMCKRTPLL VEAFGIEVNA SNLNRPEDGA RFEPLIGNPG DGASPHCAIV DEYHEHPTDA LYTTMLTGMG ARRQPLMWAI TTAGYNIEGP CYDKRREVIE MLNGSVPNNE LFGVIYTVDE GDDWTDPKVL EKANPNIGVS VYRDFLLSQQ QRAINNARHA GVFKTKHLNV WVAARTAFFN LVSWQNCEDK TLTLELFEGQ PCVLAFDLAR KLDMNSMARL FTREIDGKTH FYSVAPRFWV PYDTVYSVEK NEDRRTAERF QKWVEMGFLT VTDGAEVDYR YILEEAKAAN KLNPVSESPI DPFGATGLSH DLADENLNPV TIIQNYTNMS DPMKELEAAI ESGRFHHDGN PIMTWCIGNV VGKTIPGNDD VVKPIKEQAE NKIDGAVALI MAVGRAMLYE KEDTLSDHIE SYGIRSL
|
| |
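The relationship between the gene sequence and the protein sequence can be checked by translating codons with translation table 11. The following is a minimal sketch rather than a reference implementation: it uses only the first 30 and last 24 nucleotides of the gene sequence above, hard-codes the amino acid assignments of table 11 (which match the standard code), and does not handle the alternative start codons that table 11 allows, since this gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 24 nucleotides of the gene sequence in this record.
head = "ATGAGCCGGA AATCATATCC CAACGTTAAC"
tail = "TCCTACGGGA TCCGCTCGCT TTAA"

print(translate(head))  # MSRKSYPNVN
print(translate(tail))  # SYGIRSL
```

Both fragments translate to the matching ends of the protein sequence listed above. Passing the complete gene sequence to `gc_percent` would give a value to compare with the reported GC content.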