Gene Information |
Locus tag | SeD_A3051 |
Symbol | |
ID | 6875231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2943387 |
End bp | 2944418 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642786081 |
Product | phage portal protein, PBSX family |
Protein accession | YP_002216727 |
Protein GI | 198242747 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
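
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2943387
end_bp = 2944418
reported_gene_length_bp = 1032
reported_protein_length_aa = 343


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)

print("gene length (bp):", length_bp)
print("protein length (aa):", protein_aa)
print("consistent with record:",
      length_bp == reported_gene_length_bp
      and protein_aa == reported_protein_length_aa)
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields.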
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.0247481 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAGA GTAAGAAAAA CCGCGCTGCG GCGACGAATC AGCTCAAGCA TAAAAGCCAA ACTTCAGCCG AAGCATTCAG CTTTGGTGAT CCCGTTCCTG TTCTGGACCG CCGTGAACTG CTGGACTATG TGGAATGCGT ACAGATGGAC CGCTGGTATG AGCCGCCCGT CAGCTTTGAC GGACTGGCAC GAACCTTCCG CGCCGCCGTG CATCACAGCT CACCAATTGC GGTGAAATGC AACATTCTGA CCAGTACCTA CATCCCTCAC CCGCTGCTCA GCCAGCAGGC TTTTTCACGT TTTGTGCAGG ACTATCTGGT ATTTGGTAAC GCCTACCTGG AGAAACGCAC GAACCGCTTC GGTGAAGTTA TCGCCCTTGA GCCTGCTCTG GCAAAATACA CCCGACGCGG GTTAGACCTG GATACCTACT GGTTTGTGCA ATACGGTATG ACAACCCAGC CGTATCAGTT CACGAAAGGC AGCATTTTTC ATCTGATGGA ACCGGACATC AACCAGGAGA TCTACGGCCT GCCAGGTTAT CTTTCTGCCA TTCCATCAGC CCTGCTCAAC GAGTCCGCCA CGCTGTTCCG CCGGAAGTAT TACATTAACG GCAGTCATGC AGGCTTCATC ATGTACATGA CCGATGCCGC GCAGAACCAA GAGGATGTGA ACAACCTCCG CAATGCGATG AAAAGCGCCA AAGGCCCTGG CAACTTCCGC AACCTGTTTA TGTACTCGCC TAACGGCAAA AAGGACGGGC TTCAGATCAT CCCGTTGTCA GAAGTCGCGG CGAAGGATGA GTTTCTGAAT ATCAAGAACG TGAGCCGGGA CGACATGATG GCGGCACACC GTGTGCCGCC GCAAATGATG GGGATAATGC CTAATAATGT TGGGGGGTTT GGGGATATCG AAAAAGCTAG CAAGGTATTT GTTAGGAATG AATTAACCCC CCTTCAGAAG CGATTTAGTG AGTTAAACGA ATGGATTAAT GAAAAAATAA TCACTTACTC ACAATATTCT ATTGGAGACT AA
|
Protein sequence | MGKSKKNRAA ATNQLKHKSQ TSAEAFSFGD PVPVLDRREL LDYVECVQMD RWYEPPVSFD GLARTFRAAV HHSSPIAVKC NILTSTYIPH PLLSQQAFSR FVQDYLVFGN AYLEKRTNRF GEVIALEPAL AKYTRRGLDL DTYWFVQYGM TTQPYQFTKG SIFHLMEPDI NQEIYGLPGY LSAIPSALLN ESATLFRRKY YINGSHAGFI MYMTDAAQNQ EDVNNLRNAM KSAKGPGNFR NLFMYSPNGK KDGLQIIPLS EVAAKDEFLN IKNVSRDDMM AAHRVPPQMM GIMPNNVGGF GDIEKASKVF VRNELTPLQK RFSELNEWIN EKIITYSQYS IGD
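
The gene and protein sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block-formatted string could be cleaned, translated, and scanned for GC content follows. It uses only the first 30 nucleotides of the gene sequence as input. The codon table is built by hand in the usual TCAG ordering; its amino acid assignments are those of the standard genetic code, which translation table 11 shares for elongation codons. Helper names are hypothetical, and how the database computes its GC content field is not stated in the record.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (standard code; translation
# table 11 uses the same assignments and differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
fragment = clean("ATGGGAAAGA GTAAGAAAAA CCGCGCTGCG")
print(translate(fragment))    # compare with the first block of the protein sequence
print(gc_fraction(fragment))  # GC fraction of this fragment only, not the whole gene
```

Passing the full gene sequence through `clean` and `translate` in the same way gives a translation that can be compared with the Protein sequence field.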