Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2938 |
Symbol | |
ID | 6483359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2869316 |
End bp | 2870377 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642738255 |
Product | phage portal protein pbsx family |
Protein accession | YP_002041984 |
Protein GI | 194445286 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 3.19492e-28 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTAAAA AGAAACACTT CATTAAGCGC GACCAGCGCG GCGATAAGTC AAAAAAAATG AGCATCATTA CGTTCGGCAA ACCGGAACCG GTTCTGACCA CTGGCACCGA CTACCGGGAT ATCTGGTACG ACAATGCCGC CGATCATTTT ACTCAGCCAA TTGACCGGCT GGCACTGGCA CAACTGATTA ACCTTAACGG TCAACATGGC GGTATCATCC ACGCCCGTAA AAACATGATT GTGTCTGATT ATCTGTCTGG CGGCCTGACT TACGACCAAC TGGAAGCCGC AGCTTTTGAC TACATCACAT TTGGGGATAT TGCGCTTGGA AAAATTCGTA ACGGATGGGG AGATGTGATC GGACTGGAAC CCTTACCCGG CCTCTATATC CGACGCAGGA AAGACAGGAA CAACGCAACT GATCAACCTG GTGATTACGT GGTGTTACAG GAAGGCGAAC CGCAGATATG GCCTGAAGAA GATATTATCT TCATCAAAAT GTATGATCCG CAACAGCATA TTTACGGACT GCCGGACTAC ATCGGCGGCG TACATTCTGC ATTACTCAAC AGTGAAGCGG TCATTTTCCG TCGCCGTTAC TACCACAATG GCGCCCACAC TGGCGGCATT CTCTACACGC GCGATCCCAG CATGACGGAT GAAATGGAAG AGGAAATTGA ACAGCAGCTG CGTGACAGCA AAGGGATCGG CAACTTCTCC ACCATCCTGG TAAACATTCC CGGTGGAGAC GGTGACGCCA TCAAATTCAT TGAAATGGGG GATATTTCCG CTAAGGATGA ATTTGCCAAC ATCAAAAATA TCAGCGCCCA GGATATTCTG AACGCGCACC GTTTTCCTGC CGGGCTTGCC GGCATTGTCC CGCAAAATAC TGCCGGACTT GGTGACGTAG AAAAGGCCGA ACGGATTTAT AAAAAAAGCG AAGTCGCCCC TGTTCAGCGC CGGTTTATGA TGGCCGTAAA CAATGATCCA GAAATACCGG AAAGGCTACA CCTTAACTTT GATTTAAGTT ACGCAGAATC AACGGATAAG GGTGCAGCAT GA
|
Protein sequence | MSKKKHFIKR DQRGDKSKKM SIITFGKPEP VLTTGTDYRD IWYDNAADHF TQPIDRLALA QLINLNGQHG GIIHARKNMI VSDYLSGGLT YDQLEAAAFD YITFGDIALG KIRNGWGDVI GLEPLPGLYI RRRKDRNNAT DQPGDYVVLQ EGEPQIWPEE DIIFIKMYDP QQHIYGLPDY IGGVHSALLN SEAVIFRRRY YHNGAHTGGI LYTRDPSMTD EMEEEIEQQL RDSKGIGNFS TILVNIPGGD GDAIKFIEMG DISAKDEFAN IKNISAQDIL NAHRFPAGLA GIVPQNTAGL GDVEKAERIY KKSEVAPVQR RFMMAVNNDP EIPERLHLNF DLSYAESTDK GAA
|
| |