Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4363 |
Symbol | |
ID | 6484199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4240381 |
End bp | 4241418 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739605 |
Product | phage portal protein pbsx family |
Protein accession | YP_002043299 |
Protein GI | 194444567 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000000000346537 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAAGA AACGCAACAA GCGCCAGCAG CCGCCGCGCA CCCAAAACCA CACCGCCGCA CCAGCGCAAA GCATGGAGGC ATTCACCTTT GGTGAGCCAA CGCCGGTACT CGACCGCCGC GATATTCTCG ATTATGTCGA GTGTATCAAC AACGGCCAGT GGTACGAGCC GCCGGTGAGC TTCTCCGGGC TGGCGAAAAG TATGCGCGCC GCCGTGCACC ACAGCTCACC GATTTACGTA AAGCGTAATA TTCTGGTGTC GACCTACATC CCGCACCCGT TGTTATCCCG TCAGGACTTC ACCCGGTTTG CGCTCGACTA TCTGGTGTTT GGCAATGCTT TTATCGAAGA GCGTCGCAGC CTGACCGGCA AGCCGTTAAA ACTGGAAACC TCACCGGCGA AATACACCCG CCGTGGCATC GAGGAGGACG TGTACTGGTA TATTCAGTCC TACACGCAGC CGCACCAGTT CGCGCCCGGC TCCGTCTTCC ACCTGCTCGA GCCCGATATT AATCAGGAGC TTTACGGGAT GCCGGAATAC CTGAGCGCAC TCAATTCAGC CTGGCTGAAT GAATCAGCGA CCCTGTTCCG TCGCAAGTAT TACCAGAACG GCGCGCATGC GGGTTACATC ATGTATGTGA CCGACGCCGC GCAAAGCAGC ACCGACGTCG AGGCACTGCG AAAGGCGATG CGCGACTCGA AAGGGCTCGG CAATTTTAAG AACCTGTTTT TCTACGCCCC TAATGGTAAA GCAGACGGGA TTAAAATTGT GCCACTGAGC GAAGTCGCCA CGAAGGATGA TTTTTTTAAT ATCAAGAAAG TCAGCGCCGC TGACCTGCTC GACGCGCACC GCATCCCATT CCAGCTTATG GGCGGTAAGC CCGATAACGT CGGCTCAGTG GGTGACGTTG AGAAGGTGGC AAAGGTCTTT GTACGTAACG AACTGACCCC GCTACAGGCG CGGTTTATGG AGTTGAACGA GTGGGCGGGT GAAGAGATTA TCCGCTTCGA AAAATATAGC CTCGGCGACG ACGAGTAA
|
Protein sequence | MSKKRNKRQQ PPRTQNHTAA PAQSMEAFTF GEPTPVLDRR DILDYVECIN NGQWYEPPVS FSGLAKSMRA AVHHSSPIYV KRNILVSTYI PHPLLSRQDF TRFALDYLVF GNAFIEERRS LTGKPLKLET SPAKYTRRGI EEDVYWYIQS YTQPHQFAPG SVFHLLEPDI NQELYGMPEY LSALNSAWLN ESATLFRRKY YQNGAHAGYI MYVTDAAQSS TDVEALRKAM RDSKGLGNFK NLFFYAPNGK ADGIKIVPLS EVATKDDFFN IKKVSAADLL DAHRIPFQLM GGKPDNVGSV GDVEKVAKVF VRNELTPLQA RFMELNEWAG EEIIRFEKYS LGDDE
|
| |