Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3494 |
Symbol | |
ID | 6489503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3391374 |
End bp | 3392435 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642743623 |
Product | phage portal protein, pbsx family |
Protein accession | YP_002047237 |
Protein GI | 194450777 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 4.19269e-27 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTAAAA AGAAACACTT CGTTAAGCGC GACCAGCGCG GCGATAAGTC AAAAAAAATG AGCATCATTA CGTTCGGCAA ACCGGAACCT GTTCTGACCA CCGGTACCGA CTACCGGGAT ATCTGGTACG ACAATGCAGC CGATCATTTT ACCCAGCCAA TTGACCGGCT GGCACTGGCA CAACTGATTA ACCTTAACGG TCAACATGGC GGCATCATCC ATGCCCGTAA AAACATGATT GTGTCTGATT ATCTGTCTGG CGGCCTGACT TACGACCAGC TGGAAGCCGC TGCTTTTGAC TACATCACAT TTGGGGATAT TGCACTTGGA AAAATTCGTA ACGGATGGGG AGATGTGATC GGACTGGAAC CCTTACCCGG TCTCTATATC CGACGCAGGA AAGACAGGAA CAACGCAGCT GATCAACCTG GTGATTACGT GGTGCTACAG GAAGGCGAAC CGCAGATATG GCCGCAGGAA GATATCATTT TTATCAAGAT GTACGACCCG CAGCAGCATA TTTACGGACT GCCGGACTAC ATCGGCGGCG TACATTCTGC ATTGCTCAAC AGTGAAGCGG TCATTTTCCG TCGCCGCTAT TACCACAATG GCGCACACAC GGGCGGTATT CTTTATACCC GCGACCCCAG CATGACGGAT GAAATGGAAG AAGAAATTGA ACAGCAGCTG CGTGACAGCA AAGGGATCGG CAACTTCTCC ACCATCCTGG TAAACATTCC CGGTGGAGAC GGTGACGCCA TCAAATTCAT TGAAATGGGG GATATTTCCG CTAAGGATGA ATTTGCCAAC ATCAAGAATA TCAGCGCCCA GGACATTCTG AACGCGCACC GTTTTCCTGC CGGGCTTGCC GGCATTGTCC CGCAAAATAC TGCCGGGCTG GGTGACGTAG AAAAGGCCGA ACGGATTTAT AAAAAAAGCG AAGTCGCCCC TGTTCAGCGC CGTTTTATGA TGGCCGTAAA CAATGATCCA GAAATACCGG GAAACCTGCA CCTGAACTTT GATTTAAGTT ACACAGAATC AACGGATAAG GGTGCGGTAT GA
|
Protein sequence | MSKKKHFVKR DQRGDKSKKM SIITFGKPEP VLTTGTDYRD IWYDNAADHF TQPIDRLALA QLINLNGQHG GIIHARKNMI VSDYLSGGLT YDQLEAAAFD YITFGDIALG KIRNGWGDVI GLEPLPGLYI RRRKDRNNAA DQPGDYVVLQ EGEPQIWPQE DIIFIKMYDP QQHIYGLPDY IGGVHSALLN SEAVIFRRRY YHNGAHTGGI LYTRDPSMTD EMEEEIEQQL RDSKGIGNFS TILVNIPGGD GDAIKFIEMG DISAKDEFAN IKNISAQDIL NAHRFPAGLA GIVPQNTAGL GDVEKAERIY KKSEVAPVQR RFMMAVNNDP EIPGNLHLNF DLSYTESTDK GAV
|
| |