Gene Information |
Locus tag | SeD_A2266 |
Symbol | |
ID | 6873640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2162344 |
End bp | 2163549 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642785366 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002216028 |
Protein GI | 198242419 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
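
The coordinate, length, and protein-length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. Both are assumptions, although they match the values in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2162344, 2163549)
print(length)                           # 1206
print(expected_protein_length(length))  # 401
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields.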
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000000000739522 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGTTG ATATTAAAGA TGTGGAACAG GTCGCGCAGG AACTGCAACA GAAGTTTGAC GACTTCAAAG CAAAGAACGA CAAGCGCGTT GAGGCGATTG AGCAGGAAAA GGGCAAGCTT GCCGGGCAGG TGGAAACCCT GAACGGGAAA CTCAGCGAGC TGGAAAATCT CAAAAGCGAC CTTGAAAAAG AGCTGCTTGA GCTGAAACGT CCGGCACGTG GAGCGCAAAA CAAGGTGGCT GCAGAACATA AAGACGCTTT CGTCGGCTTT CTGCGTAAAG GCCGCGAAGA CGGTCTGCGC GATCTGGAGC GTAAGGCGTT GCAGGTGGGC ACTGATGAAG ATGGTGGTTA TGCCGTGCCG GAAGAGCTGG ATCGCAGCAT TCTCAGCCTG CTGAAAGATG AGGTGGTGAT GCGCCAGGAG GCCACGGTGA TCACCGTGGG CGGTTCCGAC TATAAAAAAC TGGTGAATCT GGGTGGTACG GCTTCCGGAT GGGTCGGCGA AACTGACACG CGTTCCCAGA CCGCTACTTC CAGGCTGGGA CTGATTGAGC CTTTCATGGG GGAAATCTAC GGCAACCCGC AGGCCACCCA GAAAATGCTG GATGATGCCT TCTTCAACGT GGAAGCCTGG ATCAACAGTG AACTGGCGAC CGAATTTGCC GAACAGGAGG AAATTGCCTT TACCACTGGT GACGGCACCA AGAAGCCGAA AGGGTTCCTG GCCTATGAAT CCACCGAAGA GTCCGATAAG GCTCGTGCGT TCGGTAAACT TCAGCACATC GTATCCGGTG AAGCGACCGC GGTGACCGCT GATGCCATCA TTAAGCTGAT TTACACGCTG CGTAAGGCGC ATCGTACCGG CGCGAAGTTC ATGATGAACA ACAACAGCCT GTTTGCCATC CGTCTGCTGA AAGATACCGA GGGTAACTAT CTGTGGCGTC CGGGGCTGGA ACTGGGACAG CCATCCTCAC TGGCGGGTTA CGGTATCGCT GAAAACGAAC AGATGCCGGA TATCGCCGCC GATGCGAAAG CCATTGCGTT TGGTAACTTC AAACGGGGTT ACACCATCGT TGACCGTATC GGCACCCGCA TCCTGCGCGA CCCGTACACC AACAAACCGT TTGTCGGTTT TTATACCACC AAGCGCACCG GGGGTATGCT GGTCGATTCA CAGGCTATCA AGCTGCTGAA AATCGCTGCG GCGTAA
|
Protein sequence | MAVDIKDVEQ VAQELQQKFD DFKAKNDKRV EAIEQEKGKL AGQVETLNGK LSELENLKSD LEKELLELKR PARGAQNKVA AEHKDAFVGF LRKGREDGLR DLERKALQVG TDEDGGYAVP EELDRSILSL LKDEVVMRQE ATVITVGGSD YKKLVNLGGT ASGWVGETDT RSQTATSRLG LIEPFMGEIY GNPQATQKML DDAFFNVEAW INSELATEFA EQEEIAFTTG DGTKKPKGFL AYESTEESDK ARAFGKLQHI VSGEATAVTA DAIIKLIYTL RKAHRTGAKF MMNNNSLFAI RLLKDTEGNY LWRPGLELGQ PSSLAGYGIA ENEQMPDIAA DAKAIAFGNF KRGYTIVDRI GTRILRDPYT NKPFVGFYTT KRTGGMLVDS QAIKLLKIAA A
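
The sequences above are printed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-content calculation and a stop-codon check. The helper names are hypothetical. The stop codons listed are those of translation table 11, and the short strings used in the example are the first two and last two blocks of the gene sequence in this record.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


head = clean_sequence("ATGGCGGTTG ATATTAAAGA")  # first two blocks of the gene sequence
tail = clean_sequence("AATCGCTGCG GCGTAA")      # last two blocks of the gene sequence
print(len(head))             # 20
print(gc_percent(head))      # 35.0
print(ends_with_stop(tail))  # True
```

Passing the full Gene sequence field through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field. That field appears to be rounded to a whole percent.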