Gene SeD_A2266 details

Gene Information

Locus tag: SeD_A2266
Symbol:
ID: 6873640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2162344
End bp: 2163549
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 55%
IMG OID: 642785366
Product: phage major capsid protein, HK97 family
Protein accession: YP_002216028
Protein GI: 198242419
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
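
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose length includes one terminal stop codon; both are assumptions about the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2162344
END_BP = 2163549
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes the stop is part of the CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the coordinate span equals the listed gene length, and 1206 / 3 - 1 equals the listed protein length.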


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0000000000739522
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGGTTG ATATTAAAGA TGTGGAACAG GTCGCGCAGG AACTGCAACA GAAGTTTGAC 
GACTTCAAAG CAAAGAACGA CAAGCGCGTT GAGGCGATTG AGCAGGAAAA GGGCAAGCTT
GCCGGGCAGG TGGAAACCCT GAACGGGAAA CTCAGCGAGC TGGAAAATCT CAAAAGCGAC
CTTGAAAAAG AGCTGCTTGA GCTGAAACGT CCGGCACGTG GAGCGCAAAA CAAGGTGGCT
GCAGAACATA AAGACGCTTT CGTCGGCTTT CTGCGTAAAG GCCGCGAAGA CGGTCTGCGC
GATCTGGAGC GTAAGGCGTT GCAGGTGGGC ACTGATGAAG ATGGTGGTTA TGCCGTGCCG
GAAGAGCTGG ATCGCAGCAT TCTCAGCCTG CTGAAAGATG AGGTGGTGAT GCGCCAGGAG
GCCACGGTGA TCACCGTGGG CGGTTCCGAC TATAAAAAAC TGGTGAATCT GGGTGGTACG
GCTTCCGGAT GGGTCGGCGA AACTGACACG CGTTCCCAGA CCGCTACTTC CAGGCTGGGA
CTGATTGAGC CTTTCATGGG GGAAATCTAC GGCAACCCGC AGGCCACCCA GAAAATGCTG
GATGATGCCT TCTTCAACGT GGAAGCCTGG ATCAACAGTG AACTGGCGAC CGAATTTGCC
GAACAGGAGG AAATTGCCTT TACCACTGGT GACGGCACCA AGAAGCCGAA AGGGTTCCTG
GCCTATGAAT CCACCGAAGA GTCCGATAAG GCTCGTGCGT TCGGTAAACT TCAGCACATC
GTATCCGGTG AAGCGACCGC GGTGACCGCT GATGCCATCA TTAAGCTGAT TTACACGCTG
CGTAAGGCGC ATCGTACCGG CGCGAAGTTC ATGATGAACA ACAACAGCCT GTTTGCCATC
CGTCTGCTGA AAGATACCGA GGGTAACTAT CTGTGGCGTC CGGGGCTGGA ACTGGGACAG
CCATCCTCAC TGGCGGGTTA CGGTATCGCT GAAAACGAAC AGATGCCGGA TATCGCCGCC
GATGCGAAAG CCATTGCGTT TGGTAACTTC AAACGGGGTT ACACCATCGT TGACCGTATC
GGCACCCGCA TCCTGCGCGA CCCGTACACC AACAAACCGT TTGTCGGTTT TTATACCACC
AAGCGCACCG GGGGTATGCT GGTCGATTCA CAGGCTATCA AGCTGCTGAA AATCGCTGCG
GCGTAA
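
The gene sequence above is printed in space-separated blocks of 10 nt, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-percentage calculation that could be compared with the GC content reported for this gene. The helper names are hypothetical, and only the first printed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence (60 nt), copied from above.
FIRST_LINE = "ATGGCGGTTG ATATTAAAGA TGTGGAACAG GTCGCGCAGG AACTGCAACA GAAGTTTGAC"

if __name__ == "__main__":
    print(len(clean_sequence(FIRST_LINE)))
    print(round(gc_percent(FIRST_LINE), 1))
```

To check the reported figure, pass the complete sequence (all lines above) to `gc_percent` instead of the single sample line.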
 
Protein sequence
MAVDIKDVEQ VAQELQQKFD DFKAKNDKRV EAIEQEKGKL AGQVETLNGK LSELENLKSD 
LEKELLELKR PARGAQNKVA AEHKDAFVGF LRKGREDGLR DLERKALQVG TDEDGGYAVP
EELDRSILSL LKDEVVMRQE ATVITVGGSD YKKLVNLGGT ASGWVGETDT RSQTATSRLG
LIEPFMGEIY GNPQATQKML DDAFFNVEAW INSELATEFA EQEEIAFTTG DGTKKPKGFL
AYESTEESDK ARAFGKLQHI VSGEATAVTA DAIIKLIYTL RKAHRTGAKF MMNNNSLFAI
RLLKDTEGNY LWRPGLELGQ PSSLAGYGIA ENEQMPDIAA DAKAIAFGNF KRGYTIVDRI
GTRILRDPYT NKPFVGFYTT KRTGGMLVDS QAIKLLKIAA A
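
The protein sequence above corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal, and plant plastid code), whose amino-acid assignments match the standard code. The sketch below builds that codon table by hand and translates the first printed line of the gene sequence; the names are illustrative, and it does not model alternative start codons, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the gene sequence (60 nt), copied from above.
FIRST_LINE = "ATGGCGGTTG ATATTAAAGA TGTGGAACAG GTCGCGCAGG AACTGCAACA GAAGTTTGAC"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MAVDIKDVEQVAQELQQKFD
```

The 20 residues produced match the first two blocks of the protein sequence listed above (MAVDIKDVEQ VAQELQQKFD).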