Gene SeD_A2269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2269 
Symbol 
ID6875310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2164192 
End bp2165433 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content56% 
IMG OID642785367 
Productphage portal protein, HK97 family 
Protein accessionYP_002216029 
Protein GI198246051 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0000000000275865 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTCTTTT CGGGATTATT TCAACGAAAA AGTGACGCGC CGGTGACCAC GCCAGCAGAG 
CTGGCGGATG CTATCGGGCT GTCATACGAC ACCTATACCG GAAAGCAGAT CAGCAGTCAG
CGGGCCATGC GACTGACGGC GGTTTTTTCC TGCGTCAGAG TGCTGGCAGA GTCGGTCGGG
ATGTTGCCCT GCAACCTGTA TCACCTGAAC GGCAGCCTGA AACAGAGAGC CACTGGCGAA
CGTCTGCATA AGCTGATCTC CACGCATCCC AATAGCTATA TGACGCCGCA GGAGTTCTGG
GAGCTGGTGG TCACCTGTCT GTGCCTGCGG GGCAACTTTT ATGCCTACAA AGTGAAAGCA
TTTGGCGAAG TGGCTGAACT GCTGCCCGTC GATCCCGGTT GCGTGGTGCC GAAGCTTAAC
AGTAGCTGGG AGCCGGTCTA TCAGGTCACA TTCCCGGATG GCTCCACGGA TGTACTGAGC
CAGGAGGATA TCTGGCATGT GCGCACGCTG ACGCTGGACG GACTGGTGGG GCTGAATCCC
ATCGCCTATG CCCGCGAGGC AATATCGCTG GCGGCAGCGA CCGAAGAGCA CGGGGCCAGA
CTGTTCAGCA ATGGCGCGGT GACGTCGGGT GTGTTGCGTA CAGAGCAGAC GCTGTCGGAT
CAGGCTTATG AGCGCCTGAA GAAAGATTTT GAGGAGCGTC ACACCGGGCT TGGTAATGCT
CACCGCCCGA TGATCCTTGA GATGGGGCTG GACTGGAAGT CGATGGCGCT GAACGCCGAG
GACAGCCAGT TCCTGGAAAC CCGCAAGTTT CAGCTTGAAG AAATCTGTCG TCTGTTCCGG
GTGCCATTGC ACATGGTGCA GAACACCGAT CGCGCCACCT TCAACAATAT TGAAGAGCTG
GGGCTGGGAT TTATCAACTA TTCACTGGTG CCGTATCTGA CCCGCATCGA ACAGCGGATC
AACACCGGAC TGGTACGAAA AAGTAAGCAG GGCGTTTTTT ACGCCAAATT TAACGCGGGG
GCGTTACTGC GTGGGGATAT GAAGTCCCGT TTTGAAGCCT ATGCCACCGG GATCAACTGG
GGGATTTACT CTCCCAATGA CTGCCGCGAC CTGGAAGATA TGAATCCGCG TCCCGGTGGT
GATGTCTATC TCACACCGAT GAACATGACC ACGAAACCCT CCGATGGCAG TAAAGCCGGT
AAGCAGAAGG ATAACGCCAA TGCAGACGAA ACAACGTCTT GA
 
Protein sequence
MFFSGLFQRK SDAPVTTPAE LADAIGLSYD TYTGKQISSQ RAMRLTAVFS CVRVLAESVG 
MLPCNLYHLN GSLKQRATGE RLHKLISTHP NSYMTPQEFW ELVVTCLCLR GNFYAYKVKA
FGEVAELLPV DPGCVVPKLN SSWEPVYQVT FPDGSTDVLS QEDIWHVRTL TLDGLVGLNP
IAYAREAISL AAATEEHGAR LFSNGAVTSG VLRTEQTLSD QAYERLKKDF EERHTGLGNA
HRPMILEMGL DWKSMALNAE DSQFLETRKF QLEEICRLFR VPLHMVQNTD RATFNNIEEL
GLGFINYSLV PYLTRIEQRI NTGLVRKSKQ GVFYAKFNAG ALLRGDMKSR FEAYATGINW
GIYSPNDCRD LEDMNPRPGG DVYLTPMNMT TKPSDGSKAG KQKDNANADE TTS