Gene SeD_A2256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2256 
Symbol 
ID6873006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2155979 
End bp2157907 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content56% 
IMG OID642785356 
Producttail protein 
Protein accessionYP_002216018 
Protein GI198244985 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.000173389 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG 
CTGAAAGGGA TGCAGCGGGA ACTGAAGGGA TTTCAGAAAG AAATGGCCGG GCTGGCGATC
GGTGCTGCTG CTGCCGGGAC CGCTGTTCTT GGGGCGCTGG CGCTGCCCGT GAATGCTGCG
ATCGGCTTTG AGTCAAAAAT GGCTGACATC CGGAAGGTGG TTGACGGCCT GGATGATAAA
AAAGCATTCG CGCAGATGAG TGACGATATC CTGACGCTGT CCACACAGTT ACCGATGGCG
GCGGAGGGAA TTGCAGAGAT CGTGGCGGCG GGCGGGCAGG CAGGCATTGC CCGCGGCGAT
TTGATGCAGT TTGCGAACGA CGCAGTGAAA ATGGGTGTGG CGTTTGATAC CACTGCCGAA
GAGTCCGGTC AGATGATGGC GCAGTGGCGG ACAGCGTTCA GACTGACGCA GGAAGACGTG
GTTGTCCTGG CCGATAAAAT CAACTATCTG GGGAATACCG GCCCGGCAAA TGCGAAGAAA
ATTTCTGATA TCGTGACGCG GATTGGTCCG CTTGGCGGTG TTGCCGGAGT GGCATCCGGC
GAAATTGCCG CGATGGGCGC CACCATTGCC GGGATGGGGG TTGAATCGGA GATAGCCTCC
ACCGGCATCA AAAACTTCAT GCTGTCGTTA ACCGCAGGTA ATTCGGCAAC CAAAGCCCAG
AAACAGGCTA TGGCTTTCCT GAAGCTGAAT CCCCGGAAAC TCGCTGAGGA TATGCAAAAG
GATTCGCGCG GGGCCATGCT GAAGGTGCTG GACTCGCTCG CGAAAGTGCC AAAAGCTAAA
CAGGCCGCCG TCATGAATGC GCTGTTTGGC AAGGAGTCAC TTAGCGCGAT TGCCCCGCTG
CTGACCAACC TGGATTTGTT ACGCACCAAT TTTGATCGTG TGGCTGATGC CCAGGAATAT
GGCGGCTCGA TGCAGAAGGA ATACGCATCC CGCGCGGCCA CAACAGAAAA CCAGCTGGTT
CTGCTGAAAA ACAGCGTCAA TGCGATTTCG GTAACGCTGG GCGATACCTT CCTGCCCGCC
ATTAACGAAG CTGCAGAAGC GGTCATGCCT TACCTGGAGC AGCTCCGGAC ATTCGTTCGC
GCGAATCCTG AACTGGTTCA GTCTGCGGCG AAGTTCGGCG CGGCGCTGCT GGCTGTTGGC
GTATCCATTG GCAGCCTGTC CCGGGCTGTC AAAATCCTGA ACAGTGTCAT TAATCTCTCT
CCGGCGAAAG TCGCCATTGC GGCGCTGGTG GCCGGCGCTA TGCTGATCAT TGAGAACTGG
GACGATGTTG CTCCGGTGAT TAAGGCGGTA TGGCAGGAGG TCGATAACGT TGCGCAGGAG
ATGGGCGGAT GGGAGACGGT GATTGAAGGG GTTGGTCTGG TTATGGCTGG TTCTTTTACC
GTCAGGACCA TTGGTGCCCT GCAGCAGTCC GTCCTGCTGG CCGGACGGCT TTCCGGTCTG
CTGGGTAAAA TTGGCCGGAT GGGGGCCATG ACGCTGACAA TTGGCGTGGC GGTGTCACTC
TTTAAAGAGC TTAAGGATCT GGAGCAGGGG GCAAAGGATG CGGGTATGGA TGCTGGCGCA
TTCGCTGTAC AGAAGCTGCA AACGAAGGAG CGTGAACGCG GGTATAACGG TTTTATTCCC
AGACTCAAAG AGCTTCTTGG TATGGACACC CCGATTCCGC AGGGGCGTTA TCAACCTTAT
GTGCCACTGA CCCGGCGTTC TGGCGTACTC GAGCGAGCTG TCCCGCCATC AACGCAGCGC
AGCGAACTCA AAGTGACATT TGAGAATGCA CCACAAGGTA TGCGTGTGAC TGATATACCG
AAATCCGGTA ATCAATTGAT GAACATCAGC CATGATGTGG GTTACTCACC CTTTCGTACA
TCACGATAA
 
Protein sequence
MADSFQLKAI ITAVDQLSGP LKGMQRELKG FQKEMAGLAI GAAAAGTAVL GALALPVNAA 
IGFESKMADI RKVVDGLDDK KAFAQMSDDI LTLSTQLPMA AEGIAEIVAA GGQAGIARGD
LMQFANDAVK MGVAFDTTAE ESGQMMAQWR TAFRLTQEDV VVLADKINYL GNTGPANAKK
ISDIVTRIGP LGGVAGVASG EIAAMGATIA GMGVESEIAS TGIKNFMLSL TAGNSATKAQ
KQAMAFLKLN PRKLAEDMQK DSRGAMLKVL DSLAKVPKAK QAAVMNALFG KESLSAIAPL
LTNLDLLRTN FDRVADAQEY GGSMQKEYAS RAATTENQLV LLKNSVNAIS VTLGDTFLPA
INEAAEAVMP YLEQLRTFVR ANPELVQSAA KFGAALLAVG VSIGSLSRAV KILNSVINLS
PAKVAIAALV AGAMLIIENW DDVAPVIKAV WQEVDNVAQE MGGWETVIEG VGLVMAGSFT
VRTIGALQQS VLLAGRLSGL LGKIGRMGAM TLTIGVAVSL FKELKDLEQG AKDAGMDAGA
FAVQKLQTKE RERGYNGFIP RLKELLGMDT PIPQGRYQPY VPLTRRSGVL ERAVPPSTQR
SELKVTFENA PQGMRVTDIP KSGNQLMNIS HDVGYSPFRT SR