Gene SeSA_A4475 details


Gene Information

Locus tag: SeSA_A4475
Symbol:
ID: 6519163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 4341875
End bp: 4343800
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 55%
IMG OID: 642749426
Product: tail protein
Protein accession: YP_002117162
Protein GI: 194734652
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
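
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both are assumptions that happen to fit the numbers in this record, and the function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based, inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4341875, 4343800)
print(length)                     # 1926
print(protein_length_aa(length))  # 641
```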


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00323148
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG 
CTGAAAGGGA TGCAGCGGGA ACTGAAGGGA TTTCAGAAAG AAATGGCCGG GCTGGCGATC
GGTGCTGCTG CTGCCGGGAC CGCTGTTCTT GGGGCGCTGG CGCTGCCCGT GAATGCTGCG
ATCGGCTTTG AGTCAAAAAT GGCTGACATC CGGAAGGTGG TTGACGGCCT GGATGATAAA
AAAGCATTCG CGCAGATGAG TGACGATATC CTGACGCTGT CCACACAGTT ACCGATGGCG
GCGGAGGGAA TTGCAGAGAT CGTGGCGGCG GGCGGGCAGG CAGGCATTGC CCGCGGCGAT
TTGATGCAGT TTGCGAACGA CGCAGTGAAA ATGGGTGTGG CGTTTGATAC CACTGCCGAA
GAGTCCGGTC AGATGATGGC GCAGTGGCGG ACAGCGTTCA AACTGACGCA GGAAGACGTG
GTTGTCCTGG CCGATAAAAT CAACTATCTG GGGAATACCG GCCCGGCAAA TGCGAAGAAA
ATTTCTGATA TCGTGACGCG GATTGGTCCG CTTGGCGGTG TTGCCGGAGT GGCATCCGGC
GAAATTGCCG CGATGGGCGC CACCATTGCC GGGATGGGGG TTGAATCGGA GATAGCATCC
ACCGGCATCA AAAACTTTAT GTTGTCCCTT ACGGCGGGCA AATCGGCAAC GAAGTCGCAG
AAGCGGGCAA TGGCCTTTCT GAAACTGAAT CCGGCGCAAC TGGCCGCAGA TATGCAGAAG
GATTCGCGCG CGGCGATGCT GAAAGTGCTG GACTCACTGG CGAAGGTGCC GAAAGCAAAA
CAGGCATCCG TCATGAATGC CCTGTTCGGG AAAGAGTCTT TAGGGGCGAT AGCGCCACTG
CTGACTAATC TTGATTTACT GCGCACCAAT TTTAATCGTG TTGCGGATGC CCAGGAATAT
GGCGGCTCGA TGCAGAAGGA ATATGCATCA CGCGCAGCCA CAACAGAAAA CCAGCTGGTT
CTGCTGAAAA ACAGCGTCAA TGCGATTTCA GTGACGCTGG GTGATACTTT TCTGCCCGCC
ATTAACGAAG CCGCAGAAGC GGTCATGCCT TACCTGGAGC AGCTCCGGAC ATTCGTTCGC
GCGAATCCTG AACTGGTTCA GTCTGCGGCG AAGTTCGGTG CGGCGCTGCT GGCTGTTGGC
GTATCCATCG GCAGCCTGTC CCGGGCTGTC AAAATCCTGA ACAGCGTCAT TAATCTCTCT
CCGGCGAAAG TCGCCATTGC GGCGCTGGTG GCCGGCGCTA TGCTGATCAT TGAGAACTGG
GACGATGTTG CTCCGGTGAT TAAGGCGGTA TGGCAGGAGG TCGATAACGT TGCGCAGGAG
ATGGGCGGAT GGGAGACGGT GATTGAAGGG GTTGGTCTGG TTATGGCTGG TTCTTTTACC
GTCAGGACCA TTGGTGCCCT GCAGCAGTCC GTCCTGCTGG CCGGACGGCT TTCCGGTCTG
CTGGGTAAAA TTGGCCGGAT GGGGGCCATG ACGCTGACAA TTGGCGTGGC GGTGTCACTC
TTTAAAGAGC TTAAGGATCT GGAGCAAGGG GCGAAGGATG CGGGTATGGA TGCTGGCGCA
TTCGCTGTAC AGAAGCTGCA AACGAAGGAG CGTGAACGCG GGTATAACGG TTTTATTCCC
AGACTCAAAG AGCTTCTTGG TATGGACACC CCGATTCCGC AGGGGCGTTA TCAACCTTAT
GTGCCACTGA CCCGGCGTTC TGGCGTACTC GAGCGAGCTG TCCCGCCATC AACGCAGCGC
AGTGAACTCA AAGTGACATT TGAGAATGCA CCACAGGGCA TGAGGGTGCT GGACATACCG
AAAACGGGAA ATCCTTTAAT GAACATTACC CATGATGTAG GGTATTCTCC CTTCAGTAAT
AAATAA
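
A GC content figure such as the one in this record can be recomputed from a sequence printed in blocks of ten bases. The following is a minimal sketch, assuming that whitespace is only a layout artifact and that GC content is the percentage of G and C bases; `gc_content` is a hypothetical helper, not part of any database tooling. It is applied to the first line of the gene sequence only, so the result differs from the whole-gene value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for layout."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 60 bases of the gene sequence, copied as printed (blocks of ten).
first_line = "ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG"
print(gc_content(first_line))  # 50.0
```

Passing the complete gene sequence instead of its first line gives the whole-gene value.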
 
Protein sequence
MADSFQLKAI ITAVDQLSGP LKGMQRELKG FQKEMAGLAI GAAAAGTAVL GALALPVNAA 
IGFESKMADI RKVVDGLDDK KAFAQMSDDI LTLSTQLPMA AEGIAEIVAA GGQAGIARGD
LMQFANDAVK MGVAFDTTAE ESGQMMAQWR TAFKLTQEDV VVLADKINYL GNTGPANAKK
ISDIVTRIGP LGGVAGVASG EIAAMGATIA GMGVESEIAS TGIKNFMLSL TAGKSATKSQ
KRAMAFLKLN PAQLAADMQK DSRAAMLKVL DSLAKVPKAK QASVMNALFG KESLGAIAPL
LTNLDLLRTN FNRVADAQEY GGSMQKEYAS RAATTENQLV LLKNSVNAIS VTLGDTFLPA
INEAAEAVMP YLEQLRTFVR ANPELVQSAA KFGAALLAVG VSIGSLSRAV KILNSVINLS
PAKVAIAALV AGAMLIIENW DDVAPVIKAV WQEVDNVAQE MGGWETVIEG VGLVMAGSFT
VRTIGALQQS VLLAGRLSGL LGKIGRMGAM TLTIGVAVSL FKELKDLEQG AKDAGMDAGA
FAVQKLQTKE RERGYNGFIP RLKELLGMDT PIPQGRYQPY VPLTRRSGVL ERAVPPSTQR
SELKVTFENA PQGMRVLDIP KTGNPLMNIT HDVGYSPFSN K
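
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below illustrates this on the first 60 bases, assuming the amino acid assignments of NCBI translation table 11 (identical to the standard code for sense codons) and stopping at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied as printed (blocks of ten).
first_line = "ATGGCTGACA GTTTCCAGTT AAAGGCCATT ATCACTGCCG TTGACCAGTT ATCGGGTCCG"
print(translate(first_line))  # MADSFQLKAIITAVDQLSGP
```

The output matches the first 20 residues of the protein sequence listed in this record.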