Gene SeHA_C1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1151 
Symbol 
ID6488352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1130828 
End bp1133914 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content56% 
IMG OID642741393 
Productgifsy-1 prophage VmtH 
Protein accessionYP_002045045 
Protein GI194451878 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCAGA AAGTCGGTGA TATCGTCATC AACATGGATG TTGATACAGC TAAAGTTGCC 
GCCGGTCTTC AGACTGCCAG TAACGGGCTG GGGAAGCTGG TGGACAGCAG TGATCTCGTT
GAAAAACGCA TCAAGCGATG TATGGAGTCC AGCGCCAGAA GTGTGGCGGC ATCGGCAAAA
AGTATCAGTG CCGCTATGGC GCAATCACAG GTTGCCACAC GCACACAGAG TGACGCTATG
GCACAACTGG CGCGTGAGGC GAACGAGGCC AGAGAAAGGG CTGTCGACCT GAATCAGAAG
TTAAGGGCGG AAGCTGCGCA GGCAGCGGCG GTTGCACAGG CTCAGGATGC AGCCGCAGCG
GCATTTTACC GTCAGATTGA CAGTGTAAAA CAGTTAAGCG GTGGTCTGCA GGAGTTACAG
CGTATCCAGG CGCAGGTACG ACAGGCGAAA GGACGCGGAG ATATTTCACA GGGCGATTAT
CTGGCGCTGG TGTCTGAAGC TGCTGCAAAG ACACGCGAAC TTACCGATGC GGAGGCGCTG
GCCACGCAGA AAAAAGCACA GTTTATACGT CGACTGAAAG AGCAGACGGC GGTACAGGGC
CTCTCCCGTA CTGAGTTGCT GCGGGTGAAG GCGGCTGAAC TGGGGGTTAG CAGTGCCGCC
GATGTCTATA TCCGCAAACT GGATACCGCA ACAAAATCCA CTCATGCACT GGGACTGAAA
TCAGCAATGG CGCGCCGCGA GATAGGCGTA CTGATTGGTG AACTGGCACG GGGAAATTTT
GGCGCCCTTC GCGGTTCCGG TATCACGCTG GCCAACCGGG CCGGGTGGAT TGAGCAACTG
ATGTCGCCGA AGGGCATGAT GCTCGGCGGG CTGGTTGGCG GTGTGGCTGC GGCGGTTTAC
GGACTGGGTA AGGCGTACTA TGAGGGGGCG AAAGAAAGTG AGGAGTTCAA TAAACAGCTT
ATTCTGACCG GGAGTTATGC CGGAAAAACC ACAGGCCAGC TTAATGCGAT GGCGAAGTCG
CTCGCCGGAA ATGGCGTCAC GCAGCACGAT GCTGCAGGCG TGCTGGCACA GGTGGTCGGT
AGCGGAGCGT TTACCGGGCA GGCAGTGGCA ATGGTATCCC GTACCGCGAC CAGAATGCAG
GAAAACGTGG GACAATCAGT GGATGAAACC ATCCGCCAGT TTAAACGCCT GCGGGATGAT
CCGGTGAATG CGGCGAAAGA ACTGGACAGG ACACTGCATT TTCTGACCGC CACCCAGCTT
GAACAAATCA GGGTACTGGG CGAGCAGGGA AGAGTGGCTG ATGCCGCGAA AATTGCCATG
TCCGCGTATT CGGAAGAAAT GAATAAGCGG ATGGGGGACG TACACGACAA TCTGGGCTGG
ATTGAAAGAG CATGGAATGC TGTCGGTGAT GCGGCGAAGT GGGCATGGGA TCGGATGCTG
GATATCGGGC GGGAAGACAC GCTCGATGAA AAGATCGCGA CACTGCAGGA AAAAATCGCG
CGCGGCAGAA AAACGCCCTG GACGGTGTCT TCCTCCCAGA CTGAATACGA TCAGCAGCAG
CTGAACGAAC TTCAGGAACA GAAACGCCAG AAGGACCTGC TGGATGCGAA GGCGCAGGCA
GAGCGTAATT ATCAGGAAAC GCAGAAACGT CGGAACGAGC AGAACGCCGC GCTGAACCGG
GATAATGAAA CTGAATCCCT GCGGCATCAA CGGGAGGTGG CGCGCATTAC CGCCATGCAG
TATGCCGATG CTGCGGTACG CAATGCCGCG CTGGAGCGTG AAAACGAACG CCATAAAAAA
GCAATGGCAC GGCAGAAGGA AAAGCCAAAG GCTTACCACA ACGACGAGGC CGGGCGACTG
CTTTTGCAGT ACAGCCAGCA ACAGGCGCAG ACTGAAGGGC TGATTGCCGC CGCGAAGCTT
TCCACGACCG AAAAAATGAC GGAAGCGCAT AAGCAGCTTT TGTCATTTCA GCAGCGCATC
GCTGATTTGT CCGGTAAAAA ACTGACGGCG GATGAACAAA GCGTACTGGC ACATAAGGAT
GAAATAGCGC TTGCGCTACA GAAGCTGGAT ATCTCACAAC AGGATTTGCA ACACCAGAAT
GCCTTTAATG AACTGAAGAA AAAGACGCTC ACATTAACCA GCCAGCTCGC TGACGAAGAA
TCCCGCGTCA GGCAGCAGCA CGCACTGGCG CTGGCCACAA TGGGTATGGG CGATCAGCAA
CGTGGCCGGT ACGAAGAGCA TCTGAAAATT CAACAGCACT ACCAGGAACA ACTGGAGCAG
CTTAAGCGCG ACAGCAAGGC AAAAGGGACA TACGGTTCTG ACGAATACCG TCAGGCGGAG
CAGGAACTTC AGGCCAGTCT CGATCGCCGA CTGGCTGAGT GGGCGGATTA TAACGCGAAA
GTGGATGCTG CGCAGGGAGA CTGGACGCAG GGCGCGTCGC GGGCGCTGGA TAACTTTCTG
GCGCAGGGGG GCAACGTGGC AGGCATGACG GAGAACGTTT TCACAAACGC ATTTAACGGC
ATGGCGGACA GTATCGCGAA TTTTTCCGTG ACCGGAAAGG GCAGTTTCCG GAGCCTGACG
GTCTCCATCC TGGCTGACCT GGCAAAAATG GAGGCACGTA TTGCGGCTTC TAAACTGTTG
GGTTCAGTAC TGGGTATGTT CGGCTTTGGC GCATCAGCAG GCGGAAGTAC ACCATCCGGG
GCATACAGTT CAGCGGCGCT GTCGGTCATT CCAAATGCGG ACGGCGGCGT GTACCGCTCA
GCAGGACTCA GTCAGTACAG CGGCAGTATT GTTAACAGAC CGACGTTCTT TGCATTTGCC
AGAGGGGCGG CAGTAATGGG AGAGGCCGGT CCGGAGGCTA TACTGCCGCT TCGTCGCGGT
ACTGACGGTA AGCTGGGGGT TGTGGCAGCA GGTTCCGGAG GGATGGCGAT GTTTGCGCCG
CAGTATCATA TTGCAATCAG CAACACGGGG CCGGAGCTGA CGCCGCAGGC GCTGAAGGCG
GTTTATGATC TGGGTAAAAA GGCGGCGGCT GATTTCGTGC AGCAGCAGGG GCGTGACGGC
GGCAGGCTGA GCGGGGCATA TCGATGA
 
Protein sequence
MSQKVGDIVI NMDVDTAKVA AGLQTASNGL GKLVDSSDLV EKRIKRCMES SARSVAASAK 
SISAAMAQSQ VATRTQSDAM AQLAREANEA RERAVDLNQK LRAEAAQAAA VAQAQDAAAA
AFYRQIDSVK QLSGGLQELQ RIQAQVRQAK GRGDISQGDY LALVSEAAAK TRELTDAEAL
ATQKKAQFIR RLKEQTAVQG LSRTELLRVK AAELGVSSAA DVYIRKLDTA TKSTHALGLK
SAMARREIGV LIGELARGNF GALRGSGITL ANRAGWIEQL MSPKGMMLGG LVGGVAAAVY
GLGKAYYEGA KESEEFNKQL ILTGSYAGKT TGQLNAMAKS LAGNGVTQHD AAGVLAQVVG
SGAFTGQAVA MVSRTATRMQ ENVGQSVDET IRQFKRLRDD PVNAAKELDR TLHFLTATQL
EQIRVLGEQG RVADAAKIAM SAYSEEMNKR MGDVHDNLGW IERAWNAVGD AAKWAWDRML
DIGREDTLDE KIATLQEKIA RGRKTPWTVS SSQTEYDQQQ LNELQEQKRQ KDLLDAKAQA
ERNYQETQKR RNEQNAALNR DNETESLRHQ REVARITAMQ YADAAVRNAA LERENERHKK
AMARQKEKPK AYHNDEAGRL LLQYSQQQAQ TEGLIAAAKL STTEKMTEAH KQLLSFQQRI
ADLSGKKLTA DEQSVLAHKD EIALALQKLD ISQQDLQHQN AFNELKKKTL TLTSQLADEE
SRVRQQHALA LATMGMGDQQ RGRYEEHLKI QQHYQEQLEQ LKRDSKAKGT YGSDEYRQAE
QELQASLDRR LAEWADYNAK VDAAQGDWTQ GASRALDNFL AQGGNVAGMT ENVFTNAFNG
MADSIANFSV TGKGSFRSLT VSILADLAKM EARIAASKLL GSVLGMFGFG ASAGGSTPSG
AYSSAALSVI PNADGGVYRS AGLSQYSGSI VNRPTFFAFA RGAAVMGEAG PEAILPLRRG
TDGKLGVVAA GSGGMAMFAP QYHIAISNTG PELTPQALKA VYDLGKKAAA DFVQQQGRDG
GRLSGAYR