Gene Information |
Locus tag | SeHA_C1151 |
Symbol | |
ID | 6488352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1130828 |
End bp | 1133914 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642741393 |
Product | gifsy-1 prophage VmtH |
Protein accession | YP_002045045 |
Protein GI | 194451878 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
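
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in Protein Length. The record does not state either convention. The helper names `span_length` and `expected_protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1130828
end_bp = 1133914
gene_length_bp = 3087
protein_length_aa = 1028


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions both comparisons hold: 1133914 - 1130828 + 1 = 3087 bp, and 3087 / 3 - 1 = 1028 aa.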
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCAGA AAGTCGGTGA TATCGTCATC AACATGGATG TTGATACAGC TAAAGTTGCC GCCGGTCTTC AGACTGCCAG TAACGGGCTG GGGAAGCTGG TGGACAGCAG TGATCTCGTT GAAAAACGCA TCAAGCGATG TATGGAGTCC AGCGCCAGAA GTGTGGCGGC ATCGGCAAAA AGTATCAGTG CCGCTATGGC GCAATCACAG GTTGCCACAC GCACACAGAG TGACGCTATG GCACAACTGG CGCGTGAGGC GAACGAGGCC AGAGAAAGGG CTGTCGACCT GAATCAGAAG TTAAGGGCGG AAGCTGCGCA GGCAGCGGCG GTTGCACAGG CTCAGGATGC AGCCGCAGCG GCATTTTACC GTCAGATTGA CAGTGTAAAA CAGTTAAGCG GTGGTCTGCA GGAGTTACAG CGTATCCAGG CGCAGGTACG ACAGGCGAAA GGACGCGGAG ATATTTCACA GGGCGATTAT CTGGCGCTGG TGTCTGAAGC TGCTGCAAAG ACACGCGAAC TTACCGATGC GGAGGCGCTG GCCACGCAGA AAAAAGCACA GTTTATACGT CGACTGAAAG AGCAGACGGC GGTACAGGGC CTCTCCCGTA CTGAGTTGCT GCGGGTGAAG GCGGCTGAAC TGGGGGTTAG CAGTGCCGCC GATGTCTATA TCCGCAAACT GGATACCGCA ACAAAATCCA CTCATGCACT GGGACTGAAA TCAGCAATGG CGCGCCGCGA GATAGGCGTA CTGATTGGTG AACTGGCACG GGGAAATTTT GGCGCCCTTC GCGGTTCCGG TATCACGCTG GCCAACCGGG CCGGGTGGAT TGAGCAACTG ATGTCGCCGA AGGGCATGAT GCTCGGCGGG CTGGTTGGCG GTGTGGCTGC GGCGGTTTAC GGACTGGGTA AGGCGTACTA TGAGGGGGCG AAAGAAAGTG AGGAGTTCAA TAAACAGCTT ATTCTGACCG GGAGTTATGC CGGAAAAACC ACAGGCCAGC TTAATGCGAT GGCGAAGTCG CTCGCCGGAA ATGGCGTCAC GCAGCACGAT GCTGCAGGCG TGCTGGCACA GGTGGTCGGT AGCGGAGCGT TTACCGGGCA GGCAGTGGCA ATGGTATCCC GTACCGCGAC CAGAATGCAG GAAAACGTGG GACAATCAGT GGATGAAACC ATCCGCCAGT TTAAACGCCT GCGGGATGAT CCGGTGAATG CGGCGAAAGA ACTGGACAGG ACACTGCATT TTCTGACCGC CACCCAGCTT GAACAAATCA GGGTACTGGG CGAGCAGGGA AGAGTGGCTG ATGCCGCGAA AATTGCCATG TCCGCGTATT CGGAAGAAAT GAATAAGCGG ATGGGGGACG TACACGACAA TCTGGGCTGG ATTGAAAGAG CATGGAATGC TGTCGGTGAT GCGGCGAAGT GGGCATGGGA TCGGATGCTG GATATCGGGC GGGAAGACAC GCTCGATGAA AAGATCGCGA CACTGCAGGA AAAAATCGCG CGCGGCAGAA AAACGCCCTG GACGGTGTCT TCCTCCCAGA CTGAATACGA TCAGCAGCAG CTGAACGAAC TTCAGGAACA GAAACGCCAG AAGGACCTGC TGGATGCGAA GGCGCAGGCA GAGCGTAATT ATCAGGAAAC GCAGAAACGT CGGAACGAGC AGAACGCCGC GCTGAACCGG GATAATGAAA CTGAATCCCT GCGGCATCAA CGGGAGGTGG CGCGCATTAC CGCCATGCAG TATGCCGATG CTGCGGTACG CAATGCCGCG CTGGAGCGTG AAAACGAACG CCATAAAAAA 
GCAATGGCAC GGCAGAAGGA AAAGCCAAAG GCTTACCACA ACGACGAGGC CGGGCGACTG CTTTTGCAGT ACAGCCAGCA ACAGGCGCAG ACTGAAGGGC TGATTGCCGC CGCGAAGCTT TCCACGACCG AAAAAATGAC GGAAGCGCAT AAGCAGCTTT TGTCATTTCA GCAGCGCATC GCTGATTTGT CCGGTAAAAA ACTGACGGCG GATGAACAAA GCGTACTGGC ACATAAGGAT GAAATAGCGC TTGCGCTACA GAAGCTGGAT ATCTCACAAC AGGATTTGCA ACACCAGAAT GCCTTTAATG AACTGAAGAA AAAGACGCTC ACATTAACCA GCCAGCTCGC TGACGAAGAA TCCCGCGTCA GGCAGCAGCA CGCACTGGCG CTGGCCACAA TGGGTATGGG CGATCAGCAA CGTGGCCGGT ACGAAGAGCA TCTGAAAATT CAACAGCACT ACCAGGAACA ACTGGAGCAG CTTAAGCGCG ACAGCAAGGC AAAAGGGACA TACGGTTCTG ACGAATACCG TCAGGCGGAG CAGGAACTTC AGGCCAGTCT CGATCGCCGA CTGGCTGAGT GGGCGGATTA TAACGCGAAA GTGGATGCTG CGCAGGGAGA CTGGACGCAG GGCGCGTCGC GGGCGCTGGA TAACTTTCTG GCGCAGGGGG GCAACGTGGC AGGCATGACG GAGAACGTTT TCACAAACGC ATTTAACGGC ATGGCGGACA GTATCGCGAA TTTTTCCGTG ACCGGAAAGG GCAGTTTCCG GAGCCTGACG GTCTCCATCC TGGCTGACCT GGCAAAAATG GAGGCACGTA TTGCGGCTTC TAAACTGTTG GGTTCAGTAC TGGGTATGTT CGGCTTTGGC GCATCAGCAG GCGGAAGTAC ACCATCCGGG GCATACAGTT CAGCGGCGCT GTCGGTCATT CCAAATGCGG ACGGCGGCGT GTACCGCTCA GCAGGACTCA GTCAGTACAG CGGCAGTATT GTTAACAGAC CGACGTTCTT TGCATTTGCC AGAGGGGCGG CAGTAATGGG AGAGGCCGGT CCGGAGGCTA TACTGCCGCT TCGTCGCGGT ACTGACGGTA AGCTGGGGGT TGTGGCAGCA GGTTCCGGAG GGATGGCGAT GTTTGCGCCG CAGTATCATA TTGCAATCAG CAACACGGGG CCGGAGCTGA CGCCGCAGGC GCTGAAGGCG GTTTATGATC TGGGTAAAAA GGCGGCGGCT GATTTCGTGC AGCAGCAGGG GCGTGACGGC GGCAGGCTGA GCGGGGCATA TCGATGA
|
Protein sequence | MSQKVGDIVI NMDVDTAKVA AGLQTASNGL GKLVDSSDLV EKRIKRCMES SARSVAASAK SISAAMAQSQ VATRTQSDAM AQLAREANEA RERAVDLNQK LRAEAAQAAA VAQAQDAAAA AFYRQIDSVK QLSGGLQELQ RIQAQVRQAK GRGDISQGDY LALVSEAAAK TRELTDAEAL ATQKKAQFIR RLKEQTAVQG LSRTELLRVK AAELGVSSAA DVYIRKLDTA TKSTHALGLK SAMARREIGV LIGELARGNF GALRGSGITL ANRAGWIEQL MSPKGMMLGG LVGGVAAAVY GLGKAYYEGA KESEEFNKQL ILTGSYAGKT TGQLNAMAKS LAGNGVTQHD AAGVLAQVVG SGAFTGQAVA MVSRTATRMQ ENVGQSVDET IRQFKRLRDD PVNAAKELDR TLHFLTATQL EQIRVLGEQG RVADAAKIAM SAYSEEMNKR MGDVHDNLGW IERAWNAVGD AAKWAWDRML DIGREDTLDE KIATLQEKIA RGRKTPWTVS SSQTEYDQQQ LNELQEQKRQ KDLLDAKAQA ERNYQETQKR RNEQNAALNR DNETESLRHQ REVARITAMQ YADAAVRNAA LERENERHKK AMARQKEKPK AYHNDEAGRL LLQYSQQQAQ TEGLIAAAKL STTEKMTEAH KQLLSFQQRI ADLSGKKLTA DEQSVLAHKD EIALALQKLD ISQQDLQHQN AFNELKKKTL TLTSQLADEE SRVRQQHALA LATMGMGDQQ RGRYEEHLKI QQHYQEQLEQ LKRDSKAKGT YGSDEYRQAE QELQASLDRR LAEWADYNAK VDAAQGDWTQ GASRALDNFL AQGGNVAGMT ENVFTNAFNG MADSIANFSV TGKGSFRSLT VSILADLAKM EARIAASKLL GSVLGMFGFG ASAGGSTPSG AYSSAALSVI PNADGGVYRS AGLSQYSGSI VNRPTFFAFA RGAAVMGEAG PEAILPLRRG TDGKLGVVAA GSGGMAMFAP QYHIAISNTG PELTPQALKA VYDLGKKAAA DFVQQQGRDG GRLSGAYR
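
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG is one of the permitted initiation codons and an initiator is read as methionine. The minimal sketch below checks only the first and last codons, using short fragments copied from the two ends of the gene sequence above. The start-codon set is the one listed for NCBI translation table 11 as far as I know, and should be confirmed against the NCBI genetic code tables before being relied on. The helper `clean` is a hypothetical name.

```python
# Initiation and stop codons for translation table 11 (bacterial, archaeal
# and plant plastid code). Assumption: set taken from the NCBI listing.
TABLE11_START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


# Fragments copied from the start and end of the Gene sequence field.
gene_head = clean("TTGAGCCAGA AAGTCGGTGA")
gene_tail = clean("GCGGGGCATA TCGATGA")

first_codon = gene_head[:3]
last_codon = gene_tail[-3:]

starts_with_valid_initiator = first_codon in TABLE11_START_CODONS
ends_with_stop = last_codon in TABLE11_STOP_CODONS

print(first_codon, starts_with_valid_initiator)
print(last_codon, ends_with_stop)
```

Because the full gene is 3087 bp, a multiple of 3, the last three bases of the tail fragment are the final codon of the reading frame.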