Gene SeHA_C4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4549 
Symbol 
ID6488447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4426482 
End bp4428836 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content58% 
IMG OID642744621 
Productputative phage tail protein 
Protein accessionYP_002048198 
Protein GI194448584 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.374859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAACG ACATTATTAC CCAGCTTCAG GCGCGTAATG AGACGTTGAC GCAGGCAATA 
GCCCGTTACG GCTCACTCAA CGCCAGCACG CTGCACACGC TCAGCTTTGA GCAAACAAAA
GTCACCCGGC TTACGCAACA GCTCGCTAAC TCCGCCCTTC GCCGGGAAGA GAACGATAAA
CAGCGCGCCG GGTTGCTGGA AAAAACACAA ACGTTCGCCG GACAGCTCGG CAAGCTCCTG
AACGTTGAGA CTCCCGACTG GAAGCTGCCT TACGAATTTC AGGGCAACAT GGTCGATATG
GCGGCGAAGG GCGGCATGGA TAACACCGCG CGGGACGCCC TGAGCCTGAA TATCCGCGAC
TGGAGCCTTG ATTTCAATCA AGATCAAAAA GATCTGCAAA GCACCGCCGC CACGATGATC
GAAGGCGGCG TCAGCGCATT GCAGGATCTT AGCCGCTACA TGCCCGATAT CGCCAAAGCC
GCGACCGCCT CCCGCGACAG CGCGCAAAGC TGGGCGCAGG CGGCTCTGGC CACTCGCGAC
AAACTGAACA TCGCCCCTGA CGACTTCCGT TTTGCGCAAA ATATGCTGTA CAGCGTGGCA
AAAAGCGGCG GCGGCTCCGT TGCAGAGCAA ACCCAGTGGA TTAACGCCTT TGCCGGAAAA
ACCGGCGCTC AGGGGAAAGA AGGCATTGCG GAACTGACCG CAACGATGCA AATCGCCATG
AAAAATGCCC CTGACGCAGG CGCGGCGGCA GCGAATTTCG ACCATTTCCT GAAATCCGCC
TTCTCGAAAG AGACGGACAG TTGGTTTGCC CGCCAGGGCG TGGATCTTCA GGGATCGCTG
CTGGAGCATC AGCAAAATGG GATCGGCGTG ACGGAAGCGA TGACCCACAT CGTGCAGATG
CAACTGGAGA AAATGAACCC GCAGATCCTC GACACCTTCA GGCAAACCAT GAAGATTGAG
GATCTTTCCG CGCGCGGCGA CGCGCTACAG GCCATGGTGG AGAAATTTAA CCTCGGCGCG
ATGTTCGGCG ATGCGCAAAC GCGGGATTTT CTCGCCCCGA TGCTGGCGAA TATGGACGAA
TATCGCCAGC TAAAAGCCTC CGCAATGCAG GCGGCGGGGC AACATGTTAT TGATGATGAC
TTCGCCGCGA AAATGACATC GCCCGGAGAA CAGACCAAAG CGTTACAACT TTCACTTAAC
GATCTGTGGC TGACCGTCGG CCTGGAACTG ATGCCCGCCA TTGGCGAACT GGCGCAAAGC
ATCACGCCGC TGGTGCGGCA GTTCAGCGCC TGGCTGCGGG AAAATCCGGC GCTGGTGCAA
GGGGTCGCTA AAGTCGTTAG CGTTATCTGG CTGTTCAACG GGGCGCTGAA TATTCTCAGG
CTGGGAGCAA ACCTCATTGC GTCACCGTTT ATTCGCCTGA TCGATATCTT CCTGAAGGTC
AAAGCCGGTC TGGCGCTGGG CGGCGGCAGT CGCGCGCTGT CGGTTCTGAA ATCGTTTGGC
AACGGTGCGA AAAGCCTGAC GATGCTGCTG GGAAACGGCC TGATAAAAGG GCTACGGCTG
GTCGGCCAGA CGTTTATCTG GCTGGGTCGG GCGCTGCTGA TGAACCCTGT CGGCCTGACT
ATCACCGCTA TCGCAGGCGC CGCCTATTTA CTTTATCGCT ACTGGGAACC GATTTCCGGT
TTCTTTGCCG GAGTCTGGGA GCGCATCAAA ACCGCCTTTG ACGGAGGCAT TGCCGGCGTC
ACGCGTCTGA TTCTCGACTG GTCGCCGCTG GGGTTGTTTT ACCGCGCCTT CGCCGGCGTA
CTGGACTGGT TTGGCATTGA ACTCCCCGCC AGTTTCAGCG AATTTGGCGG CAATATTCTC
GATAGCCTGA TCAACGGCAT TCTGAATGCG CTTCCTTTCC TGAGCGGGGC GATTGAGAAG
ATAAAAGCGC TGATCCCCGA CTGGGCGAAA AGCGCGCTGG GCATCAGCGC TGAAATGCCG
TCTGTCGCCG CCGCCGTCCC CGGTATTGCC GGAACAATGG TCGCGCAACA GGCCAGCGCG
CCGCTGGCAT CGGGAGCGAA AGCGGTGACA ACCTCGGCCA AAACGATGGC CTCGCCGCAG
CCTGTGAAGA CGAAAAGCGC CGCCACGCCG CCGACGCCAG CCGCGCTTCC CGGCAAATCC
GGCGGGAAAC CTTATACGCT GCCCTCCCGC GCGCAAAGCA ACGTGCAGGT ACACTTTTCC
CCGCAGGTTA CCGTGCAGGG AAGCGGCGCG AATGCCGCCA AAGATATCAA CAACGTGCTG
TCGCTGAGCA AACGCGAGCT GGAGAGAATG ATTAACGATG TCATGGCGCA GCAACGGCGC
CGGGAGTACG CATAA
 
Protein sequence
MANDIITQLQ ARNETLTQAI ARYGSLNAST LHTLSFEQTK VTRLTQQLAN SALRREENDK 
QRAGLLEKTQ TFAGQLGKLL NVETPDWKLP YEFQGNMVDM AAKGGMDNTA RDALSLNIRD
WSLDFNQDQK DLQSTAATMI EGGVSALQDL SRYMPDIAKA ATASRDSAQS WAQAALATRD
KLNIAPDDFR FAQNMLYSVA KSGGGSVAEQ TQWINAFAGK TGAQGKEGIA ELTATMQIAM
KNAPDAGAAA ANFDHFLKSA FSKETDSWFA RQGVDLQGSL LEHQQNGIGV TEAMTHIVQM
QLEKMNPQIL DTFRQTMKIE DLSARGDALQ AMVEKFNLGA MFGDAQTRDF LAPMLANMDE
YRQLKASAMQ AAGQHVIDDD FAAKMTSPGE QTKALQLSLN DLWLTVGLEL MPAIGELAQS
ITPLVRQFSA WLRENPALVQ GVAKVVSVIW LFNGALNILR LGANLIASPF IRLIDIFLKV
KAGLALGGGS RALSVLKSFG NGAKSLTMLL GNGLIKGLRL VGQTFIWLGR ALLMNPVGLT
ITAIAGAAYL LYRYWEPISG FFAGVWERIK TAFDGGIAGV TRLILDWSPL GLFYRAFAGV
LDWFGIELPA SFSEFGGNIL DSLINGILNA LPFLSGAIEK IKALIPDWAK SALGISAEMP
SVAAAVPGIA GTMVAQQASA PLASGAKAVT TSAKTMASPQ PVKTKSAATP PTPAALPGKS
GGKPYTLPSR AQSNVQVHFS PQVTVQGSGA NAAKDINNVL SLSKRELERM INDVMAQQRR
REYA