Gene SeHA_C4536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4536 
Symbol 
ID6490510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4416479 
End bp4418137 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content52% 
IMG OID642744608 
Producttail fiber domain-containing protein 
Protein accessionYP_002048185 
Protein GI194449628 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value0.371688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATG AGTTTTATAC CCTCCTGACC GACAGGGGAA TGGCGAAAAT CGCCAGCGCC 
CTTGCGGACA AAAAACAGAT ACATCTGCAA AAGATGGCGG TTGGCGACGG CGGCGGACAA
TATTATGAAC CGACCGCCAG CCAGACCAAT TTACGCCACG AAGTCTGGCG CGGCGAGATG
AATACGCTGA CCGTTGCACC GAATAATCCC AACTGGCTGA TTGCCGAACT GGTGCTGCCG
GAGGACGTTG GCGGCTGGTA TGTGCGTGAA GTGGGCGTGT TCGACGACGA GGGCGAGCTA
ATCGCCATCG GCAAATTCCC GGAATCTTAC AAACCGCTGT TGCCTGGCGG CTGCGGCAAG
CAGGTCTGTA TCCGCCTGAT TATGGAGGTC TCCAACACCA CGGCGGTGAC GCTGACGGTC
GATCCGAGTA TTGTGCTGGC GACGCGCGAC TATGTGGATG CCCGGCTGGA CGAGCATGAA
CATTCGACAA ATCACCCGGA TGCGACATTA ACGCAGAAAG GGTTTACGCA GCTCAGTAAC
GCCACCGACA GCGATGACGA AACCAAAGCG GCTACGCCAA AGGCGGTAAA AGCGGCGATG
GCGGAAGCGC GTAATCACAC GCATACCTGG AACCAGATTA CCGGCGTTCC GGACGGCACG
CTGACGCAAA AGGGGATTGT TCAGCTTAGT AGTGCTACTG ATAGTACCAG TGAAGTACTG
GCTGCAACGC CAAAAGCGGT AAAGGCGGCG ATGGATAAGG CGAATGCGGC AGCTCCGGCC
AGCCATACTC ACGCCTGGAA CCAGATTACC GGCGTCCCGG ACGGTACGCT GACGCAAAAA
GGGATCGTGA AGCTTAACAG CGCGACGGAC AGCACCAGTA CGACGGAGGC GGCAACGCCC
AGTGCGGTAA AGGCGGCGAT GGACAAGGCA AACGCGGCGG CCCCGGCGAA CCATACTCAT
ACGCAGTTTT TTACCACAAA TGGGACATTT ACGGTTCCAG ATGGAGTGAC GACTCTATTT
ATCGAAGTAA TGGGCGGTGG CGGCGGTGGG GCTGGCGGGT CTCAATCAAT CTATTACGAG
GCGCGTGGCG GTCACGCTGG AGAACAAATT GTCAGTATTG TTAATGTAGT TCCAGGCCAA
CAATTTCCCG TAAAAATTGG AGCTGGTGGG TGTGGGGGAG CATTTTGGTC AAATCCCCCG
ACGACATCAG TAGGAACAGT GACTGATCAA ACGACTATCT ATAGAAAGAG TTTTGATGGC
GGTAGTTCCT CCTTCTCAGA CATCACTGCG GCTGGGGGAA TTGGAGGCGA AAGTATTTAC
CATACTAGAA ATATTCAACC TTATATTAAA TTTGTTGATC ATCCTATGCC ATATGCCTCG
CACGAAATGG TTGTATATGC AGAATTATAC TATGGACACA GCGGTGAAGG CTCACTTTAT
GGAGCCGGAG GGAAACCAGG AACAGTTATA ACCGAATCAT TAGCGAATGG AGGATATAAA
GCAAATATGA TACCTCCTAC ATCAGCGACA GGCTATGGTG CAGGAGGGGC TGGCGGCTCC
TATCTTCCAC CATTTAATTA TCAAAATAGT GACTTAACAA ATCTTGGAAA TACCAGTGGC
ACAAATGGCT CCCCTGGTTT CGTAAAAATT TCATGGTAA
 
Protein sequence
MDNEFYTLLT DRGMAKIASA LADKKQIHLQ KMAVGDGGGQ YYEPTASQTN LRHEVWRGEM 
NTLTVAPNNP NWLIAELVLP EDVGGWYVRE VGVFDDEGEL IAIGKFPESY KPLLPGGCGK
QVCIRLIMEV SNTTAVTLTV DPSIVLATRD YVDARLDEHE HSTNHPDATL TQKGFTQLSN
ATDSDDETKA ATPKAVKAAM AEARNHTHTW NQITGVPDGT LTQKGIVQLS SATDSTSEVL
AATPKAVKAA MDKANAAAPA SHTHAWNQIT GVPDGTLTQK GIVKLNSATD STSTTEAATP
SAVKAAMDKA NAAAPANHTH TQFFTTNGTF TVPDGVTTLF IEVMGGGGGG AGGSQSIYYE
ARGGHAGEQI VSIVNVVPGQ QFPVKIGAGG CGGAFWSNPP TTSVGTVTDQ TTIYRKSFDG
GSSSFSDITA AGGIGGESIY HTRNIQPYIK FVDHPMPYAS HEMVVYAELY YGHSGEGSLY
GAGGKPGTVI TESLANGGYK ANMIPPTSAT GYGAGGAGGS YLPPFNYQNS DLTNLGNTSG
TNGSPGFVKI SW