Gene Information |
Locus tag | SeHA_C4536 |
Symbol | |
ID | 6490510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4416479 |
End bp | 4418137 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642744608 |
Product | tail fiber domain-containing protein |
Protein accession | YP_002048185 |
Protein GI | 194449628 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
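The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of any database API, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4416479, 4418137)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two results agree with the Gene Length (1659 bp) and Protein Length (552 aa) fields.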
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 0.371688 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAATG AGTTTTATAC CCTCCTGACC GACAGGGGAA TGGCGAAAAT CGCCAGCGCC CTTGCGGACA AAAAACAGAT ACATCTGCAA AAGATGGCGG TTGGCGACGG CGGCGGACAA TATTATGAAC CGACCGCCAG CCAGACCAAT TTACGCCACG AAGTCTGGCG CGGCGAGATG AATACGCTGA CCGTTGCACC GAATAATCCC AACTGGCTGA TTGCCGAACT GGTGCTGCCG GAGGACGTTG GCGGCTGGTA TGTGCGTGAA GTGGGCGTGT TCGACGACGA GGGCGAGCTA ATCGCCATCG GCAAATTCCC GGAATCTTAC AAACCGCTGT TGCCTGGCGG CTGCGGCAAG CAGGTCTGTA TCCGCCTGAT TATGGAGGTC TCCAACACCA CGGCGGTGAC GCTGACGGTC GATCCGAGTA TTGTGCTGGC GACGCGCGAC TATGTGGATG CCCGGCTGGA CGAGCATGAA CATTCGACAA ATCACCCGGA TGCGACATTA ACGCAGAAAG GGTTTACGCA GCTCAGTAAC GCCACCGACA GCGATGACGA AACCAAAGCG GCTACGCCAA AGGCGGTAAA AGCGGCGATG GCGGAAGCGC GTAATCACAC GCATACCTGG AACCAGATTA CCGGCGTTCC GGACGGCACG CTGACGCAAA AGGGGATTGT TCAGCTTAGT AGTGCTACTG ATAGTACCAG TGAAGTACTG GCTGCAACGC CAAAAGCGGT AAAGGCGGCG ATGGATAAGG CGAATGCGGC AGCTCCGGCC AGCCATACTC ACGCCTGGAA CCAGATTACC GGCGTCCCGG ACGGTACGCT GACGCAAAAA GGGATCGTGA AGCTTAACAG CGCGACGGAC AGCACCAGTA CGACGGAGGC GGCAACGCCC AGTGCGGTAA AGGCGGCGAT GGACAAGGCA AACGCGGCGG CCCCGGCGAA CCATACTCAT ACGCAGTTTT TTACCACAAA TGGGACATTT ACGGTTCCAG ATGGAGTGAC GACTCTATTT ATCGAAGTAA TGGGCGGTGG CGGCGGTGGG GCTGGCGGGT CTCAATCAAT CTATTACGAG GCGCGTGGCG GTCACGCTGG AGAACAAATT GTCAGTATTG TTAATGTAGT TCCAGGCCAA CAATTTCCCG TAAAAATTGG AGCTGGTGGG TGTGGGGGAG CATTTTGGTC AAATCCCCCG ACGACATCAG TAGGAACAGT GACTGATCAA ACGACTATCT ATAGAAAGAG TTTTGATGGC GGTAGTTCCT CCTTCTCAGA CATCACTGCG GCTGGGGGAA TTGGAGGCGA AAGTATTTAC CATACTAGAA ATATTCAACC TTATATTAAA TTTGTTGATC ATCCTATGCC ATATGCCTCG CACGAAATGG TTGTATATGC AGAATTATAC TATGGACACA GCGGTGAAGG CTCACTTTAT GGAGCCGGAG GGAAACCAGG AACAGTTATA ACCGAATCAT TAGCGAATGG AGGATATAAA GCAAATATGA TACCTCCTAC ATCAGCGACA GGCTATGGTG CAGGAGGGGC TGGCGGCTCC TATCTTCCAC CATTTAATTA TCAAAATAGT GACTTAACAA ATCTTGGAAA TACCAGTGGC ACAAATGGCT CCCCTGGTTT CGTAAAAATT TCATGGTAA
|
Protein sequence | MDNEFYTLLT DRGMAKIASA LADKKQIHLQ KMAVGDGGGQ YYEPTASQTN LRHEVWRGEM NTLTVAPNNP NWLIAELVLP EDVGGWYVRE VGVFDDEGEL IAIGKFPESY KPLLPGGCGK QVCIRLIMEV SNTTAVTLTV DPSIVLATRD YVDARLDEHE HSTNHPDATL TQKGFTQLSN ATDSDDETKA ATPKAVKAAM AEARNHTHTW NQITGVPDGT LTQKGIVQLS SATDSTSEVL AATPKAVKAA MDKANAAAPA SHTHAWNQIT GVPDGTLTQK GIVKLNSATD STSTTEAATP SAVKAAMDKA NAAAPANHTH TQFFTTNGTF TVPDGVTTLF IEVMGGGGGG AGGSQSIYYE ARGGHAGEQI VSIVNVVPGQ QFPVKIGAGG CGGAFWSNPP TTSVGTVTDQ TTIYRKSFDG GSSSFSDITA AGGIGGESIY HTRNIQPYIK FVDHPMPYAS HEMVVYAELY YGHSGEGSLY GAGGKPGTVI TESLANGGYK ANMIPPTSAT GYGAGGAGGS YLPPFNYQNS DLTNLGNTSG TNGSPGFVKI SW
|
| |
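The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that grouping, compute GC content, and translate the gene sequence is given below. The helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for elongation (table 11 differs only in the start codons it permits).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGGATAATG AGTTTTATAC CCTCCTGACC"
print(translate(prefix))  # MDNEFYTLLT
```

The translated prefix matches the first block of the protein sequence listed above. Applying `translate` and `gc_fraction` to the full gene sequence would allow the Protein sequence and GC content fields to be checked in the same way.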