Gene Information |
Locus tag | SeAg_B4452 |
Symbol | |
ID | 6794129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4340075 |
End bp | 4341820 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642778543 |
Product | tail fiber domain protein |
Protein accession | YP_002149109 |
Protein GI | 197251834 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
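The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 4340075
end_bp = 4341820

# An inclusive interval spans end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The final codon is
# the stop codon, which is not counted in the protein length.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1746 581
```

Both values agree with the Gene Length (1746 bp) and Protein Length (581 aa) fields listed in the record.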
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATAATG AGTTTTATAC CCTCCTGACC GACAGGGGAA TGGCGAAAAT CGCCAGCGCC CTTGCGGACA AAAAACAGCT ACATCTACAA AAGATGGCGG TTGGCGACGG CGGCGGGCAA TATTATGAGC CGACCGCCAG CCAGGCCAAA TTACGCCACG AAGTCTGGCG CGGTGAGATG AATACGCTGA CCGTTGCGCC GAATAATCCC AACTGGCTGA TTGCCGAACT GGTGCTGCCG GAGGACGTTG GCGGCTGGTA TGTGCGTGAA GTGGGCGTGT TCGACGACGA GGGCGAGCTA ATCGCCATCG GCAAGTTCCC GGAATCCTAC AAACCGCTGT TGCCCGGCGG CTGCGGCAAG CAGGTCTGTA TCCGCCTGAT TATGGAGGTC TCCAACACCA CGGCGGTGAC GCTGACGGTC GATCCGAGCA TTGTGCTGGC GACGCGCGAC TATGTGGATG TCCGGCTGGA CGAGCATGAA CATTCGACAA ATCACCCGGA TGCGACATTA ACGCAGAAAG GGTTTACACA GCTCAGTAAC GCCACCGACA GCGATGACGA AACCAAAGCC GCTACGCCGA AGGCGGTAAA AGCGGCGATG GCGGAAGCGC GTAATCACAC GCATACCTGG AACCAGATCA CTGGCGTTCC GGACGGCACG CTGACGCAAA AGGGGATTGT TCAGCTTAAC AGTGCGACGG ACAGCACCAG CACAACGGAA GCGGCAACGC CGAGCGCGGT CAAGGCGGCG ATGGATAAGG CGAATGCGGC AGCTCCGGCG AACCATACTC ACGTCTGGAA CCAGATTATC GGCGTCCCGG ACGGCACGCT GGCGCAAAAA GGGATCGTGA AACTTAATAA CGCCACCGAT AGCACCAGCA CCACCGAAGC GGCAACGCCC AGTGCGGTAA AGGCGGCGAT GGATAAGGCA AACGCGGCGG CCCCGGCCAG CCATATACAC GCCTGGGGGC AGATCACCGG CGTCCCGGAC GGTACGCTGA CGCAAAAAGG GATTGTGAAG CTTAATAGCG CCACCGATAG CACCAGCACC ACCGAAGCGG CAACGCCTAG TGCGGTCAAG GCGGCGTATG ACAAGGCGAG CGCAGCGGCT CCGGCTAATC ATTCCCATTA TCAGTTTTTT ACGGCTAACG GTACGTTTAC GGTACCTGAT GGGGTGACTC AGGTCTTTGT GGAAATGTTA GGAGGAGGAG GAGGAGGTGG TGGTGGTGCA GTCACCGATG GAGGATTCGC GGGTGCTAGT GGTGGTTCGG GAGGAACCTG TGGAAGCACA AATATTTCCA TTGTTCCTGT AACTCCGGGA GGAAAATACG CGGTTATAGT TGGCGCAGGA GGGGTTGGCG GAGTCGCAGC CAGTCAGTCG TCTACGGCAC CATCAGGAAT TCACACCTTG GTAACTAGCA CGCCAGGATC CCCCGGTATT GATGGCGGGG ATTCAATTTT TGTTAATGTT ACCGCCAAAG GTGGTTCAGG AGGAGCCGGC GGCGTTATCT CAACTGTATC AGTAATAAAC CCAGCCCCAT CTGGTAATGG CGCAGCAGGG GAAAACTCAT CGTACGGTAC GGGAGGGAGC GGTGGCAGTA ATACTGACGG GGGCAATGCT GGTGGTTATG GTGCCGGGGG CGGCGGAGGC GCCAGAGGCA AAACCACTGG CAGTGATAAT ACCTATAGTG GGTCTGGCTT TCCTGGAGGG AAAGGCTCAA ACGGTTTTGT AAAAATTTCA TGGTGA
Protein sequence | MDNEFYTLLT DRGMAKIASA LADKKQLHLQ KMAVGDGGGQ YYEPTASQAK LRHEVWRGEM NTLTVAPNNP NWLIAELVLP EDVGGWYVRE VGVFDDEGEL IAIGKFPESY KPLLPGGCGK QVCIRLIMEV SNTTAVTLTV DPSIVLATRD YVDVRLDEHE HSTNHPDATL TQKGFTQLSN ATDSDDETKA ATPKAVKAAM AEARNHTHTW NQITGVPDGT LTQKGIVQLN SATDSTSTTE AATPSAVKAA MDKANAAAPA NHTHVWNQII GVPDGTLAQK GIVKLNNATD STSTTEAATP SAVKAAMDKA NAAAPASHIH AWGQITGVPD GTLTQKGIVK LNSATDSTST TEAATPSAVK AAYDKASAAA PANHSHYQFF TANGTFTVPD GVTQVFVEML GGGGGGGGGA VTDGGFAGAS GGSGGTCGST NISIVPVTPG GKYAVIVGAG GVGGVAASQS STAPSGIHTL VTSTPGSPGI DGGDSIFVNV TAKGGSGGAG GVISTVSVIN PAPSGNGAAG ENSSYGTGGS GGSNTDGGNA GGYGAGGGGG ARGKTTGSDN TYSGSGFPGG KGSNGFVKIS W
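The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the DNA. For brevity it uses only the first three blocks of the gene sequence; the function names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for elongation codons. To check the record's 57% GC content or the full 581 aa protein, the complete gene sequence would have to be pasted into `raw`.

```python
from itertools import product

# First three 10-base blocks of the gene sequence above (illustrative prefix only).
raw = "ATGGATAATG AGTTTTATAC CCTCCTGACC"


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# Codon table in TCAG order. Translation table 11 uses the same amino acid
# assignments as the standard code; it differs in the permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


dna = clean(raw)
print(len(dna), gc_fraction(dna), translate(dna))
```

For this 30-base prefix the translation is `MDNEFYTLLT`, which matches the first block of the protein sequence in the record.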