Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2850 |
Symbol | |
ID | 6794868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 2794696 |
End bp | 2796462 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642777022 |
Product | tail fiber domain protein |
Protein accession | YP_002147636 |
Protein GI | 197248971 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAA AATTTTATAC CCTGCTGACG GAGATCGGCG CGGCGAAACT GGCAAGCGCC GCCGCGCTCG GTGTCCCGTT GAAAATTACC CATATGGCGG TGGGTAGCGG TGGTGGTGTG CTGCCCACAC CCAACGCGCA ACAGACCGCA TTAGTTGCTG AGGAGCGCCG CGCAGCGCTG AATATGCTGT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAGCA GGTGATCCCG GAAAATGAGG GCGGGTGGTG GATTCGTGAA GTCGGCCTGT TTGATGAAAC CGGTGCGCTG ATCGCTGTGG GTAACTGCCC GGAGAGCTAC AAGCCGAAGC TGGTAGAAGG GAGCGGACGT ACGCAGACCG TGCGCATGGT ACTGATTACC AGCAGCACCG ATAACATCAC CCTGAAAATT GACCCTGCAG TAGTGCTGGC AACCCGTAAA TATGCGGATG ATAAGGCGCT GGAGCTGAAG GTATATGTAG ACGACCTGAT GGCAAAACAT CTTGCTGCAG TAGATCCACA TACGCAGTAC GCGCCTAAAG TTAGCCCGTC GTTCACAGGT ACACCAAGAG CGCCAACGCC AGCGGCAGGA AATAACAGCA GCCAGATTGC TAATACGGCA TTTGTTCAGG CAGCGATAGC AGCACTGGTT GCGTCATCGC CAGCGGCGCT GGATACATTA AATGAACTGG CTGCTGCGCT GGGGAATGAT CCAAACTTTG CGACAACAGT GATGAATGCG CTGGCAGGAA AACAACCTCT CGATACCACG CTGACGAATC TGAGTGGAAA AGACAATGCA GGGATTCTCC AATACCTCGG TTTAAAAGAC GCACTTTTAA AAGGTGACGG GCGATTCCTT GCGGGAACGT TTGTCAGTGA CGCAATTGAC CGAACATCAA TTGGTGCCAG AGCGGCTACA GGCTGTCAGT TTATGCGCGC ACATCAGGCA CCTGATGCGC CAGACCAGGT AAGTTTCTGG CAAATTATTA CCCTTAGCGA GGTGATAAGT CCGACCACTG TTGTGGATGT TCTTGCAGTC AGTGGCAATA ACGTATTGTT TGGTCACGGT ACAGGAGCGG GTATTACCTC ATGGCGTCAA GTGGCGATGC TGGAGGGAGG CGCCTTTACG GGGGGTATTT CTGCACCAAA TATGCGTGGT GATACCCTGG TTACAGTTGG GGATGGCACT GGTGGGATGG CTAAAGGTGA CGTTGATGGT GCAGGTTTTA ATGGTAACAA TCTGAACATT AAGTCATGGA ATGGTATCGG ATTTCAGAAC TCAGAAGACC TGGCTATCCG GGCATATATC AGCACCCGAC TCGGTGTTAT CGCAGCTGCT GAAAATTTGC AGGCCGGAAA TGCGATATTC AACAAAAACG GCGATGTTTA CGGCGATATA TGGGGCACAG GCAGCGGGCC TGGCTGGTTG AGTGCGTATA TTGCAGGCAG ACCGTTACGA CAATACATCA CCATGGTCGG TGTGTACCAG AACGACAAAA CAAAGCCATT TATGCTTCAT GATGATGGTT CTGGTGTATT CCTTGCTACA ACTGACATGC TAAGTGGGTA TGTTCAGTCA ATTCGATTCG GTGCCGTTGA GCATGGAAAC GTATATCGTT CGCCCGGATT TGCAGACCAA TTAGGTTACG TCATTACAGG TGTTGAGAAT GGAGACTCGA ACGAAACACC AGACAGGATC CAACGACGCT TGTTACAGCT TAAAGTGAAT GGTCAGTGGT ATACGGTGGG GGCATAA
|
Protein sequence | MSAKFYTLLT EIGAAKLASA AALGVPLKIT HMAVGSGGGV LPTPNAQQTA LVAEERRAAL NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDETGAL IAVGNCPESY KPKLVEGSGR TQTVRMVLIT SSTDNITLKI DPAVVLATRK YADDKALELK VYVDDLMAKH LAAVDPHTQY APKVSPSFTG TPRAPTPAAG NNSSQIANTA FVQAAIAALV ASSPAALDTL NELAAALGND PNFATTVMNA LAGKQPLDTT LTNLSGKDNA GILQYLGLKD ALLKGDGRFL AGTFVSDAID RTSIGARAAT GCQFMRAHQA PDAPDQVSFW QIITLSEVIS PTTVVDVLAV SGNNVLFGHG TGAGITSWRQ VAMLEGGAFT GGISAPNMRG DTLVTVGDGT GGMAKGDVDG AGFNGNNLNI KSWNGIGFQN SEDLAIRAYI STRLGVIAAA ENLQAGNAIF NKNGDVYGDI WGTGSGPGWL SAYIAGRPLR QYITMVGVYQ NDKTKPFMLH DDGSGVFLAT TDMLSGYVQS IRFGAVEHGN VYRSPGFADQ LGYVITGVEN GDSNETPDRI QRRLLQLKVN GQWYTVGA
|
| |