Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0407 |
Symbol | |
ID | 6873927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 429363 |
End bp | 431408 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642783639 |
Product | outer membrane autotransporter |
Protein accession | YP_002214326 |
Protein GI | 198245524 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.000021193 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTGACG TTGTGTTTAC GAGTAACTTC AACAATGCTG GCGACGCGAA TGCTGATTCC AATGGCGATG GTGTTATCAA CGCCAGCGAT GGTTTCAGCA AAATCGGTTA TGACACCAAC AATGATGGTG TAGAGGATAC GAATGGCGGT TGGAGTCACG ACAACGATAA CGTTGATGAA CTGAACCTCA AACTGGATAA CGGTAGTAAG TGGGTAGGTG ACGCATACTT TACCTATGAT CGTATCGATC CGACTGAGAT GTACAATCTT GAAGATGGTA CTAACAGCTT AGAACCTGAT TCTGTTTTCG ATAAATGGGG CAACGTTGTT GATGACAAGA CCTTCCAGAG CGGTATCTTC ACTGTAGCGC TGGATAACGG TTCTGAATGG GATACGGTAA ACGCTTCAAA CGTCGATACC CTAACTGTTA ATAATGGTTC TCAGGTTAAC GTAGCTGATA GTTCTTCTTT AATCGCAGAC ACCATCACAT TGACCAATGG TTCAACGATG AACCTCAGTT CCTATGGTGA AGTTGATACC GATCACCTGA CGGTTGATAG TTACAGTAAA GTTGATCTGA CAAACGAAAC TGCTTATCTG TATGCTAACA CTATTACCGT ATCAAACGGC GGTGAATTCA GCATCGGTGC TGGTGAATTT GATGCCGATT CTTTCGGTAC GGATACTCTG GAACTGACCA ACGCTGGTGT ATTTAACATC AACAACAGCG ACTATGTGCT GGATGCAGAT TTGGTTAACG GCCACACCAA TACAACCGAT ACATCAAATG CTACCTATGG TTACGGTGTC ATCGCTATGA CTTCTGACGG TCATTTGACC GTGAATGGTA ATGGTGATTA TTACAACGGT GATAACACTG CCGATACTAC TTACAGTGCT AACGGTGAAG CGGATAATAG CTACACGGAC AATGTTGTAG CGGCTACCGG TAACTATAAA GTGCGCATCG ACAACGCTAC TGGTGCGGGT TCTGTTGCGG ATTACAAAGG CAACGAGCTG ATTCGTGTCA ATGACGTAAA CACCGACGCA ACCTTCTCTG CAGCAAACAA AGCTGACCTG GGTGCTTACA CCTATCAGGC TAAGCAGGAA GGCAACACTG TCGTGCTGGA ACAGATGGAA CTGACCGACT ACGCTAACAT GGCGCTGAGC ATTCCTTCTG CGAACACCAA TATCTGGAAC CTGGAACAAG ACACCGTTGG TACTCGTCTG ACCAACGCTC GTCATGGCCT GGCGGATAAC GGCGGCGCAT GGGTAAGCTA CTTCGGCGGT AACTTCAACG GCGACAACGG CACCATTAAC TACGATCAGG ATGTTAATGG CATCATGGTC GGTGTTGATA CCAAAGTTGA TGGTAACAAC GCTAAGTGGA TCGTTGGTGC GGCAGCAGGC TTCGCGAAAG GCGATCTGAG CGATCGTACC GGTCAGGTGG ATCAGGACAG CCAGTCTGCC TACATCTACT CTTCCGCTCG TTTCGCAAAC AACATCTTTG TTGACGGTAA CTTGAGCTAC TCTCACTTCA ACAACGATTT GTCTGCTAAC ATGAGCGACG GTACTTACGT TGACGGCAAC ACCTCTTCTG ACGCCTGGGG CTTCGGCTTG AAACTGGGTT ATGATCTGAA GCTGGGTGAT GCAGGCTACG TAACGCCTTA CGGCAGCGTA TCCGGTCTGT TCCAGTCCGG CGACGACTAC CAGCTGAGCA ACGACATGAA AGTTGACGGT CAGTCTTACG ACAGCATGCG TTATGAACTC GGTGTAGATG CAGGTTATAC CTTCACTTAC AGCGAAGATC AGGCGCTGAC CCCGTACTTC AAACTGGCTT ACGTTTACGA CGACTCCAAC AACGATGCTG ACGTAAACGG CGACTCTATC GACAACGGCG TAGAAGGTTC TGCGGTACGT GTTGGTCTGG GTACTCAGTT CAGCTTCACG AAGAACTTCA GCGCCTACAC CGATGCTAAC TACCTCGGCG GCGGTGATGT TGATCAAGAC TGGTCTGCAA ACGTTGGTGT TAAATATACC TGGTAA
|
Protein sequence | MGDVVFTSNF NNAGDANADS NGDGVINASD GFSKIGYDTN NDGVEDTNGG WSHDNDNVDE LNLKLDNGSK WVGDAYFTYD RIDPTEMYNL EDGTNSLEPD SVFDKWGNVV DDKTFQSGIF TVALDNGSEW DTVNASNVDT LTVNNGSQVN VADSSSLIAD TITLTNGSTM NLSSYGEVDT DHLTVDSYSK VDLTNETAYL YANTITVSNG GEFSIGAGEF DADSFGTDTL ELTNAGVFNI NNSDYVLDAD LVNGHTNTTD TSNATYGYGV IAMTSDGHLT VNGNGDYYNG DNTADTTYSA NGEADNSYTD NVVAATGNYK VRIDNATGAG SVADYKGNEL IRVNDVNTDA TFSAANKADL GAYTYQAKQE GNTVVLEQME LTDYANMALS IPSANTNIWN LEQDTVGTRL TNARHGLADN GGAWVSYFGG NFNGDNGTIN YDQDVNGIMV GVDTKVDGNN AKWIVGAAAG FAKGDLSDRT GQVDQDSQSA YIYSSARFAN NIFVDGNLSY SHFNNDLSAN MSDGTYVDGN TSSDAWGFGL KLGYDLKLGD AGYVTPYGSV SGLFQSGDDY QLSNDMKVDG QSYDSMRYEL GVDAGYTFTY SEDQALTPYF KLAYVYDDSN NDADVNGDSI DNGVEGSAVR VGLGTQFSFT KNFSAYTDAN YLGGGDVDQD WSANVGVKYT W
|
| |