Gene Information |
Locus tag | SeHA_C0469 |
Symbol | |
ID | 6489265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 472630 |
End bp | 475236 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642740738 |
Product | putative autotransporter/pertactin |
Protein accession | YP_002044405 |
Protein GI | 194447396 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
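
The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. The variable names are illustrative and not part of the record. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 472630
end_bp = 475236

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2607
print(protein_length)  # 868
```

Both results agree with the Gene Length (2607 bp) and Protein Length (868 aa) fields.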
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.043223 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGATG ACACAGCATA TGTTGATCAT GACTGGGATG ATGGCGATAC ATTTACCCTC AATATCGCTA ACTCCACTAT CGATGATGAT TATGAATATT TCTACTTCAC CGATGATTAT AATAATGCTG ATGGAAAAGT TACTACGGAA GATTGGCGTA CTTCCGAATT AGCTTCTTTA GGCACTGCTG TAACGTTGGA TGTCGAAAGT AATATTAATA TCACCAATAA CTCTCGTGTT GCAGGTATTA CTCTGAGCCA GGGCGATACT TACGACGCAA CATATGCTAC TGGTGGTGAT AGCGCTTATG TTCATACTTG GGACAACACT ATTGTCGTTG ACAATTCCAC TGTTACATCA GGGGCTGTGA CTCCGCTTGA GGGTAGCGGT TGGTTTGGTA ACTCTTCAGA GCCAAGCGAT TATTCTGGCA ACGCTGTTTA TGATACAACG ACAGGAACCT GGTACCATAA TCCGAATGAC ACTGCTTTAT CGTTCTCAGA TGATCCTGAC TCATACTATT CGATGAAGAA CAACGTTACA TTCACCAACG GTTCAACGTT GATGGGTGAT GTTGTGTTTA CGAGTAACTT CAACAATGCT GACGACGCGA ATGCTGATTC CAATGGCGAT GGTGTTATCA GCGCCAGCGA TGGTTTCAGC CCAATCGGTT ATGACACCAA CAATGATGGC GTAGAAGATA CGAATGGCGG TTGGAGTCAC GACAACGATA ACGTTGATGA ACTGAACCTC AAACTGGATA ACGGTAGTAA GTGGGTAGGT GACGCATACT TTAGCTATGA GTATATCGCA CCGGCTGATA TGTACGATCT TGAAGATGGT ACTAACAGCT TAGAACCAAG TTCTACTGTG GATAAATGGG GCAACGTTGT TGATGACAAG ACCTTCCAGA GCGGTATCTT CACTGTAGCG CTGGATAACG GTTCTGAATG GGATACGGTA AACGCTTCTA ACGTCGATAC CCTAACTGTT AATAATGGTT CTCAGGTTAA CGTAGCTGAT AGTTCTTCTT TAATCGCAGA CACCATCACA TTGACCAATG GTTCAACGAT GAACCTCAGT TCCTATGGTG AAGTTGATAC CGATCACCTG ACGGTTGATA GTTACAGTAA AGTTGATCTG ACAAACGAAA CTGCTTATCT GTATGCTAAC ACCATTACCG TATCAAACGG CGGTGAATTC AGCATCGGTG CTGGTGAATT TGATGCCGAT TCTTTCGGTA CGGATACTCT GGAACTGACC AACGCTGGTG TATTTAACAT CAACAACAGC GACTATGTGC TGGATGCAGA TTTGGTTAAC GGCCACACCA ACACAACCGA TACATCAAAT GCTACCTATG GTTACGGTGT CATCGCTATG ACTTCTGACG GTCATTTGAC CGTGAATGGT AATGGTGATT ATTACAACGG TGATAACACT GCCGATACTA CTTACAGTGC TAACGGTGAA GCGGATAATA GCTACACGGA CAATGTTGTA GCGGCTACCG GTAACTATAA AGTGCGCATC GACAACGCTA CTGGTGCGGG TTCTGTTGCG GATTACAAAG GCAACGAGCT GATTCGTGTC AATGACGTAA ACACCGACGC AACCTTCTCT GCAGCAAACA AAGCTGACCT GGGTGCTTAC ACCTATCAGG CTAAGCAGGA AGGCAACACT GTCGTGCTGG AACAGATGGA GCTGACCGAC TACGCTAACA TGGCGCTGAG CATTCCTTCT GCGAACACCA ATATCTGGAA CCTGGAACAA GACACCGTTG GTACTCGTCT GACCAACGCT 
CGTCATGGCC TGGCGGATAA CGGCGGCGCA TGGGTAAGCT ACTTCGGCGG TAACTTCAAC GGCGACAACG GCACCATTAA CTACGATCAG GATGTTAATG GCATCATGGT CGGTGTTGAT ACCAAAGTTG ACGGTAACAA CGCTAAGTGG ATCGTTGGTG CGGCAGCAGG CTTCGCGAAA GGCGATCTGA GCGATCGTAC CGGTCAGGTG GATCAGGACA GCCAGTCTGC CTACATCTAC TCTTCCGCTC GTTTCGCAAA CAACATCTTT GTTGACGGTA ACTTGAGCTA CTCTCACTTC AACAACGATT TGTCTGCTAA CATGAGCGAC GGTACTTACG TTGACGGCAA CACCTCTTCT GACGCCTGGG GCTTCGGCTT GAAACTGGGT TATGATCTGA AGCTGGGTGA TGCAGGCTAC GTAACGCCTT ACGGCAGCGT ATCCGGTCTG TTCCAGTCTG GCGATGACTA CCAGCTGAGC AACGACATGA AAGTTGACGG TCAGTCTTAC GACAGCATGC GTTATGAACT CGGTGTAGAT GCAGGTTATA CCTTCACTTA CAGCGAAGAT CAGGCGTTGA CGCCGTACTT CAAACTGGCT TACGTTTACG ACGACTCCAA CAACGATGCT GACGTAAACG GCGACTCTAT CGACAACGGC GTAGAAGGTT CTGCGGTACG TGTTGGTCTG GGTACTCAGT TCAGCTTCAC GAAGAACTTC AGCGCCTACA CCGATGCTAA CTACCTCGGC GGCGGTGATG TTGATCAAGA CTGGTCTGCA AACGTTGGTG TTAAATATAC CTGGTAA
|
Protein sequence | MWDDTAYVDH DWDDGDTFTL NIANSTIDDD YEYFYFTDDY NNADGKVTTE DWRTSELASL GTAVTLDVES NINITNNSRV AGITLSQGDT YDATYATGGD SAYVHTWDNT IVVDNSTVTS GAVTPLEGSG WFGNSSEPSD YSGNAVYDTT TGTWYHNPND TALSFSDDPD SYYSMKNNVT FTNGSTLMGD VVFTSNFNNA DDANADSNGD GVISASDGFS PIGYDTNNDG VEDTNGGWSH DNDNVDELNL KLDNGSKWVG DAYFSYEYIA PADMYDLEDG TNSLEPSSTV DKWGNVVDDK TFQSGIFTVA LDNGSEWDTV NASNVDTLTV NNGSQVNVAD SSSLIADTIT LTNGSTMNLS SYGEVDTDHL TVDSYSKVDL TNETAYLYAN TITVSNGGEF SIGAGEFDAD SFGTDTLELT NAGVFNINNS DYVLDADLVN GHTNTTDTSN ATYGYGVIAM TSDGHLTVNG NGDYYNGDNT ADTTYSANGE ADNSYTDNVV AATGNYKVRI DNATGAGSVA DYKGNELIRV NDVNTDATFS AANKADLGAY TYQAKQEGNT VVLEQMELTD YANMALSIPS ANTNIWNLEQ DTVGTRLTNA RHGLADNGGA WVSYFGGNFN GDNGTINYDQ DVNGIMVGVD TKVDGNNAKW IVGAAAGFAK GDLSDRTGQV DQDSQSAYIY SSARFANNIF VDGNLSYSHF NNDLSANMSD GTYVDGNTSS DAWGFGLKLG YDLKLGDAGY VTPYGSVSGL FQSGDDYQLS NDMKVDGQSY DSMRYELGVD AGYTFTYSED QALTPYFKLA YVYDDSNNDA DVNGDSIDNG VEGSAVRVGL GTQFSFTKNF SAYTDANYLG GGDVDQDWSA NVGVKYTW
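
The sequences in this record are printed in space-separated groups of ten residues. The following is a minimal sketch of how such a string might be cleaned, summarized, and translated using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and do not come from the source database. The codon table is the standard one, which shares its codon-to-amino-acid assignments with translation table 11 (the two differ in permitted start codons).

```python
# Hypothetical helpers for illustration; not part of the source record.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
prefix = clean("ATGTGGGATG ACACAGCATA TGTTGATCAT")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # 0.4
print(translate(prefix))    # MWDDTAYVDH
```

The translated prefix matches the first ten residues of the protein sequence in this record. Applied to the full gene sequence, the same helpers could be used to compare against the GC content and Protein Length fields.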
|
| |