Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3785 |
Symbol | |
ID | 6492056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3643473 |
End bp | 3649298 |
Gene Length | 5826 bp |
Protein Length | 1941 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642743897 |
Product | porin autotransporter |
Protein accession | YP_002047503 |
Protein GI | 194448727 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGA AAAAACTTAT TTCTATCGCT ATCGCTTTAA CGCTACAAAG TTATTACATT CCGGCCATCG CCGCAGAAAA TAACGATGAT GAAAAAGAAT GTCCCAGTAA TATCTCCTCC CTGCCTAAAG AAAAACGCGC AAAACTTTCA CCGACCTGCC TTGCTACACC TGAAAATGAT AATCACTGGG GCTGGGTTGC TGGCGGCGTT GCTGCACTGG TCGCAGGTGT GGCTATTGGC GTTGAAAATA ACGGTGGCGG AGATTCTAAT CATTCTTATA CCCCCCCTAA GCCCGATAAT GGCGGCGACG TCACCCCGCC CGACGATGGC GGCAACGTCA CCCCGCCCGA CGATGGCGGC GATGACAATG TGACCCCGCC CGACGATAGT GGCGATGACG ATGTGGCCCC GCCTGACGAT AGCGGCGATG ACGATGTAAC CCCGCCCGAC GATAGCGGCG ATGACGATGT GACCCCGCCC GATGATAGCG GCGATGACGA TGTAACCCCG CCCGACGATA GCGGCGATGA CGATGTAACC CCGCCCGATG ATAGCGGCGA TGACGATGTA ACCCCGCCCG ACGATAGCGG CGATGACGAT GTAACCCCGC CCGATGATAG CGGCGATGAC GATGTAACCC CGCCTGACGA TAGCGGCGAT GACGATGTGA CCCCGCCCGA CGATAGCGGC GACGACGACG ACACGCCCCC GGATGACTCT GTTATAACCT TCAGCAACGG CGTCACCATC GATAAAGGCA AAGACACCCT GACCTTCGAT AGCTTCAAAC TGGATAACGG CAGCGTTCTC GAGGGTGCCG TGTGGAATTA TTCAGAACAG GACAACCAGT GGCAGCTCAC CACCGCGGAC GGCAAAACGC TGAACGTCAC CGGCTGGGAC GTGACCGACG CCAATGCCGC CGTGATTGAA GGCACCCAGG AAAACGGTCT CTACTGGAAG TACGACAGCC GGGGCTATCT GATTATTGCC GACGATAACA CCACCGTTAT CAGCGGCGAT GACCAGGCGC ATAATTCCGA TCGCGGCATG GATATCAGCG GCCAGGATCG CACCGGCGTG ATTATTTCCG GCGATAGAAC CGTCAACACG CTCACCGGGG ACTCCAGTGT GACCGACGGT GCCACCGGCA TGGTTATCTC CGGTGACGGC ACCACCAACA CCATTTCGGG CCACTCCACG GTGGACAACG CCACCGGCGC GCTGATTTCC GGCAACGGCA CCACCACCAA TTTCGCCGGT GACATTGCCG TGAGCGGCGG CGGTACCGCC ATCATCATCG ACGGCGACAA CGCCACGATT AAGAATACCG GTACCTCTAA CATCAGCGGC GCAGGCTCCA CCGGCACCGT CATTGACGGC AATAACGCCC GCGTCAACAA TGACGGTGAT ATGACCATCA CCGACGGCGG CACCGGCGGC CACATTACCG GCGATAACGT GGTTATCGAT AACGCCGGGA GCACTACCGT CAGCGGCGCG GACTCCACGG CGCTGTATAT CGATGGCGAC AACGCGCTCG TTATCAACGA AGGTAATCAA ACTATCTCTG GCGGCGCCGT CGGTACGCGC ATTGACGGCG ACGACGCCCA TACCACCAAT ACCGGTGATA TCGCGGTGGA TGGCGCGGGC TCTGCCGCCG TGATTATCAA CGGCGACAAC GGCAGCCTGA CCCAGGCGGG CGATCTGCTG GTCACCGACG GCGCGATGGG CATCATCACC TACGGCACCG GAAATGAAGC AAAAAATACC GGCAACGCCA CCGTGCGTGA TGCGGACTCG GTGGGTTTTG TGGTTGCAGG CGAAAAAAAC ACCTTCAAAA ACAAAGGGGA TATTGACGTC AGCCTTAACG GCACCGGCGC GCTGGTGAGC GGCGATATGT CGCAGGTTAC GCTGGATGGC GATATTAACG TTGTCTCAGT CCAGGACAGC GAAGGCGTGT TTAGCTCAGC GACAGGGGTG AGCGTGAGCG GCGACAGCAA CGCCGTTGAT ATCACCGGCA ACGTAAATAT CAGCGCCGAC TACGGGCAGG ATGATCTGGC TGCCGGGGCT CCCCCGTTAA CCGGCGTTGT CGTCGGCGGT AACGGCAATA CCGTTACCCT TAATGGCGCG CTGAATATTG ATGACAACGA TCTGTCCGCC ACCAGCGGAC AATACCTGGA CGTTGTTGGC CTGAGCGTAA CAGGTGATGA TAACGACGTT GAGATTGGCG GCGGTATTAA TATCACCCAC AGCGAGGATC CACTTGATGG AACCTCTGCA GACATTACCG GCATCAGCGT CAGCGGTAAC AGTACCGTTA CGCTAAACGG TCATTCTACC ATTGATACCA ACACGGTAGT GGGGGGTCAC GTTGTACTGG CGCGGGTCAA CAACGGCGGC TCCCTGATTC TGGGTGATGA CTCAGTTGTT GACGTTAATG TCAGTTATAT ACCCACAGGC TATTACACCT ACAACGCGTT GCTGATGGCT GATGGCGAAG GCACATCAAT TGAAAATAAA GGCGATATTA CAAGCCATGG CGTGTATTCC GTCATTCGCG CAGATAACGG CTCGGAAGTC AGCAACAGCG GAGATATTCT GGTCTACGCG ACCAGTAGCA ACAGTAGTGA GGATCGTGCA GCCATCACAA GGGCTAGTGG CGAGGGATCG GCTGTTCATA ACAAAGCCAG CGGCGATATC ACCCTCATTT CTGATCAAAC GCCGCAGGGC AGTGGCGGTA TTGAAGTATA CCCATTGAAA TGGTACACCC ACACCTTTTA CGCCATGATG GCTTCGGATT ATGGCGATGT CGTTAACGAT GAGGGCGCCA CGATCCATTT GCAGGGGGCA GGTGTATATG GCGTTACCGC CAGCCGAGGT AAAGCACTAA ACGAAGGCGA TATCTATCTG GATGGCCTTG TCCCGACGCT GGACGATGAA AATAACATCA CCAGCACCAG CTACTGGCAG CCATCATCGC TCTATCTCAC CAGCTCAGGA ATGGTGGCGG GTTCCACCGA TGCCGATGGC GACGCCACCG CCATCAACAC CGGCAACATT ACCGTCAACA ACGCCGGGTT CGGCATGATG GCGCTTAATG GCGGCACCGC CATTAACCAG GGCGTGATCA CCCTGACCGC CGATGACGGC GTGACCGGTC AGGCAGACGA GCTGGTCGGG ATGGCGGCGC TCAACGGCGG CGTCGTCATC AACGACACCA GCGGCGTGAT TAACATCGAC GCCGATTACG GCCAGGCGTT TCTGAGCGAC AGCTCCAGCT ATATCATCAA TAACGGCTCC ATCAACCTTA ACGGCAGCCC GATGGATGAT ACTGACTCCC ATATGGGCGG CACGCCAACG GACAAAATCT GGATTCAGTC CCTGCCCGGC AGCGGCGACA GCGACACCAG GACCTCCGAC ACAGGTTTCT TCACCGCCGG TACGCTGGCC AACTACGGTA CTGAAACCCT GAACGGCGAT GTGGACGTTA ACGGGGGCTG GCTGTACAAC GAAGCGGGCG CCTCGCTCAC CGTCAACGGC ACCGTGACGA TTAACGGCGG GGCTAACGCG CTGGCTAACT ACGGGACGCT GGACGCGGAC GCTATTTCCA CCTGGCACAG CCTCTTTAAT GAAGCGGACG GCAGCATCAC CACCGATCTG TTAACCCTTA ATGGCGACGT CACTTTTTAC AATAACGGCG ATTTCACCGG CTCCATAGCG GGCACCAGCT ACCAGCAGGA AATCGTCAAT ACTGGCGATA TGACGGTGGC GGAAGATGGC AAATCGCTGG TCAGCGGCAG CTTCTATTTC TATAACGAAG AGGACGCAAC GCTTACCAAC AGCGGCAGCG CGGTGGAAGG CGGTGAGAAC ACCATCATCA ATCTGACGCG CGCCAACGAT TCGCTGACCC AGGTGAACAG CGGCACCATC ACCGCCACTA ACGGTTACAG CGCCATCACC ACGGTCAACG GCAGCAATGA CCCCAAATGG ATCTGGAACA CCGCAACCGG CGTGATTAAC GGTATTAACC CGGATGCGCC GCTAATCAAT TTGGGCCGCG GCTATAACTT CGGCAACCAG GGCACTATCA ACGTGCAGGG CGATAACGCC GTGGCGATTA GCGGCGGCAC CAGCAGTTAT GTCATTAACC TGGTCAATAG CGGCACCATC AACGTCGGTA CCGAGCAGGG CAAGGAGGAC GGCACCAATG GCACCGGGCT TATCGGCATC AAGGGCAACG GTAATGCCAC CACCATAAAC AACACCGCAG ACGGCGTGAT CAACGTTTAC GCGGATGACT CGTATGCGTT TGGCGGCAAG ACCAAAGCCA TCATCAACAA CGGCGAAATC AACCTGCTGT GCGACAGCGG CTGCGACATC TACGCGCCGG GTACTACGGG TACGCAGAAC GACCATAACG GGACAGCGGA CATCGTCATT CCGGATGCGA CTACCGCCCC GACCGAGGGC AGCATCCCGA CGCCGCCAGC GGATCCCAAC GCGCCGCAGC AGCTCTCCAA CTATATTGTT GGCACCAACG CCGACGGTAG TTCCGGTACG CTGAAAGCCA ATAACCTGGT GATTGGCGAC AACGTGAAGG TGGATACCGG CTTTACCAGC GGTACGGCGG ATACCACCGT GGTGGTGGAT AACGCCTTCA CCGGCAGCAA TATCCAGGGC GCGGACAACA TCACTTCCAC CAGCGTGGTG TGGAACGCCC AGGGTAGCCA GGATGCCGAC GGCAACGTTG ACGTCACCAT GACCAAAAAC GCCTATGCCG ATGTGGCGAC CGACAGCTCG GTAAGCGACG TGGCGCAGGC GCTGGACGCC GGTTACACCA ACAACGAGCT GTACACCAGC CTTAATGTGG GGACTACCGC CGAGCTGAAC AGCGCACTCA AGCAGGTGAG CGGTGCTCAA GCCACCACGG TATTCCGTGA AGCGCGTGTA CTCAGCAACC GCTTTACCAT GCTGGCCGAC GCCGCGCCGC AAATTAAAGA TGGTCTGGCG TTCAACGTGG TGGCGAAAGG CGACCCACGT GCGGAGCTGG GCAACGACAC CCAGTACGAC ATGCTGGCAT TACGCCAGAC GTTGGATCTC ACCGCCAGCC AGAATCTGAC GCTGGAGTAC GGTATTGCGC GTCTGGATGG CGACGGCTCG AAAACTGCGG GCGATAACGG CCTGACAGGC GGCTACAGCC AGTTCTTTGG CCTGAAGCAC AGCATGGCGT TTGATGAAGG TCTGGCGTGG AACAACAGCC TGCGTTATGA CGTGCACAAC CTCGACAGCA GCCGTTCTGT CGCTTATGGC GATGTCAACA AAATTGCCGA TTCCGACATG CGTCAGCAGT ACCTTGAGTT TCGCAGTGAA GGGGCGAAAA CTTTCACCAT GATGAGTGAT ACGCTGAAAG TCACGCCGTA TGCCGGGGTG AAATTCCGTC ACACCATGGA AGGAGGCTAC AAAGAGCGCA GCGCCGGAGA TTTTAACCTG TCCATGAACT CTGGCAACGA AACGGCGGTA GACTCTATTG TCGGCCTGAA GCTGGATTAC GCCGGGAAAG ACGGCTGGAG CGCCACCGCT ACCCTGGAAG GCGGCCCGAA CCTGAGCTAC AGCAAGAGCC AGCGTACAGC GTCATTACAG GGTGCGGCGG GTCAATCGTT CGGCGTGGAT GACGGTCAAA AAGGCGGCGG CATTAATGGC CTGGCAACCA TCGGCGTGAA ATATAGCAGC AATGATACCG CGCTACATCT GGATGCATAC CAGTGGAAGG AAGACAGTAT CAGCGATAAA GGCTTTATGC TTAACGTTAA GAAAACATTT CGTTAA
|
Protein sequence | MQKKKLISIA IALTLQSYYI PAIAAENNDD EKECPSNISS LPKEKRAKLS PTCLATPEND NHWGWVAGGV AALVAGVAIG VENNGGGDSN HSYTPPKPDN GGDVTPPDDG GNVTPPDDGG DDNVTPPDDS GDDDVAPPDD SGDDDVTPPD DSGDDDVTPP DDSGDDDVTP PDDSGDDDVT PPDDSGDDDV TPPDDSGDDD VTPPDDSGDD DVTPPDDSGD DDVTPPDDSG DDDDTPPDDS VITFSNGVTI DKGKDTLTFD SFKLDNGSVL EGAVWNYSEQ DNQWQLTTAD GKTLNVTGWD VTDANAAVIE GTQENGLYWK YDSRGYLIIA DDNTTVISGD DQAHNSDRGM DISGQDRTGV IISGDRTVNT LTGDSSVTDG ATGMVISGDG TTNTISGHST VDNATGALIS GNGTTTNFAG DIAVSGGGTA IIIDGDNATI KNTGTSNISG AGSTGTVIDG NNARVNNDGD MTITDGGTGG HITGDNVVID NAGSTTVSGA DSTALYIDGD NALVINEGNQ TISGGAVGTR IDGDDAHTTN TGDIAVDGAG SAAVIINGDN GSLTQAGDLL VTDGAMGIIT YGTGNEAKNT GNATVRDADS VGFVVAGEKN TFKNKGDIDV SLNGTGALVS GDMSQVTLDG DINVVSVQDS EGVFSSATGV SVSGDSNAVD ITGNVNISAD YGQDDLAAGA PPLTGVVVGG NGNTVTLNGA LNIDDNDLSA TSGQYLDVVG LSVTGDDNDV EIGGGINITH SEDPLDGTSA DITGISVSGN STVTLNGHST IDTNTVVGGH VVLARVNNGG SLILGDDSVV DVNVSYIPTG YYTYNALLMA DGEGTSIENK GDITSHGVYS VIRADNGSEV SNSGDILVYA TSSNSSEDRA AITRASGEGS AVHNKASGDI TLISDQTPQG SGGIEVYPLK WYTHTFYAMM ASDYGDVVND EGATIHLQGA GVYGVTASRG KALNEGDIYL DGLVPTLDDE NNITSTSYWQ PSSLYLTSSG MVAGSTDADG DATAINTGNI TVNNAGFGMM ALNGGTAINQ GVITLTADDG VTGQADELVG MAALNGGVVI NDTSGVINID ADYGQAFLSD SSSYIINNGS INLNGSPMDD TDSHMGGTPT DKIWIQSLPG SGDSDTRTSD TGFFTAGTLA NYGTETLNGD VDVNGGWLYN EAGASLTVNG TVTINGGANA LANYGTLDAD AISTWHSLFN EADGSITTDL LTLNGDVTFY NNGDFTGSIA GTSYQQEIVN TGDMTVAEDG KSLVSGSFYF YNEEDATLTN SGSAVEGGEN TIINLTRAND SLTQVNSGTI TATNGYSAIT TVNGSNDPKW IWNTATGVIN GINPDAPLIN LGRGYNFGNQ GTINVQGDNA VAISGGTSSY VINLVNSGTI NVGTEQGKED GTNGTGLIGI KGNGNATTIN NTADGVINVY ADDSYAFGGK TKAIINNGEI NLLCDSGCDI YAPGTTGTQN DHNGTADIVI PDATTAPTEG SIPTPPADPN APQQLSNYIV GTNADGSSGT LKANNLVIGD NVKVDTGFTS GTADTTVVVD NAFTGSNIQG ADNITSTSVV WNAQGSQDAD GNVDVTMTKN AYADVATDSS VSDVAQALDA GYTNNELYTS LNVGTTAELN SALKQVSGAQ ATTVFREARV LSNRFTMLAD AAPQIKDGLA FNVVAKGDPR AELGNDTQYD MLALRQTLDL TASQNLTLEY GIARLDGDGS KTAGDNGLTG GYSQFFGLKH SMAFDEGLAW NNSLRYDVHN LDSSRSVAYG DVNKIADSDM RQQYLEFRSE GAKTFTMMSD TLKVTPYAGV KFRHTMEGGY KERSAGDFNL SMNSGNETAV DSIVGLKLDY AGKDGWSATA TLEGGPNLSY SKSQRTASLQ GAAGQSFGVD DGQKGGGING LATIGVKYSS NDTALHLDAY QWKEDSISDK GFMLNVKKTF R
|
| |