Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_A0117 |
Symbol | |
ID | 6487595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011081 |
Strand | - |
Start bp | 83040 |
End bp | 84722 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642740278 |
Product | type IVB pilus formation outer membrane protein, R64 PilN family |
Protein accession | YP_002043952 |
Protein GI | 194447207 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02520] type IVB pilus formation outer membrane protein, R64 PilN family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.909577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT CACACCAGCG TTCAATGAAG CTGGCGGTGC TCCCCTGCAT GATCGCCGTC GCTCTTTCCA TCTCCGGATG TACTTTCAGC GAAATCAACA AAATGCAGAA AAAAGCACAG GAAGACTCAG CACATGCACG GGAAAAGGTA TCAGCCCTTT CGGCCCGTAA ATCGCAGGCT CTTACCTGGC TCGATAATCA ATGGATAAAC CCTGTTCCGG TCGCTCAGGT ATCAAGAGAG AAAAAACAAA CAGCTCCGGC CTGCTACATC ACGCAGGCAA GAAAAGGAGA GATCACTCTG CAGGAACTGG GGCAACGTAT TACTGCCGTA TGCGGCATCC CTGTGATCAT CACGCCTGAC GCAGCCAATT CAACTCTTGA AGGAGGCGCT ACCCGCCAGA TGACAGGAAC ACTACCAGCA CCAGATGAAA ATGGGCGTCT ACCGTTAAGC AGCCTGGGCA GCACAACAAT GACTACCTCC ACTCAGCCAT TAACGCTGAA TAACCTCATG TGGCAGGGAG ATATCAATGG TCTTCTGGAT CTGATGGCCA GCCGGAGTGG TCTGTACTGG CGCATGGATA ATGGTCGGAT TGTATTCTAT CTGACTGAAA CCAGAACGTA TCCACTTCAT ATGCTGAACA CCAAAACCAG CAGCAGTTCC AGTGTCAGCT CTGGCTCAAC AAGCACAATG GGGGCAACAG GAGGCCAGGA TAACTCAGCA TCCGGTGATG CAACGTCCTC TCAGAGCACA ACCGTTGGTC AGGAATACGA TCTGTATGAA GACATCCGGA AAACTATTGA AGCAATGCTG ACACCAGAAA AAGGCCGTTA CTGGTTATCT GCATCGAGCT CAACGCTGAC TGTCACTGAT ACTCCAGCTG TCCAGGAAGC CGTCGCACGA TATGTGGACG AACAAAACAG TATTATGAAC CGCCAGGTAG CCCTGAACGT ACAGGTTCTG AGCGTCAGCA ATACCAGAAA CGAACAGTTC GGTCTGGACT GGAACCTTGT TTATAAATCG CTACATTCCG CCGGAGCAAC GTTGAACAAT GCAAGCGGAG ATTTTACAGG CGCTACATCT GCAGGCGTAT CAATTCTGGA TACGGCAACA GGGAATGCCG CCAAATTCAG CGGTTCCAGT CTTCTGATTA AAGCGCTGAG TGAACAGGGC GATGTCAGTG TTGTGACTTC ACAAGAAAGC ACTGTCACAA ACCTGACGCC GGTACCTATC CAGATGGCAG ATCAGACGGT TTACGTCGCC CAGTCAGCAA CAACAACGAC TACGGATGTA GGAGCAACAA CAACATTAAC GCCGGGCATG ATCACCACCG GATTCAATAT GACCCTGCTG CCTTTAATTC AGAAAACGGG CAATCTCCAG TTGCAGATGA ATTTTAATCT GTCAGATCCC CCAACAATCC GTAGCTTTAC GTCAAAAGAC GGAAACAGTT ACATCGAAAT GCCGTATACC AAACTGCGTT CACTGAGCCA GAAGGTCAAT CTGAAAGAAG GGCAATCACT TGTCGTTACT GGTTTCGATC AGAACAATAC GACGACAAGT AAAGCCGGTA CGTTTACGCC AGCAAATCCA TTATTTGGTG GTTCACAAAC CGGGAAAAAT GAACGCAGCA CGCTTGTAAT CATCATTACC CCGACTTTCC CGTCAGGAGG CAACAATGGC TGA
|
Protein sequence | MKKSHQRSMK LAVLPCMIAV ALSISGCTFS EINKMQKKAQ EDSAHAREKV SALSARKSQA LTWLDNQWIN PVPVAQVSRE KKQTAPACYI TQARKGEITL QELGQRITAV CGIPVIITPD AANSTLEGGA TRQMTGTLPA PDENGRLPLS SLGSTTMTTS TQPLTLNNLM WQGDINGLLD LMASRSGLYW RMDNGRIVFY LTETRTYPLH MLNTKTSSSS SVSSGSTSTM GATGGQDNSA SGDATSSQST TVGQEYDLYE DIRKTIEAML TPEKGRYWLS ASSSTLTVTD TPAVQEAVAR YVDEQNSIMN RQVALNVQVL SVSNTRNEQF GLDWNLVYKS LHSAGATLNN ASGDFTGATS AGVSILDTAT GNAAKFSGSS LLIKALSEQG DVSVVTSQES TVTNLTPVPI QMADQTVYVA QSATTTTTDV GATTTLTPGM ITTGFNMTLL PLIQKTGNLQ LQMNFNLSDP PTIRSFTSKD GNSYIEMPYT KLRSLSQKVN LKEGQSLVVT GFDQNNTTTS KAGTFTPANP LFGGSQTGKN ERSTLVIIIT PTFPSGGNNG
|
| |