Gene SeHA_A0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_A0117 
Symbol 
ID6487595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011081 
Strand
Start bp83040 
End bp84722 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content49% 
IMG OID642740278 
Producttype IVB pilus formation outer membrane protein, R64 PilN family 
Protein accessionYP_002043952 
Protein GI194447207 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02520] type IVB pilus formation outer membrane protein, R64 PilN family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.909577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT CACACCAGCG TTCAATGAAG CTGGCGGTGC TCCCCTGCAT GATCGCCGTC 
GCTCTTTCCA TCTCCGGATG TACTTTCAGC GAAATCAACA AAATGCAGAA AAAAGCACAG
GAAGACTCAG CACATGCACG GGAAAAGGTA TCAGCCCTTT CGGCCCGTAA ATCGCAGGCT
CTTACCTGGC TCGATAATCA ATGGATAAAC CCTGTTCCGG TCGCTCAGGT ATCAAGAGAG
AAAAAACAAA CAGCTCCGGC CTGCTACATC ACGCAGGCAA GAAAAGGAGA GATCACTCTG
CAGGAACTGG GGCAACGTAT TACTGCCGTA TGCGGCATCC CTGTGATCAT CACGCCTGAC
GCAGCCAATT CAACTCTTGA AGGAGGCGCT ACCCGCCAGA TGACAGGAAC ACTACCAGCA
CCAGATGAAA ATGGGCGTCT ACCGTTAAGC AGCCTGGGCA GCACAACAAT GACTACCTCC
ACTCAGCCAT TAACGCTGAA TAACCTCATG TGGCAGGGAG ATATCAATGG TCTTCTGGAT
CTGATGGCCA GCCGGAGTGG TCTGTACTGG CGCATGGATA ATGGTCGGAT TGTATTCTAT
CTGACTGAAA CCAGAACGTA TCCACTTCAT ATGCTGAACA CCAAAACCAG CAGCAGTTCC
AGTGTCAGCT CTGGCTCAAC AAGCACAATG GGGGCAACAG GAGGCCAGGA TAACTCAGCA
TCCGGTGATG CAACGTCCTC TCAGAGCACA ACCGTTGGTC AGGAATACGA TCTGTATGAA
GACATCCGGA AAACTATTGA AGCAATGCTG ACACCAGAAA AAGGCCGTTA CTGGTTATCT
GCATCGAGCT CAACGCTGAC TGTCACTGAT ACTCCAGCTG TCCAGGAAGC CGTCGCACGA
TATGTGGACG AACAAAACAG TATTATGAAC CGCCAGGTAG CCCTGAACGT ACAGGTTCTG
AGCGTCAGCA ATACCAGAAA CGAACAGTTC GGTCTGGACT GGAACCTTGT TTATAAATCG
CTACATTCCG CCGGAGCAAC GTTGAACAAT GCAAGCGGAG ATTTTACAGG CGCTACATCT
GCAGGCGTAT CAATTCTGGA TACGGCAACA GGGAATGCCG CCAAATTCAG CGGTTCCAGT
CTTCTGATTA AAGCGCTGAG TGAACAGGGC GATGTCAGTG TTGTGACTTC ACAAGAAAGC
ACTGTCACAA ACCTGACGCC GGTACCTATC CAGATGGCAG ATCAGACGGT TTACGTCGCC
CAGTCAGCAA CAACAACGAC TACGGATGTA GGAGCAACAA CAACATTAAC GCCGGGCATG
ATCACCACCG GATTCAATAT GACCCTGCTG CCTTTAATTC AGAAAACGGG CAATCTCCAG
TTGCAGATGA ATTTTAATCT GTCAGATCCC CCAACAATCC GTAGCTTTAC GTCAAAAGAC
GGAAACAGTT ACATCGAAAT GCCGTATACC AAACTGCGTT CACTGAGCCA GAAGGTCAAT
CTGAAAGAAG GGCAATCACT TGTCGTTACT GGTTTCGATC AGAACAATAC GACGACAAGT
AAAGCCGGTA CGTTTACGCC AGCAAATCCA TTATTTGGTG GTTCACAAAC CGGGAAAAAT
GAACGCAGCA CGCTTGTAAT CATCATTACC CCGACTTTCC CGTCAGGAGG CAACAATGGC
TGA
 
Protein sequence
MKKSHQRSMK LAVLPCMIAV ALSISGCTFS EINKMQKKAQ EDSAHAREKV SALSARKSQA 
LTWLDNQWIN PVPVAQVSRE KKQTAPACYI TQARKGEITL QELGQRITAV CGIPVIITPD
AANSTLEGGA TRQMTGTLPA PDENGRLPLS SLGSTTMTTS TQPLTLNNLM WQGDINGLLD
LMASRSGLYW RMDNGRIVFY LTETRTYPLH MLNTKTSSSS SVSSGSTSTM GATGGQDNSA
SGDATSSQST TVGQEYDLYE DIRKTIEAML TPEKGRYWLS ASSSTLTVTD TPAVQEAVAR
YVDEQNSIMN RQVALNVQVL SVSNTRNEQF GLDWNLVYKS LHSAGATLNN ASGDFTGATS
AGVSILDTAT GNAAKFSGSS LLIKALSEQG DVSVVTSQES TVTNLTPVPI QMADQTVYVA
QSATTTTTDV GATTTLTPGM ITTGFNMTLL PLIQKTGNLQ LQMNFNLSDP PTIRSFTSKD
GNSYIEMPYT KLRSLSQKVN LKEGQSLVVT GFDQNNTTTS KAGTFTPANP LFGGSQTGKN
ERSTLVIIIT PTFPSGGNNG