Gene SeHA_C1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1594 
Symbol 
ID6488892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1544664 
End bp1548101 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content54% 
IMG OID642741817 
Productsex pilus assembly 
Protein accessionYP_002045462 
Protein GI194451592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACAA TTCACTCTAT CGGGGACTCG GCATTCTTGG AGCAAATCCT GATTGCGGTG 
TCCATGATCA CTGGTACCGG GGACTTCGAG AAAATGGTCA GCATTGGCCT GCTGCTCGGC
CTGCTCATCA TCTGTATCCA ATCAGTTTTT CAGGGAGCAA AACAAATCAA CTTCCAGCAG
GTTTTGCTGG GCTGGGTCAT TTATGCCTGC TCCTTTGGGC CAACCACGAC GGTTGCCATT
GAAGACGCCT ATACTGGCGA GGTTCGTGTT GTTGCCAACG TCCCGCTATT GGTTGGCTTT
GCGGGCGGGA TGATCTCCAA TGTGGGATAC ACCATTACCA ACCTCTTCGA GACCGGATAC
GGGGTCATCG TACCGAACGT TACGGAAAGC CACTTCTCGG AGACTTTAAA GCTGCTGAAT
GACGTTCGCA GAAGAGCTTA TGATACCGGA GTCTTCACCG CTCTCAATGC GGCTAATGGC
GGTGGATACG TCGATGTGCG CCTGTCCTGG AACAACTACA TCCGTGAATG CACACTGACC
AAGGTTGACC TTAAACTGAT GTCTCTGGAT GAACTTATGA ATCGTCCTAC CGAGACAGCG
CTTCGGTTTA ACTCCCAGCT CTATGGAACT CGCCTTTACC TTTCTACGGG TAATCCTGAT
GGGGCTGATT ACACCTGCAC TGACGGGTGG GTCGCCATCA GTAACGCCAC TGCAAATTTG
AATAGCCCTG TGGTGGTTGA CGCTTTGAAC AACTTGCTGG GAATTGATAC CAGTGCCGGT
GATGACGCGA TCACCAAAAT CGGCGATTCA CTGCAAGCTA TGGGAGCTTC GACAACATCA
TCCATCGATT ATTTGAAAGC AGCCGTGCTG GAACCCCTTT ACTACGAGGC TGCCGCTGGT
CGCTATCAGG ACTTACAGGA CTACGGCTCC GCGCTGATGA TTAACCAGGC CATTCAGCAG
CGGAATACCC AGTGGGCGGC GGAACAGTCC ATGTTCATGA CGGTGGTTCG CCCAATGTTG
ACCTTCTTCG AAGGGTTCAT CTACGCCATC ACTCCGATCA TTGCCTTTAT CATCGTAATG
GGGAGTTTTG GTCTCCAACT GGCGGGCAAA TATGTACAGA CCATCCTTTG GATTCAGCTT
TGGATGCCAG TTCTGTCCAT TATCAACTTG TTTGTTCACA CGGCGGCATC CAAGGAGATG
TCCAGCCTGA GTAGCTCCGG GCTGGACTCT ATGTATGCGC TGTCATCGAC AGGTGACGTG
CTGCAACACT GGATCGCAAC GGGTGGGATG TTGGCCGCTG CTACGCCGAT CATTTCCTTA
TTCATCGTCA CTGGTAGCAC CTATGCCTTT ACCAGCCTAG CCTCGCGTAT CAACGGCGGC
GACCACGTAA ACGAGAAAAT GCAAACGCCT GATCTTCTGC AACAAGATCC GATAATGAAA
GGTCAGCCAG CGTTCTCGTA TAACCAGTTC AGTGGTGCCA TGGCTTCAGG AGCTGAGTCG
CTTGTCGATA AGGTGAACAT CGGACAAACC CTGAGTTCAG TGGTTAGCAG CAGCCAGGCC
GCATCCGCCC AAGCTACGCG GAATTTCTCC GAACAGCTAT CAAACTCGGT TTTCTCCGGG
GCATCGGCCG AGCAGAGTTA CGCCAGGTTG CAGGGGCTGG GCCAAACCCT TAGGGCATCC
GATTCGAGTC AGGCTAAGGC AATTTACGGT CAGGCTCAGG ATTACGCACG GCAGTACGGA
TTGTCGGAAA CACATACTGA TGCCATAGCT GGTGCCATAG CTATGAAGGC TTCAGGATCT
GTTGATGCAG GTAAGCTTGC TGCGGTGATT GGAGGGCCAA TAGGTCGAAT TGCTGCTGCG
TCGGGAGCAG TTGGCAAAGA CGCGCCATTA AAAGTACAGG GAGATATTTC GGGGTCTGCT
GAGTCCAGGA CGACGGATCA GCGTCAGCAA CTGACTACGG ATCTAGACAA GCTGACAAAG
AATTTGGGCT TTTCCAAGGA TGATTCCGCA GCGTTGACCC AAGATCTGGC GCGACAAACC
AACACCGAGT CAGGCCAGCG GTTCACCAAT TCATTGGGTG AGGAACAACG CGAGCAACTT
TCCCAATCCG CGACACAAGC CGTTAGCGCT CAAAATACGT ATCAGCGCCT TTCCTCCGCT
CAAAGCTCAA TTGGCACGGC CAGTCAAATG GATATGCGGG CATTGGCCGC CTCTGCCGTT
GGTAGTCCAG CCGCAAGCAC CGTGTTACAG AATGGGATGC GTATGGCGTC GCCGGAGACT
CGTCAGGCCG CTGCCGAAAA AGCACGGTTC TATCAGGCAT TGGGAATGGA TTCCGCGCAG
GCGTTAGCCG GTGGGCAAAT TTACGCGCTT TTGAACTCCG GCAAGGGAGC CGAGCAGACT
GTTGCGGCTG AGGCTATCTC GGCGGCTACT GGTGGTGTCG TCACACAGGG TGTGGGGCGG
TTTAACGAGA ACCAGTTGCT TGTCCAGCAA GCTCCGGCAA TCACCGGCCT TTCCAACACC
CAGCGACTCC AAGACCCCAG TCGTATGAAC GAGAGGCAGC GTACCCAGGC CTATGGCATT
GTGCCTGGCG AAGAGGCGGT TTTCCAGCAG CATGACAAGA ATATGAGCCG AGTACAGGAC
GTTCATGCCA ATCAAAAGGA GTCGTTCCAG GATGAGCGAT TGGGGTCACT CAGAACGCAG
ATAATGAACT CCAACGTAGA ATCATCCACC GCATCCAATA TCTTTGCCTC GTCCGATGGG
GCTGGTCGAT TCTTCGACAA GATCGTTGGC GGCTCAAGGG CGGCCATGGA TGGATTCTCA
AATGACTTTT CCAACTCGAT GAACCAACTG GCACGGATGA CGCCCGAACA GCGCGATCAG
TTTATTGCCG ACGCACACAA GGGAGATCAG TACATCAAAG ATCAGTATGG CCTTCCAGGG
TTCCTGGCTA CTGGTGCGGC CAACCTGGGG AGGGACATTA TCGGAGCAGG AGTTTCCGGT
TTCTATGCAG CCAAGGAATG GCTCACCGGG AAATCGGACC TATCCGAGGC GGCCAAGGGC
ATGTCGGTTC GCGAGCGTGG TATGTTCTTC GCGGCAGCGT TTGCTAGTGC GAGCGAAGCA
GGCGCAGAGC ATGCAGAGGC CTTTGTTCAG CAGTACGGAG ATGAGTTTCG TAATTTGGCC
ATGCAGACCG CGATACAAGA GCATGGCCTC CATTCGGATG CGGGTGCACG GTTGTTTGCC
TCATCCATCT TGGGCGCAAG CGATGGAAAG GAACAGGAAT ATCGGGCCCA ACTCCGGAGT
GAAATGGGCG ATGACGCCTT GGCGAACAAA ACAGCGGATA TCATTGAGTC CGCCGCAGGG
GCCGGACGAG AGCAGGCCGG GGGGTATCTG GCTCCAGTTT CGCGCTATTT TGCAGTTAAG
CAAGGAGGTG GTTCATGA
 
Protein sequence
MFTIHSIGDS AFLEQILIAV SMITGTGDFE KMVSIGLLLG LLIICIQSVF QGAKQINFQQ 
VLLGWVIYAC SFGPTTTVAI EDAYTGEVRV VANVPLLVGF AGGMISNVGY TITNLFETGY
GVIVPNVTES HFSETLKLLN DVRRRAYDTG VFTALNAANG GGYVDVRLSW NNYIRECTLT
KVDLKLMSLD ELMNRPTETA LRFNSQLYGT RLYLSTGNPD GADYTCTDGW VAISNATANL
NSPVVVDALN NLLGIDTSAG DDAITKIGDS LQAMGASTTS SIDYLKAAVL EPLYYEAAAG
RYQDLQDYGS ALMINQAIQQ RNTQWAAEQS MFMTVVRPML TFFEGFIYAI TPIIAFIIVM
GSFGLQLAGK YVQTILWIQL WMPVLSIINL FVHTAASKEM SSLSSSGLDS MYALSSTGDV
LQHWIATGGM LAAATPIISL FIVTGSTYAF TSLASRINGG DHVNEKMQTP DLLQQDPIMK
GQPAFSYNQF SGAMASGAES LVDKVNIGQT LSSVVSSSQA ASAQATRNFS EQLSNSVFSG
ASAEQSYARL QGLGQTLRAS DSSQAKAIYG QAQDYARQYG LSETHTDAIA GAIAMKASGS
VDAGKLAAVI GGPIGRIAAA SGAVGKDAPL KVQGDISGSA ESRTTDQRQQ LTTDLDKLTK
NLGFSKDDSA ALTQDLARQT NTESGQRFTN SLGEEQREQL SQSATQAVSA QNTYQRLSSA
QSSIGTASQM DMRALAASAV GSPAASTVLQ NGMRMASPET RQAAAEKARF YQALGMDSAQ
ALAGGQIYAL LNSGKGAEQT VAAEAISAAT GGVVTQGVGR FNENQLLVQQ APAITGLSNT
QRLQDPSRMN ERQRTQAYGI VPGEEAVFQQ HDKNMSRVQD VHANQKESFQ DERLGSLRTQ
IMNSNVESST ASNIFASSDG AGRFFDKIVG GSRAAMDGFS NDFSNSMNQL ARMTPEQRDQ
FIADAHKGDQ YIKDQYGLPG FLATGAANLG RDIIGAGVSG FYAAKEWLTG KSDLSEAAKG
MSVRERGMFF AAAFASASEA GAEHAEAFVQ QYGDEFRNLA MQTAIQEHGL HSDAGARLFA
SSILGASDGK EQEYRAQLRS EMGDDALANK TADIIESAAG AGREQAGGYL APVSRYFAVK
QGGGS