Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1527 |
Symbol | |
ID | 6488868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 1482351 |
End bp | 1485113 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642741750 |
Product | secretion system regulator:Sensor component |
Protein accession | YP_002045397 |
Protein GI | 194447656 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG3437] Response regulator containing a CheY-like receiver domain and an HD-GYP domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.0140135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGC TCAATCTCAA GAATACGCTG CAAACATCTT TAGTAATCAG GCTAACTTTT TTATTTTTAT TAACAACAAT AATTATTTGG CTGCTATCTG TGCTTACCGC AGCTTATATA TCAATGGTTC AGAAACGGCA GCATATAATA GAGGATTTAT CCGTTCTATC CGAGATGAAT ATTGTACTAA GCAATCAACG GTTTGAAGAG GCTGAACGTG ACGCTAAAAA TTTAATGTAT CAATGCTCAT TAGCGACTGA GATTCATCAT AACGATATTT TCCCTGAGGT GAGCCGGCAT CTATCTGTCG GACCTTCAAA TTGCACGCCG ACGCTAAACG GAGAGAAGCA CCGTCTCTTT CTGCAGTCCT CTGATATCGA TGAAAATAGC TTTCGTCGCG ATAGTTTTAT TCTTAATCAT AAAAATGAGA TTTCGTTATT ATCTACTGAT AACCCTTCAG ATTATTCAAC TCTACAGCCT TTAACGCGAA AAAGCTTTCC TTTATACCCA ACCCATGCCG GGTTTTACTG GAGTGAACCA GAATACATAA ACGGCAAAGG ATGGCACGCT TCCGTTGCGG TTGCCGATCA GCAAGGCGTA TTTTTTGGGG TGACGGTTAA ACTTCCCGAT CTCATTACTA AGAGCCACCT GCCATTAGAT GATAGTATTC GAGTATGGCT GGATCAAAAC AACCACTTAT TGCCGTTTTC ATACATCCCG CAAAAAATAC GTACACAGTT AGAAAATGTA ACGCTGCATG ATGGATGGCA GCAAATTCCC GGATTTCTGA TATTACGCAC AACCTTGCAT GGCCCCGGAT GGAGTCTGGT TACGCTGTAC CCATACGGTA ATCTACATAA TCGCATCTTA AAAATTATCC TTCAACAAAT CCCCTTTACA TTAACAGCAT TGGTGTTGAT GACGTCGGCT TTTTGCTGGT TACTACATCG CTCACTGGCC AAACCGTTAT GGCGTTTTGT CGATGTCATT AATAAAACCG CAACTGCACC GCTGAGCACA CGTTTACCAG CACAACGACT GGATGAATTA GATAGTATTG CCGGTGCTTT TAACCAACTG CTTGATACTC TACAAGTCCA ATACGACAAT CTGGAAAACA AAGTCGCAGA GCGCACCCAG GCGCTAAATG AAGCAAAAAA ACGCGCTGAG CGAGCTAACA AACGTAAAAG CATTCATCTT ACGGTAATAA GTCATGAGTT ACGTACTCCG ATGAATGGCG TACTCGGTGC GATTGAATTA TTACAAACCA CCCCTTTAAA CATAGAGCAG CAAGGATTAG CTGATACCGC CAGAAATTGT ACACTGTCTT TGTTAGCTAT TATTAATAAT CTGCTGGATT TTTCACGCAT CGAGTCTGGT CATTTCACAT TACATATGGA AGAAACAGCG TTACTGCCGT TACTGGACCA GGCAATGCAA ACCATCCAGG GGCCGGCGCA AAGCAAAAAA CTGTCATTAC GTACTTTTGT CGGTCAACAT GTCCCCCTCT ATTTTCATAC CGACAGTATC CGTTTACGGC AAATTTTGGT TAATTTACTC GGGAACGCGG TAAAATTTAC CGAAACCGGA GGGATACGTC TGACGGTCAA GCGTCATGAG GAACAATTAA TATTTCTGGT TAGCGATAGC GGTAAAGGGA TTGAAATACA GCAGCAGTCT CAAATCTTTA CTGCTTTTTA TCAAGCAGAC ACAAATTCGC AAGGTACAGG AATTGGACTG ACTATTGCGT CAAGCCTGGC TAAAATGATG GGAGGTAATC TGACACTAAA AAGTGTCCCC GGGGTTGGAA CCTGTGTCTC GCTAGTATTA CCCTTACAAG AATACCAGCC GCCTCAACCA ATTAAAGGGA CACTATCAGC GCCGTTCTGC CTGCATCGGC AACTGGCTTG CTGGGGAATA CGCGGTGAAC CACCCCACCA GCAAAATGCG CTTCTCAACG CAGAGCTTTT GTATTTCCCC GGAAAACTCT ACGACCTGGC GCAACAGTTA ATATTGTGTA CACCAAATAT GCCAGTAATA AATAATTTGT TACCACCCTG GCAGTTGCAG ATTCTTTTGG TTGATGATGC CGATATTAAT CGGGATATCA TCGGCAAAAT GCTTGTCAGC CTGGGCCAAC ACGTCACTAT TGCCGCCAGT AGTAACGAGG CTCTGACTTT ATCACAACAG CAGCGATTCG ATTTAGTACT GATTGACATT AGAATGCCAG AAATAGATGG TATTGAATGT GTACAATTAT GGCATGATGA GCCGAATAAT TTAGATCCTG ACTGCATGTT TGTGGCGCTA TCCGCTAGCG TAGCGACAGA AGATATTCAT CGTTGTAAAA AAAATGGGAT TCATCATTAC ATTACCAAAC CAGTGACATT GGCTACCTTA GCTCGCTATA TCAGTATTGC CGCAGAATAC CAACTTTTAC GAAATATAGA GCTACAGGAG CAGGATCCAA GTCGCTGCTC AGCGCTACTG GCGACAGATG ATATGGTCAT TAATAGCAAG ATTTTCCAAT CACTGGACCT CTTGCTGGCT GATATTGAAA ATGCCGTATC GGCTGGACAA AAAATCGATC AGTTAATTCA CACATTAAAA GGCTGTTTAG GTCAAATAGG GCAGACTGAA TTGGTATGCT ATGTCATAGA CATTGAGAAT CGCGTAAAAA TGGGGAAAAT CATCGCGCTG GAGGAACTAA CCGACTTACG CCAGAAAATA CGTATGATCT TCAAAAACTA CACCATTACT TAA
|
Protein sequence | MNLLNLKNTL QTSLVIRLTF LFLLTTIIIW LLSVLTAAYI SMVQKRQHII EDLSVLSEMN IVLSNQRFEE AERDAKNLMY QCSLATEIHH NDIFPEVSRH LSVGPSNCTP TLNGEKHRLF LQSSDIDENS FRRDSFILNH KNEISLLSTD NPSDYSTLQP LTRKSFPLYP THAGFYWSEP EYINGKGWHA SVAVADQQGV FFGVTVKLPD LITKSHLPLD DSIRVWLDQN NHLLPFSYIP QKIRTQLENV TLHDGWQQIP GFLILRTTLH GPGWSLVTLY PYGNLHNRIL KIILQQIPFT LTALVLMTSA FCWLLHRSLA KPLWRFVDVI NKTATAPLST RLPAQRLDEL DSIAGAFNQL LDTLQVQYDN LENKVAERTQ ALNEAKKRAE RANKRKSIHL TVISHELRTP MNGVLGAIEL LQTTPLNIEQ QGLADTARNC TLSLLAIINN LLDFSRIESG HFTLHMEETA LLPLLDQAMQ TIQGPAQSKK LSLRTFVGQH VPLYFHTDSI RLRQILVNLL GNAVKFTETG GIRLTVKRHE EQLIFLVSDS GKGIEIQQQS QIFTAFYQAD TNSQGTGIGL TIASSLAKMM GGNLTLKSVP GVGTCVSLVL PLQEYQPPQP IKGTLSAPFC LHRQLACWGI RGEPPHQQNA LLNAELLYFP GKLYDLAQQL ILCTPNMPVI NNLLPPWQLQ ILLVDDADIN RDIIGKMLVS LGQHVTIAAS SNEALTLSQQ QRFDLVLIDI RMPEIDGIEC VQLWHDEPNN LDPDCMFVAL SASVATEDIH RCKKNGIHHY ITKPVTLATL ARYISIAAEY QLLRNIELQE QDPSRCSALL ATDDMVINSK IFQSLDLLLA DIENAVSAGQ KIDQLIHTLK GCLGQIGQTE LVCYVIDIEN RVKMGKIIAL EELTDLRQKI RMIFKNYTIT
|
| |