Gene Information |
Locus tag | SeD_A1956 |
Symbol | |
ID | 6871032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1884884 |
End bp | 1886632 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642785076 |
Product | sensor-kinase |
Protein accession | YP_002215742 |
Protein GI | 198242180 |
COG category | [P] Inorganic ion transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG3221] ABC-type phosphate/phosphonate transport system, periplasmic component; [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
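
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1884884, 1886632)
print(length)                     # 1749
print(protein_length_aa(length))  # 582
```

Because the gene lies on the minus strand, the displayed gene sequence would correspond to the reverse complement of this coordinate range on the replicon; the length arithmetic is unaffected by strand.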
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0000433243 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTGGCGG CAGTAGGGCT ACTTTGTCAT GGCGCGTGGG CAGGGACGTG GAATATCGGT ATTTTGGCCA TGCGCGGCGA GGCGTCTACG CGTAGCCACT GGCAACCGTT GGCAAAGACA TTAAGCCAAC AGTTTCCAGG CGAAACCTTT CATATCCAGC CGCTGGATCT GCATCAAATG CAGGAAGCCG TTAACCAGGG AACCGTGCAG TTTGTGATAA CCAACCCGGC GCAATTTGTC CAACTGAACA GCCATGCGCC GCTGCGCTGG TTAGCTTCCC TGCGTTCCAC GCGCGATGGG AAAGCGGTGA GTAATGTTAT TGGCAGCGTG ATTTTGACCC GGCGCGATAG CGGCATCACC ACGGCGCATG ATCTCATCGG TAAGACAGTC GGCGCGATTG ATGCTCAGGC GTTTGGCGGC TATTTATTAG GCTATAAAGC GCTCAGCGAC GCGGGCTTAC GCCCGGAGCG CGATTTTCAT CTTCGTTTTA CCGGATTTCC TGGCGATGCC TTAGTCTATA TGCTGCGCGA AAAAGCGGTG CAGGCGGCAA TTGTGCCAGT GTGCCTGTTA GAAAATATGG ATCAGGAAGG ATTGATTAAT AAAAAGGACT TTATCGCGCT GCTTTCCCGA CCGACGCCAC TGCCTTGCTT AACCAGTACG CCGTTATATC CTGACTGGTC GTTCGCGGCG CTACCTGCGG TAAGCGATGC GCTGGCGGAT CGCGTAACGC GAGCGCTATT CAACGCGCCC GCCGCCGCGT CATTTCACTG GGGCGCGCCA GCGTCGACCA GTCAGGTGGA AGCTTTGCTG CGTGATGTTC GTCAGCACCC TCAGCAGCGC CGACTGTGGC TGGATGTCAA AAGTTGGTTA ATCCAGCACC AGCTAATGGT CGGTGGCGTG ATTCTGGCGT TCCTGTTGCT CACGCTCAAT TATATTTGGG TCATGCTGCT GGTGCGTCGA CGCGGAAAAC AACTGGAACG TAATAATGTA GTCCTTCATC AGCATGAGCG GGCGCTGGAA ACCGCCCGGC AAATGAGCGT GTTGGGTGAA ATGACCTCCG GGTTTGCCCA TGAGCTTAAT CAGCCGCTTT CCGCGATTCG ACATTATGCC CAGGGGTGCC TGATTCGACT GCGCGCTGCA GATGAACAGC ATCCCTTGCT GCCGGCGCTG GAGCAGATTG ACCAGCAGGC GCAACGCGGT GCGGATACTC TGCGTAACCT GCGTCACTGG GTCAGCCAGG CGCAGGGCAA CCCGGTGCTA ACCGAAGCGT GGAAGGCCAT AGCCATTCGC GAGGCGATTG ATCATGTCTG GCAATTGTTG CGTATGGCGC AACAGTTTCC GACAGTGACT CTGCATACCG AGGTTAGCGC TGCGCTGCGC GTAACGCTGC CGTCAGTGCT GCTGGAACAG GTGCTGGCGA ATATCATTCT TAATGCGGCT CAGGCGGGCG CCACCCATTT ATGGGTCGTT GCTGAACGCA CTGAACACGG CATCAGTATC GCTTTACAGG ATAACGCCGG GGGAATCGAT GAGGCGCTAT TACGTCAGGC GTTTCAGCCG TTTATGACCA CCCGCAAAGA GGGGATGGGC TTAGGGCTGG CGATTTGCCA GCGGCTGGTG CGGTATGGGC GGGGCGATAT CAGTATCAGG AACCAGACCG CGCCGGACGG TCTGTCGGGA ACGGTGGTCA CGATACATTT CTTACATGAA AATGGGGGCA GGGATGGCGA CAATTCATCT ACTGGATGA
|
Protein sequence | MLAAVGLLCH GAWAGTWNIG ILAMRGEAST RSHWQPLAKT LSQQFPGETF HIQPLDLHQM QEAVNQGTVQ FVITNPAQFV QLNSHAPLRW LASLRSTRDG KAVSNVIGSV ILTRRDSGIT TAHDLIGKTV GAIDAQAFGG YLLGYKALSD AGLRPERDFH LRFTGFPGDA LVYMLREKAV QAAIVPVCLL ENMDQEGLIN KKDFIALLSR PTPLPCLTST PLYPDWSFAA LPAVSDALAD RVTRALFNAP AAASFHWGAP ASTSQVEALL RDVRQHPQQR RLWLDVKSWL IQHQLMVGGV ILAFLLLTLN YIWVMLLVRR RGKQLERNNV VLHQHERALE TARQMSVLGE MTSGFAHELN QPLSAIRHYA QGCLIRLRAA DEQHPLLPAL EQIDQQAQRG ADTLRNLRHW VSQAQGNPVL TEAWKAIAIR EAIDHVWQLL RMAQQFPTVT LHTEVSAALR VTLPSVLLEQ VLANIILNAA QAGATHLWVV AERTEHGISI ALQDNAGGID EALLRQAFQP FMTTRKEGMG LGLAICQRLV RYGRGDISIR NQTAPDGLSG TVVTIHFLHE NGGRDGDNSS TG
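
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, using only the Python standard library, of how such a field might be cleaned, split into codons, and summarized as a GC percentage like the one reported under GC content. The helper names are hypothetical, and the demonstration uses only the first three blocks and the final block of the gene sequence rather than the full field. Note that the gene begins with GTG, an alternative initiation codon permitted by translation table 11 and read as methionine at the start position, which is why the protein sequence begins with M.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete triplets, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First three blocks and last block of the gene sequence shown above.
head = clean_sequence("GTGTTGGCGG CAGTAGGGCT ACTTTGTCAT")
tail = clean_sequence("ACTGGATGA")

print(codons(head)[0])   # GTG (start codon)
print(codons(tail)[-1])  # TGA (stop codon)
```

Running gc_percent on the complete cleaned gene sequence would give a value to compare against the GC content field; the result is not quoted here because it has not been recomputed for this record.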