Gene Information |
Locus tag | Rpal_1872 |
Symbol | |
ID | 6409531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2012593 |
End bp | 2015406 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642711760 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_001990873 |
Protein GI | 192290268 |
COG category | [N] Cell motility; [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases; [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
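The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2012593
end_bp = 2015406
protein_length_aa = 937

# Assumption: coordinates are 1-based and inclusive, so the span length
# is the difference plus one.
gene_length_bp = end_bp - start_bp + 1

# Each codon is three bases. The last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp)  # 2814, matching "Gene Length"
print(coding_codons)   # 937, matching "Protein Length"
```

Under these assumptions the 2814 bp gene is an intact open reading frame of 938 codons, which agrees with the "Is pseudo gene | No" field.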
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.134352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATC TTCTTCGTGA GTTTTTGACG GAGACCTTCG AGAGCCTGGA CACGGTTGAC AACCAGTTGG TCCGGTTTGA GCAGGAGCCG AACAACGCGA AGATATTGGA CAATATTTTT CGTCTTGTTC ACACCATCAA GGGAACGTGC GGGTTTCTAG GGTTGCCGCG GCTTGAAGCG CTTGCGCACG CGGCCGAGAC CCTGATGGGC AAATTCCGGG ACGGAATGCC GGTGACGGGG GAGGCGGTGA CGCTGATCCT GACCACGATC GACCGGATCA AGGACATTCT GACCCAGCTG GAGGCGACCC AGGCCGAGCC CGAGGGCGAG GACGGCGACC TGATCGGGGA GCTGGAGCGG CTGTCGATGC GCTCGCCGGA AGAGATCGCG GCCGAGCTCG GCGGCGCTGC GCCGGTGGAG GTTGCCGAAG TCGAAGCCCC TGCCGAAGCT GTTGTGGCCG AGACGGCCGA CGCCAATTCG ACCGAAGGCA CCCTGGTGGC GCAGACGCTG GAGCGTCCGC TGCGGCCGGG TGAAGTGTCG CTGGACGAGC TTGAGCGCGC CTTCCGCGAG ACCGAGATCG AGATGGCCTC GCCGCCGCTG CAGCCCGCCG TGAGCGAAGC TCCGGCTGCT GTGGCTGAAG CCGCGCCGCC TGAACCGAAG CCGGCCAAGC CCGCCAAACC GGCCGCCAAG CCGGCGGCGA AGAAGTCCGG CGGCGAAGGC GAGGGCGCAG CTGAAGGTGG GGCCGCTGGC GGCGTCGCCA ACCAGTCGAT CCGCGTCAAC GTCGATACCC TCGAACACCT GATGACGATG GTGTCGGAGC TGGTGCTGAC CCGTAACCAG CTGCTCGAGA TCAGCCGCCG CCACGAGGAC AACGAGTTCA AGGTGCCGCT GCAGCGGCTC TCCACCGTCA CCGCCGAGCT GCAGGACGGG GTGATGAAGA CCCGGATGCA GCCGATCGGC AACGCCTGGC AGAAGCTGCC GCGGATCGTG CGCGATCTGG CCGCCGAACT CGGCAAGCAG ATCGAGCTGG AGATGCACGG TGCCGACACC GAGCTCGACC GCCAGGTGCT CGACCTGATC AAGGATCCGC TCACCCATAT GGTGCGCAAC TCCGCCGACC ACGGGCTGGA GAAGCCCGAG GACCGGGCGC GCGCCGGCAA GCCCGAGCAG GGCACCATCC GCCTGTCCGC CTATCACGAG GGCGGCCACA TCGTGATCTG CATCGCCGAC AACGGCCGCG GGCTGGACAC CGAACGGATC AAGGCCAAGG CCTTGGCCAA CGGGCTGGTC ACCGAGGCCG AACTCGAGAA GATGACCGAG GCGCAGATCC ACAAGTTCAT CTTCGCGCCG GGCTTCTCGA CCGCCGCCGC CGTCACCTCG GTGTCCGGCC GCGGCGTCGG CATGGACGTG GTGCGCACCA ATATCGACCA GATCGGCGGC ACGATTGAAG TGAAGTCGGT CGCGGGCGAA GGCTCGGCCA TCACCATCAA GATCCCGCTC ACCCTGGCGA TCGTCTCGGC GCTGATTGTC GAAGCCGGCG GCGACCGGTT CGCGATCCCG CAGCTCGCGG TGGTCGAGCT GGTGCGGGCA CGGGCCAACT CCGAGCACCG CATCGAGCGG ATCAAGGATA CGCCGGTCCT CAGACTGCGC GACAAGCTGC TGCCGCTGAT CCACCTGAAG AAGCTGCTCG GCATCGACGA GGGCGCCAAC AGCGAGCCGG AGAACGGCTT CATCGTGGTG ACCCAGGTCG GCAGCCAGAC CTTCGGCATC GTGGTCGACG GCGTGTTCCA CACCGAAGAA 
ATCGTCGTCA AGCCGATGTC GACCAAGCTG CGTCACATCG GAATGTTCTC GGGCAACACC ATCCTGGGCG ACGGCGCGGT GATCATGATC GTCGATCCGA ACGGGATCGC GCAGGCGCTC GGCACCGCGG TGTCGGCGCA GCACGATATC TCCGACCAGG CGGCGGCGAG CCGCAACGCC TCGGCCGAAC AGCTCACCTC GCTGCTGGTG TTCCGCGCCG GCTCGAGCCA GCCGAAGGCG GTGCCGCTGT CGCTGGTGAC GCGCCTGGAA GAGATCGCCT CCGACAAGAT CGAGATGTCG AACGGCCGCT ACATGGTGCA GTACCGCGAC CAGCTGATGC CGCTCGTGCT GATGGAAGGC GTCGAGGTCG CCACCAGCGG CGTGCAGCCG ATCCTGGTGT TCGCCGACGA GGACCGGTCG ATGGGCCTTG TGGTCGACGA GATCGTCGAC ATCGTCGAGG AGCATCTGCA CATCCAGGTC GGCTCCAGCC GCGAGGGCAT TCTCGGCTCT GCGGTGATCA AGGGCCAGGC CACCGAGGTG ATCGACGTCG CGCACTTCCT GCCGATGGCG TTCTCCGACT GGCTGGCGCG CAAGGAGATG AAGCAGTCGC TGACCACCCG CTCGGTGCTG CTGGTCGATG ACTCGGCGTT CTTCCGCAAC ATGCTGGGTC CGGTGCTGAA GGCGGCGGGC TACAAGGTGC GGGTCGCGAC CTCGGCGGTC GAGGGCCTGT CGGTGCTGCG CTCGGGTGCG CAGTTCGACG TGATCCTGAC CGACATCGAG ATGCCGGAGA TGAACGGCTT CGAGTTCGCC GAGGCGATCC GCTCCGACAC GAAGATGTCG AACCTGCCGG TGATCGCGCT GAGTTCGCTG GTGTCGCCGG CGGCGATCGA GCGCGGCCGC CAGGCCGGTC TGACCGACTA CATCGCCAAG TTCGATCGGC CCGGCCTGAT CGCTGCGCTG AAGGAGCAGA CCACGATGCA TGCGACGCCC GAAGTGCTGG AGCAGGCGGC ATGA
|
Protein sequence | MDDLLREFLT ETFESLDTVD NQLVRFEQEP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA LAHAAETLMG KFRDGMPVTG EAVTLILTTI DRIKDILTQL EATQAEPEGE DGDLIGELER LSMRSPEEIA AELGGAAPVE VAEVEAPAEA VVAETADANS TEGTLVAQTL ERPLRPGEVS LDELERAFRE TEIEMASPPL QPAVSEAPAA VAEAAPPEPK PAKPAKPAAK PAAKKSGGEG EGAAEGGAAG GVANQSIRVN VDTLEHLMTM VSELVLTRNQ LLEISRRHED NEFKVPLQRL STVTAELQDG VMKTRMQPIG NAWQKLPRIV RDLAAELGKQ IELEMHGADT ELDRQVLDLI KDPLTHMVRN SADHGLEKPE DRARAGKPEQ GTIRLSAYHE GGHIVICIAD NGRGLDTERI KAKALANGLV TEAELEKMTE AQIHKFIFAP GFSTAAAVTS VSGRGVGMDV VRTNIDQIGG TIEVKSVAGE GSAITIKIPL TLAIVSALIV EAGGDRFAIP QLAVVELVRA RANSEHRIER IKDTPVLRLR DKLLPLIHLK KLLGIDEGAN SEPENGFIVV TQVGSQTFGI VVDGVFHTEE IVVKPMSTKL RHIGMFSGNT ILGDGAVIMI VDPNGIAQAL GTAVSAQHDI SDQAAASRNA SAEQLTSLLV FRAGSSQPKA VPLSLVTRLE EIASDKIEMS NGRYMVQYRD QLMPLVLMEG VEVATSGVQP ILVFADEDRS MGLVVDEIVD IVEEHLHIQV GSSREGILGS AVIKGQATEV IDVAHFLPMA FSDWLARKEM KQSLTTRSVL LVDDSAFFRN MLGPVLKAAG YKVRVATSAV EGLSVLRSGA QFDVILTDIE MPEMNGFEFA EAIRSDTKMS NLPVIALSSL VSPAAIERGR QAGLTDYIAK FDRPGLIAAL KEQTTMHATP EVLEQAA
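The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short sketch. It assumes the sequence uses only the bases A, C, G, and T, and it relies on NCBI translation table 11 assigning the same amino acids as the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and exist only for this illustration.

```python
# Amino acids of NCBI translation table 11, listed in the conventional
# codon order where each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon.

    Raises ValueError if a base other than T, C, A, or G is encountered.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence listed above, in the record's spacing.
prefix = "ATGGACGATC TTCTTCGTGA G"
print(translate(prefix))  # MDDLLRE, the first seven residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and the reported GC content to be checked in the same way.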