Gene Rpal_4788 details

Gene Information

Locus tag: Rpal_4788
Symbol:
ID: 6412474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5153135
End bp: 5154820
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 67%
IMG OID: 642714666
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001993753
Protein GI: 192293148
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein; [COG4564] Signal transduction histidine kinase
TIGRFAM ID:
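
The start/end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, not stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive coordinate range (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5153135, 5154820)
print(length)                  # 1686
print(protein_length(length))  # 561
```

Both values agree with the Gene Length and Protein Length fields of the record.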


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAATTCA GCAATCTTCG TATTACGCCG AAGCTAGGCC TGCTGGTTGC AGGTACCATG 
ATCGGTCTGT GCGTGGCCGG TATCCTTGCC GGTGTGATGA TGCAGCGTGA GATGCTCAAC
GCTCGGTTCG AGCAGGCCAA AGCCATCACC GAGATGGGGC GCAACCTCGC GATCGGCCTG
CAGAAGAAGG TCGAGGTCGG CGAGCTGACC AAGGAGCAGG CCGTCGCCCA GTTCGCCCGC
GATGCCGCAA TGCTGACCTA CGACAACGGT CAGGGCTATC TGTTCGCCTA CACCATGGAC
GGCGTCACCA TCGCCACCCC GGACAAGAAG GCGATTGGTA CCAACCGGAT GAACACGCCG
ACCGGCGACC GCTATCTGGT GCGCGAGCTG CGCGACGGCG TCGCTACGAA GGGCGACGTC
ACGCTGTATT ATGATTTCCG CCGGCCCGGC ACCGAAGAGA ACATCCGCAA GATGTCCTAC
GGCCTGGCAA TTCCCGGCTG GGACATGTTT GTCGGCACCG GCGCCTATCT CGACGACGTC
GACGCCAAGC TGAAGCCGAT CTTCTGGACC CTCGGCGGCG CGATCTTCGC GATCGCCATC
GTCGCCGGCC TGTTCGCGGT GCTGATCGCC CGCGGCATCA CCCGCCCGCT CGCCAAGCTC
GGCGCCCGGA TGGATTCGCT GGCGCATGGC GAACTGGAGC AGCCGATCCC CGGCATCGAA
CGCGGCGATG AAGTCGGCGA GATGGCCAAG ACCGTGCAGG TGTTCAAGGA CAACGCACTG
CGGATTCGTG ACCTCGAGCG CGCCGAAGAG GCTGCCAAGG AGCACGCCGA AGCCGAGCGT
CGCGCCGCCA TGGAGCAGCT CGCCGACGAG TTCGAGCACA GCGTCAACGG CGTGGTGCAG
TCGGTCGCCA CCGCGACCTC CGGCATGCAG CAGACCGCGC AGTCGATGAC CGCGACCGCG
ACCGATGCCA GCGCCCGCGC TGCGACCGTG TCGTCGGCCT CGGCCAGCGC CTCCAACAAC
GTCAGCACGG TGGCATCGGC CGCCGAGGAG TTGTCGGCCT CGGTCACCGA GATCTCGCGC
CAGGTCGAGC AGTCGCGGGA GATCGCCGGC AAGGCGGTCG ACGACGCCGC GCTCACCAAC
CAGACCGTCA AGTTGTTGGC GACCGGCGCC GAGAAGATCG GCGAGGTGGT GCAGCTGATC
CACTCGATCG CGTCGCAGAC CAACCTGCTC GCGCTCAACG CCACCATCGA GGCCGCCCGC
GCCGGCGAAT CCGGCCGCGG CTTTGCGGTG GTCGCCAGCG AGGTGAAGGC GCTGGCCAGC
CAGACCGCGA AGGCGACCGA GGAAATCTCC GCGCAGGTCG CGGCGATGCA GTCCTCGACC
AACGACGCCG TACAGTCGAT CGGCGGCATC ACCGCGACGA TCGCGCAGAT GAGCGAGATC
ACCATGGCGA TCTCGACCGC GGTCGAGCAG CAGGGCGCGG CGACCCGCGA GATCGCCCGC
AACATCCAGT CGGTGGCGGC GGGATCGACC GAGATCAGCA TGCATATCGG CGGCGTCACC
GAAGCGGCGA GCGCCACCGG TTCGGCCGCC AGCCAGGTGC TCGCCAATGC CCGTGAACTC
GACGCCCAGT CCGGCGAACT CCGCCGCGCG GTCGACGACT TCCTCGGCAA GGTCCGCGCC
GCCTAA
 
Protein sequence
MKFSNLRITP KLGLLVAGTM IGLCVAGILA GVMMQREMLN ARFEQAKAIT EMGRNLAIGL 
QKKVEVGELT KEQAVAQFAR DAAMLTYDNG QGYLFAYTMD GVTIATPDKK AIGTNRMNTP
TGDRYLVREL RDGVATKGDV TLYYDFRRPG TEENIRKMSY GLAIPGWDMF VGTGAYLDDV
DAKLKPIFWT LGGAIFAIAI VAGLFAVLIA RGITRPLAKL GARMDSLAHG ELEQPIPGIE
RGDEVGEMAK TVQVFKDNAL RIRDLERAEE AAKEHAEAER RAAMEQLADE FEHSVNGVVQ
SVATATSGMQ QTAQSMTATA TDASARAATV SSASASASNN VSTVASAAEE LSASVTEISR
QVEQSREIAG KAVDDAALTN QTVKLLATGA EKIGEVVQLI HSIASQTNLL ALNATIEAAR
AGESGRGFAV VASEVKALAS QTAKATEEIS AQVAAMQSST NDAVQSIGGI TATIAQMSEI
TMAISTAVEQ QGAATREIAR NIQSVAAGST EISMHIGGVT EAASATGSAA SQVLANAREL
DAQSGELRRA VDDFLGKVRA A
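
The sequences above are printed in blocks of 10 characters separated by spaces and line breaks, so they need to be joined before any analysis. The following is a minimal Python sketch of that cleanup, with a simple GC-fraction calculation for the nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record
example = "GTGAAATTCA GCAATCTTCG"
seq = clean_sequence(example)
print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.4
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field (1686 bp), and the GC fraction can be compared against the GC content field.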