Gene RPB_3522 details

Gene Information

Locus tag: RPB_3522
Symbol:
ID: 3911324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4030345
End bp: 4031760
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 68%
IMG OID: 637885424
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_487128
Protein GI: 86750632
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
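
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length is consistent with that) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4030345
end_bp = 4031760
protein_length_aa = 471

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length fields.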


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.392623
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.542302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGGTC CCATGTCGAT TCTATCGGTT GCACGCAAGG CGTTGCGCGA CGGCAGCGAC 
GAAACCTCGA CCGGCGCGGC GACACCGGCC GCTGACCTGG TAAACATACG AGCCCGCGAC
GACGCGATCG CCGCCTGTCT GCGCCGGATC GCCGAAGGCG ATTACGAGGT CGTGCTGCCG
GCGGGCGACG ATCCGCTGTC GCAGGCGGTC GGCGCGCTGT TGCAGCGGCT GTCGGGCAAC
GCCTCCCGCA ATCTCGACCG CATGGTCGAC CTCAGCATCC AGGGCAGCGA GACGGCGATG
TCGTCCGCTT ATTTGCTGTC GTCGACCCGC GAGATCGACC AGCGCACCCA GGCGCTGGCG
AGCGCGAGCG AGGAGATGGT GGCGTCGATC GGCCAGATCC GCGCCACCGC CCAGGCCGCT
GCGACCGAAG CGACCGAGAT GCAGATCAGC GCTGATCGCG GCATGACGAC GGCGAACTCG
GCCTCCGCCG CGATGGGGCG GGTCAGCACC ACCGCCGAGC TGGCCTCGGC GAAGATCACC
GCGCTCAGCG AAGCCTCCGA AGCGATCGGC AGCATCGTCG GGTCGATCGA CGCCATCGCC
CGGCAGACCA ACCTGCTGGC GCTCAACGCC ACCATCGAGG CGGCGCGCGC CGGCGAGGCC
GGCCGCGGCT TCGCCGTGGT CGCCACCGAG GTCAAGAGCC TGTCGCAGCA GACCTCGAAC
GCGACCGTCG ACATCCGCAG CCGGATCGAC CGGCTGCGCG AGGACATCGC CACGATCGTC
GCCGCGATGG CGGACTGCAC CGGCGCCGCG GTCGAAAGCC GCGAGGTGGT CAACACGCTG
GGCGAAGCGA TGGCCGGCGT GTCCCGGCGC GTCACCGGCG TCACCGACGG CATGTCGGAG
ATCGCCACCA TCCTCAATCA GCAATCCGAA GCCTCGCGCG AGATCGCGAC CGGCATTTCG
GCGATCGCCG AGATGACCAA GAACAGCGTC GGCCAGGTCG GCGACATCTC CGACCAGCTC
GATCACGTGC AGTCGCTGGT CGATAGCGAA TTGTCGGAAC TGTCGCGCAT GACGTTCGAC
GGTCTGATCG AGCGCCTCGC CAAGGCCGAT CACATCACCT GGAAGAAGAA GCTCTGCGAC
ATGGCGGTGG GCCGCGCCAA GCTCAACGCC GACGAACTCA CCGACCACCA TTCCTGCCGG
CTCGGCAAAT GGTACTACGG CGACGGCTCG CTGCAGTCGC GCAACGCCCC GGCCTTCCGG
GCGCTGGAGA AACCGCACGC GCTGGTGCAC GATCACGGCA AGAAGGCCGC GCGGCTGTTC
CAGTCGGGCG ACCTCGCCGG CGCGATCGCC GAGATCGAAT GCGTCGGCGA CGCATCCAAG
GACGTGCTGC GGCTGCTCGA CGACCTGGTC AAGTAA
 
Protein sequence
MRGPMSILSV ARKALRDGSD ETSTGAATPA ADLVNIRARD DAIAACLRRI AEGDYEVVLP 
AGDDPLSQAV GALLQRLSGN ASRNLDRMVD LSIQGSETAM SSAYLLSSTR EIDQRTQALA
SASEEMVASI GQIRATAQAA ATEATEMQIS ADRGMTTANS ASAAMGRVST TAELASAKIT
ALSEASEAIG SIVGSIDAIA RQTNLLALNA TIEAARAGEA GRGFAVVATE VKSLSQQTSN
ATVDIRSRID RLREDIATIV AAMADCTGAA VESREVVNTL GEAMAGVSRR VTGVTDGMSE
IATILNQQSE ASREIATGIS AIAEMTKNSV GQVGDISDQL DHVQSLVDSE LSELSRMTFD
GLIERLAKAD HITWKKKLCD MAVGRAKLNA DELTDHHSCR LGKWYYGDGS LQSRNAPAFR
ALEKPHALVH DHGKKAARLF QSGDLAGAIA EIECVGDASK DVLRLLDDLV K
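
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that excerpt rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGCGAGGTC CCATGTCGAT TCTATCGGTT GCACGCAAGG CGTTGCGCGA CGGCAGCGAC"

excerpt = clean_sequence(first_line)
excerpt_gc = gc_fraction(excerpt)

print(len(excerpt), round(excerpt_gc, 3))
```

Passing the full gene sequence through the same two helpers would give a value to compare with the GC content field in the record.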