Gene RPD_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3547 
Symbol 
ID4024061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3942075 
End bp3943775 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content60% 
IMG OID637963751 
Productchemotaxis sensory transducer 
Protein accessionYP_570671 
Protein GI91978012 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTGGC TCAACAACGT AACAATCTCT CTAAAAATTG CGATCATCGT CGGCTTGTTA 
GCGATCGTTT GCCTGGGATC GATAAGTTTC GCGACCCTCC AGATGAAACG CATTGATGAT
TCGTACTCCG ATCTTGTCGG TCGCATCGAT AAGGGCACGG TCAGCCAGGT CCGCGCCGCC
CGGCAAGCAG AGGCCTTCGT ATCGGCGGCC TTTCAAATGG CGACTGAAAC AACCGATGAC
GGCGTGTCGC GACTCGCGGC GAGAGTCGCC GAAACCCAAC GAGATCTCGA GGCGCGCATG
GCGGACATCC TCGTCAAGCT GCCGGAAAAA TCAGCCGCGT ATGAGCCGAT TCTCGCTCAA
TACAAGCAAG CGTTTCTGGC TTGCGGGCCC GCGGTCGCAT TCGCGATGTC GACGCGCTCG
CCCGAAGACA ATATGAAGGC CGCCGAACGC CTGAAACTCG AGTGTGTACC GAGAATCGAA
GCCGCGACCC GTTCGCACGT AAAGATCAAT GACGGCATCA TTGCGTATGC CGAGAAGGCC
TCAGGCGACC TGACGGATGC GACCGATTCG ACCATTCAGA ATTGCCTGCT GTTCTCGATC
GCGGGCCTTT TGTTCGGCCT CATCGTCTCA CTGTGGATTG GCGTCAGGGG GCTATCGAAG
CCGATCGGCA AGCTGAAGGA AGTAATGGAA CTGTTCGCCA AGAACGATCT CAATGCCGAT
ATTCCGGGCA TCAAACGCGG CGACGAATTG GGCGAGATGG CCCGCGCCGT TGGAGTATTC
AAGGCAAGCG CCCTTGAAGT CGAGCGGCTG CGCTGCGGAC AGCAAGAAAG CGAACGCCGC
ATTGCGGAAG AGCGCAAGAC GGAGATGCGA AAGCTCGCCG ATGAATTCGA AGCAGCGGTC
GGTCAGATCG TCGAAACCGT GTCTTCCGCA TCGACCGAGT TGGAGGCTTC TGCGAGCACG
CTGACCAAGT CAGCGGAACG CACACAGGAG GTCACAACCA CAGTTGCAGC GGCGTCCGAA
GAAGCCTCGG CCAACGTTCA ATCAGTGGCC TCCGCGACCG AGGAAATGGC GTCGTCCATC
AATGAGATCA GCCGCCAGGT TCAGGAGTCG GCCCGGATTG CGAATGAAGC CGTTGACCAG
GCCCGCAAGA CGAACGATCG GGTCGGCGAT CTGGCCAAGG CGGCCAGCCG GATCGGGGAC
GTGGTCGATC TGATCAACAG CATTGCCGGT CAAACGAATC TGCTGGCGCT GAACGCAACG
ATCGAAGCCG CACGCGCCGG CGAAGCAGGT CGAGGATTCG CGGTGGTTGC ATCCGAGGTG
AAGGCGCTTG CCGAGCAGAC CGCGAAAGCC ACCGGGGAGA TCGGCCAGCA GATCACCGGA
ATGCAGGTGG CGACCGACGA CTCCGTCAGT GCGATCAAGG AGATTAGCGG GACGATCGCA
CGCATGTCGG AGATCTCGTC AACGATTGCT TCTGCGGTCG AGGAGCAGGG CGCGGCGACA
CAGGAGATTT CGCGCAACGT CCAACAGGCC GCACGGGGCA CGTCCGAGGT GTCGTCGAAC
ATCAGCGACG TGCGGAGTGG CGCCAGCGAG ACAGGCTCGG CCTCTTCGCA AGTTCTTTCG
GCCGCGCAAT CCTTGTCGCG CGACAGCAAC CAATTGAAAC TACAGGTCGT CAATTTCCTG
AGCAAGGTAC GCGCCGCCTA G
 
Protein sequence
MNWLNNVTIS LKIAIIVGLL AIVCLGSISF ATLQMKRIDD SYSDLVGRID KGTVSQVRAA 
RQAEAFVSAA FQMATETTDD GVSRLAARVA ETQRDLEARM ADILVKLPEK SAAYEPILAQ
YKQAFLACGP AVAFAMSTRS PEDNMKAAER LKLECVPRIE AATRSHVKIN DGIIAYAEKA
SGDLTDATDS TIQNCLLFSI AGLLFGLIVS LWIGVRGLSK PIGKLKEVME LFAKNDLNAD
IPGIKRGDEL GEMARAVGVF KASALEVERL RCGQQESERR IAEERKTEMR KLADEFEAAV
GQIVETVSSA STELEASAST LTKSAERTQE VTTTVAAASE EASANVQSVA SATEEMASSI
NEISRQVQES ARIANEAVDQ ARKTNDRVGD LAKAASRIGD VVDLINSIAG QTNLLALNAT
IEAARAGEAG RGFAVVASEV KALAEQTAKA TGEIGQQITG MQVATDDSVS AIKEISGTIA
RMSEISSTIA SAVEEQGAAT QEISRNVQQA ARGTSEVSSN ISDVRSGASE TGSASSQVLS
AAQSLSRDSN QLKLQVVNFL SKVRAA