Gene RPD_3678 details


Gene Information

Locus tag: RPD_3678
Symbol:
ID: 4024193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4103882
End bp: 4106659
Gene Length: 2778 bp
Protein Length: 925 aa
Translation table: 11
GC content: 65%
IMG OID: 637963882
Product: ATP-binding region, ATPase-like
Protein accession: YP_570801
Protein GI: 91978142
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases; [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
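
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated on the page, and the function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4103882
END_BP = 4106659
GENE_LENGTH_BP = 2778
PROTEIN_LENGTH_AA = 925


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed values agree: 4106659 - 4103882 + 1 = 2778 bp, and 2778 / 3 - 1 = 925 aa.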


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0781071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATC TTCTGCGTGA GTTCTTGACG GAAACTTTCG AGAGTCTGGA CACGGTTGAC 
AATCAGCTTG TCCGCTTCGA GCAGGAGCCC AATAACGCAA AGATATTGGA CAATATCTTC
AGGCTTGTTC ACACCATCAA GGGCACCTGC GGATTCCTCG GCCTGCCGCG CCTGGAAGCG
CTGGCGCACG CGGCAGAAAC CCTGATGGGT AAATTCCGCG ACGGCATGCC GGTCACCGGC
GAGGCCGTCA CGCTGATCCT GACCACGCTC GATCGGATCA AGGACATTCT CGGCGGTCTC
GAGGCAAGCG AAGCAGAACC CCCGGGCGAG GACGGCGACC TGATCGGCGA ACTGGAGCGG
CTCTCGATGC GCACGCCCGA ACAGATCGCA CTCGAACTCG GCGAGTCCGC AGCGGTCGCG
GAAGCTGCTC CGGATGCTCC GGAGGCTGCT CCTGAGCCCG AACAGACCGA AGGCACCTTG
GTAGCGCAGA CGCTGGAGCG TCCGCTCCGG CCGGGCGAGG TCTCGCTCGA CGAATTGGAA
CGGGCATTCC AGGAGACCGC GATCGAGGCC GCCTCACCGG CGCTCGAGAC GGTCGTGGCG
CCCGCTGCGG TGGCCGAGGT GGCCGAGGTG GCCGCCAAGC CGGCGGCGAA GCCCGCCGTC
AAGGCCGCCA AGAAGGGTGA ACAGGACCCG GCGCCGGCGG AAGGCGACCG GATCGCCAAT
CAATCGATTC GCGTCAATGT CGATACGCTC GAGCACCTGA TGACGATGGT GTCCGAGCTG
GTGTTGACCC GCAATCAGCT GCTCGAAATC AGCCGCCGCC ATGAGGACAC CGAATTCAAG
GTGCCGTTGC AGCGGCTCTC GACCGTCACC GCCGAGTTGC AGGACGGCGT GATGAAGACC
CGGATGCAGC CGATCGGCAA CGCCTGGCAG AAGCTGCCGC GCATCGTGCG CGATCTCGCC
TCCGAACTCG GCAAGCATAT CGAGCTCGAG ATGCACGGCG CCGACACCGA GCTCGATCGG
CAGGTGCTCG ACCTGATCAA GGATCCGCTC ACCCACATGG TGCGCAACTC CGCCGATCAC
GGGCTGGAGC GGCCCGAGGA TCGGGTGCGC AACGGCAAGC CTGAGCAGGG CACCATCCGC
CTGTCCGCCT ATCATGAAGG CGGCCACATC GTGATCTGCA TCGCCGATAA CGGCCGCGGA
CTCGACACCG ATCGGATCAA GGCCAAGGCG ATCGCCAACG GCCTCGTCAC CGAGGCCGAG
GTCGAGAAGA TGACCGAGGC GCAGATCCAC AAATTCATTT TCGCGCCCGG CTTCTCGACC
GCCGCGCAGG TGACCAGCGT GTCCGGCCGC GGCGTCGGCA TGGATGTGGT GCGCACCAAC
ATCGACCAGA TCGGCGGCAC CATCGAGATC AAGTCGGTGG CCGGCGAGGG CTCGAGCGTC
ACCATCAAGA TTCCGCTGAC ACTCGCGATC GTCTCGGCGC TGATCGTCGA AGCCGGCGGC
GACCGCTTCG CGATCCCGCA GCTCGCGGTC GTCGAGCTGG TGCGGGCGCG CGCCAATTCC
GAGCACCGGA TCGAGCGGAT CAAGGATACG CCGGTGCTGC GCTTGCGCGA CAAGCTGCTG
CCGCTGATGC ATCTGAAGAA GCTGCTGCAT ATCGACGACG GCACCGCGCT TTCCGAGCCG
GAGAACGGCT TCATCGTGGT GACGCAGGTC GGTAGCCAGA CCTTCGGCAT CGTGGTCGAC
GCGGTTTTCC ACACCGAGGA AATCGTGGTC AAGCCGATGT CGACCAAGCT GCGCCACATC
GGGATGTTCT CGGGCAACAC CATTCTGGGC GACGGCGCGG TGATCATGAT CGTCGATCCG
AACGGCATCG CGCAGGCGCT CGGCACTTCG GTGTCGGCGC AGCACGATCT GTCGGAGCAG
AACGCAGCGA CCCGCGCGGC CTCGACCGAA CAGCTCACCT CGCTTCTGGT GTTCCGCGCC
GGCTCGGCGC ATCCGAAGGC GGTGCCGCTG TCGCTGGTGA CGCGCCTCGA GGAAATCGCC
GCCGAGAAGA TCGAGTACTC GAACGGCCGG CATATGGTGC AGTATCGCGA TCAACTGATG
CCGCTGGTGC TGATGGAGGG CGTTAATGTC GCCACCACCG GCGTGCAGCC GATCCTGGTG
TTCGTCGATG ACGATCGCAG CATGGGGCTC GTGGTCGATG AGATCGTCGA TATCGTCGAA
GAGCATCTGC AGATCCAGGT CGGCTCCAGC CGCGAAGGAA TTCTCGGCTC GGCGGTGATC
AAGGGGCTCG CGACGGAAGT GATCGACGTT GCTCACTTCC TGCCGATGGC TTTTGCCGAC
TGGCTCTCCC GCAAGGAAAT GAAGCAGTCG ATCGCCAGCC GCTCGGTGTT GCTGGTCGAC
GACTCCGCCT TCTTCCGCAA CATGCTCGGC CCCGTGCTGA AGGCGGCCGG CTACAGGGTT
CGGCTGGCGA CTTCGGCAAT CGAGGCGCTC GGAGTGCTGC GCACAGGTGT GCAGTTCGAC
GCGATCCTGA CCGACATCGA AATGCCGGAG ATGAACGGCT TCGAATTCGC CGAGGCGATC
CGCGCCGATG CCAAGCTCGC GCCGACGCCG GTGATCGCGC TGTCGTCGCT GGTGTCGCCG
GCGGCGATCG AGCGGGGCCG TCAGGCCGGT CTCACCGACT ACGTCGCCAA GTTCGACCGT
CCTGGCCTGA TCGCGGCGCT CAAGGAGCAG ACCGTGGACA ATGAGCCGGT CAGGGCGTTG
CAGCAACAAG CGGCGTGA
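
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of a GC-fraction calculation over such text. The helper names are hypothetical, and the page does not say how its 65% GC content figure was computed or rounded, so this is not necessarily the database's method.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full block
    # to get the figure for the whole gene.
    first_line = (
        "ATGGACGATC TTCTGCGTGA GTTCTTGACG "
        "GAAACTTTCG AGAGTCTGGA CACGGTTGAC"
    )
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.1%}")
```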
 
Protein sequence
MDDLLREFLT ETFESLDTVD NQLVRFEQEP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG KFRDGMPVTG EAVTLILTTL DRIKDILGGL EASEAEPPGE DGDLIGELER
LSMRTPEQIA LELGESAAVA EAAPDAPEAA PEPEQTEGTL VAQTLERPLR PGEVSLDELE
RAFQETAIEA ASPALETVVA PAAVAEVAEV AAKPAAKPAV KAAKKGEQDP APAEGDRIAN
QSIRVNVDTL EHLMTMVSEL VLTRNQLLEI SRRHEDTEFK VPLQRLSTVT AELQDGVMKT
RMQPIGNAWQ KLPRIVRDLA SELGKHIELE MHGADTELDR QVLDLIKDPL THMVRNSADH
GLERPEDRVR NGKPEQGTIR LSAYHEGGHI VICIADNGRG LDTDRIKAKA IANGLVTEAE
VEKMTEAQIH KFIFAPGFST AAQVTSVSGR GVGMDVVRTN IDQIGGTIEI KSVAGEGSSV
TIKIPLTLAI VSALIVEAGG DRFAIPQLAV VELVRARANS EHRIERIKDT PVLRLRDKLL
PLMHLKKLLH IDDGTALSEP ENGFIVVTQV GSQTFGIVVD AVFHTEEIVV KPMSTKLRHI
GMFSGNTILG DGAVIMIVDP NGIAQALGTS VSAQHDLSEQ NAATRAASTE QLTSLLVFRA
GSAHPKAVPL SLVTRLEEIA AEKIEYSNGR HMVQYRDQLM PLVLMEGVNV ATTGVQPILV
FVDDDRSMGL VVDEIVDIVE EHLQIQVGSS REGILGSAVI KGLATEVIDV AHFLPMAFAD
WLSRKEMKQS IASRSVLLVD DSAFFRNMLG PVLKAAGYRV RLATSAIEAL GVLRTGVQFD
AILTDIEMPE MNGFEFAEAI RADAKLAPTP VIALSSLVSP AAIERGRQAG LTDYVAKFDR
PGLIAALKEQ TVDNEPVRAL QQQAA
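
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI layout (bases in T, C, A, G order), which gives the same codon-to-residue assignments for translation table 11 as for the standard code. The two tables differ in the start codons they allow, and this sketch does not handle alternative starts. It assumes the gene sequence is given in the coding orientation and that translation stops at the first stop codon. The function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence above.
    print(translate("ATGGACGATC TTCTGCGTGA GTTCTTGACG"))  # MDDLLREFLT
    print(translate("CAGCAACAAG CGGCGTGA"))  # QQQAA
```

The two fragments translate to MDDLLREFLT and QQQAA, matching the first ten and last five residues of the protein sequence listed above.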