Gene RPD_0994 details

Gene Information

Locus tag: RPD_0994
Symbol:
ID: 4021469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1120290
End bp: 1122260
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 65%
IMG OID: 637961185
Product: chemotaxis sensory transducer
Protein accession: YP_568133
Protein GI: 91975474
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
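
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical and the numbers are copied from the record above.

```python
# Values copied from the Gene Information record.
start_bp = 1120290
end_bp = 1122260
gene_length_bp = 1971
protein_length_aa = 656

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the span is 1971 bp, which is 657 codons, or 656 amino acids plus the stop codon.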


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.357708
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGCCGTTCC GCCTCCGCCT CAGCCACAAG ATCAACTCCA TCGCCGTCGT CGGCATCGCC 
GGCGTTCTCG CCCTGGGCGC GCTATTCACG TTCGGCAACG CGTCACAGGA CGCAGCGCGG
ATCGAGGACG AACGTGCAAG GGCGCTCGGT GACAGCAACG CGAAGCTTCA GATCGCGATG
CTGGAGCAGC GCCGCGCTGA AAAGAATTTC ATCATCCGCA AGGAAGAGAG CTACCTCCGC
CAGTACCAGG ACAACGCCAA GGCGGCGAAG GCAATTCTCG CCGACATGAC CCAGCGCGCC
GAAGCAGCCG GGCAGACGGA CCTGAGCGGC AAGATGAAGA CCATCCAGGC GGGCTACGAA
GACTATGACC GCCATTTTGC GCAACTCGCC GACGCCCAGA TCAAGCTCGG CCTGAAAGAA
GATCTCGGGC TGGAGGGCAA CATACGCACT TTGGCGAAAA CAATCGAGAC CGCGCTCAGC
ACACTCGACG AACAGAAGCT GATGGTGACG ATGTTGATGA TACGGCGGCA CGAGAAGGAC
TTCATGCTGC GCGGCAATCC GCAGTACCTC GACGACATGA AGAAGCGCAT CGAGGAATTC
TCCGCGCAAC TCGCCGCCGC CGATCTTCCC ACCGCTTCGA AGACCGACAT CGGCCGGCAG
CTCGCCGTCT ATCAACGCGA CTTCAAGGGA TGGATGGAAA CCGGGCAGGT CATGGTCCAG
GAGGAAAAAA ACCTCGTGTC CCGCTTCCGC GCGATCGAGC CGGTGCTCCA AAGCGTCGGA
GCGACTATCA ATCAGTCCGC GGAACAGGCG AAGGCTGCAG CCGCCGCGGC GCGCGAGACG
ACGACGTCGC GGATGCAGAT CGCGATCGCG CTGATCATCC TCAGCGTCAG CCTGCTCGGC
CTGTTGATCG GCCGTTCGGT GGCGCGCCCG CTGAAAGGCT TGACTTCCGG GCTCAGAGAA
CTCGGCGCCG GCAATTTCGA CGTGGTACTG CCCGGCCTCG ACCGTCACGA CGAGATCGGC
GACATGGCGC AGGCGGTGGA ATCCTTCAAA GTGATGGCGC AGGACAGGGC CCGTGCCGAG
GCCGAGGCCA AGGCTCAGCA GGAACACCTC GCCGCCGAAC AGCGCAAGCG CGACATGAAC
AAGCTGGCGG ATCAGTTCGA GGAGGCGGTC GGGGAGATCG TCGAGACCGT CTCGTCCGCC
TCGACTGAAC TCGAAGCATC GGCGACGACG CTGACCGACA CAGCGCAGCA CGCGCAGCAG
TTCACCACGC TCGTCGCAGC AGCGTCGGAG GAAGCCTCGA CCAATGTGGA GTCGGTGGCG
TCCGCCAGCG AGGAGATGGC ATCGTCGGTC ACCGAGATCA GCCGCCAGGT GCAGGAGTCC
GCGCGGATCG CCAGCGAAGC GGTGACGCAG GCGCAGGAGA CCAACGATCG CGTCAGCCAC
CTGTCGGAGG CTGCCTCGCG GATCGGCGAC GTCGTTGATT TGATCAACAC CATCGCCTCC
CAGACCAACC TTCTGGCGCT GAATGCGACC ATCGAGGCTG CGCGCGCCGG CGACGCCGGG
CGCGGCTTCG CTGTGGTGGC GAGCGAGGTC AAGGCGCTGG CTGAGCAGAC CGCGAAGGCG
ACCGAACAGA TCAGCCAGCA GGTCGGCGGC ATCCAGTCCG CGACCGGCCA GTCGGTGGCG
TCGATCCGCG AGATCAGCGG CACGATTGCG CGGATGTCGG AGATCGCCGC GACGATCGCC
TCCGCGGTCG AGGAGCAGGG CGCCGCGACC AAGGAAATCT CACGCAACGT TCACCACGCA
GCCGCCGGCA CCCATGAGGT TTCGGTCAAC ATCGTCGAAG TGCAGCGCGG CGCGAGCGAG
ACCGGTTCGG CGTCTGCGCA GGTGCTGACG GCGGCGCATT CGCTGGCCCA CGACAGCGCA
CGCCTGAAGG ACGAAGTCAG CCGCTTCCTG CGCACGGTGC GCGCCAGTTG A
 
Protein sequence
MPFRLRLSHK INSIAVVGIA GVLALGALFT FGNASQDAAR IEDERARALG DSNAKLQIAM 
LEQRRAEKNF IIRKEESYLR QYQDNAKAAK AILADMTQRA EAAGQTDLSG KMKTIQAGYE
DYDRHFAQLA DAQIKLGLKE DLGLEGNIRT LAKTIETALS TLDEQKLMVT MLMIRRHEKD
FMLRGNPQYL DDMKKRIEEF SAQLAAADLP TASKTDIGRQ LAVYQRDFKG WMETGQVMVQ
EEKNLVSRFR AIEPVLQSVG ATINQSAEQA KAAAAAARET TTSRMQIAIA LIILSVSLLG
LLIGRSVARP LKGLTSGLRE LGAGNFDVVL PGLDRHDEIG DMAQAVESFK VMAQDRARAE
AEAKAQQEHL AAEQRKRDMN KLADQFEEAV GEIVETVSSA STELEASATT LTDTAQHAQQ
FTTLVAAASE EASTNVESVA SASEEMASSV TEISRQVQES ARIASEAVTQ AQETNDRVSH
LSEAASRIGD VVDLINTIAS QTNLLALNAT IEAARAGDAG RGFAVVASEV KALAEQTAKA
TEQISQQVGG IQSATGQSVA SIREISGTIA RMSEIAATIA SAVEEQGAAT KEISRNVHHA
AAGTHEVSVN IVEVQRGASE TGSASAQVLT AAHSLAHDSA RLKDEVSRFL RTVRAS
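
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last printed lines of the gene sequence are used as input. Reproducing the reported 65% GC content would require passing all 1971 bases to `gc_fraction`.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in `dna` that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied as printed above.
first_line = "ATGCCGTTCC GCCTCCGCCT CAGCCACAAG ATCAACTCCA TCGCCGTCGT CGGCATCGCC"
last_line = "CGCCTGAAGG ACGAAGTCAG CCGCTTCCTG CGCACGGTGC GCGCCAGTTG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))                # 60 bases on a full line
print(head.startswith("ATG"))   # True: the CDS begins with an ATG start codon
print(tail[-3:])                # TGA: the CDS ends with a stop codon
```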