Gene Rpal_5172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5172 
Symbol 
ID6412872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5575300 
End bp5577270 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content65% 
IMG OID642715062 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001994135 
Protein GI192293530 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTTC GTCTCGGCCT CACGTCCAAG ATCAATTCCA TCGCTCTGGT CGGCATCATC 
GGCGTTCTCG CCTTTGGCGC GCTGTACTTG ATCGGGACAT CTTCACAGGA TGCAGCGAGG
CTGATCGACG ACCGTGCCCG CGCACTCGGC GATAGCAACG CGAAATTGCA GATCGCAATG
CTCGAGCAAC GCCGCGCCGA GAAGAACTTC ATGCTGCGTA AGGACGAGCA GTATCTCGGC
ATGTTTCAGC AGAGCGGCCG GGTCGCAACC GAGATGCTCG CCGATATGAT CCGGCAGACC
GAGGCGACCG GTCAGACTGA CCTCACGCGC AGTTTGAAGT CGGTGCAAGA TGGCTTCGAG
AACTACCAAA GCCAATTCGG CAGGTTCGCC GAGGCCACCG TGAAGCTCGG GCTCAAGGAG
GACCTTGGAC TCGAAGGCAG CCTGCGAGCT TCGGTACACG GTGTCGAAAA ATCGATCAGC
AGCTTCGACG CGCCGGCCCT GATGGTCCAG ATGCTGATGA TGCGCCGGCA CGAGAAGGAC
TTCATGCTCC GCCGTCATCC GAAATACGGC GAGGCGATGA AGAAGCAATC CGCCGAATTC
GCCAAGCTGC TCGCCGCATC GGATCTGCCG CAGACGGCCA AGACCGAGAT CACGCAAAAG
CTCGACGCCT ATCAGCGCGA TTTCTCAGCC TGGATGGAGA ATGCGCTGGC GCTTGATCGT
GCCGAGAAAG ACATGGTGAC GACGTATCGG GCGCTGCAGC CCGCGCTCGA CGAGCTCTCC
AGCACGGTGC GGCAGCAGGC GGACCTCGCA AAGACGATGG CCGCTACCGC GCGGCAGGCC
ACCGAGCAGC GCATGCAGAT CGCGATCATC GCCATCATCC TGACCGTGAT GGTGCTCGGC
ATCTCGATCG CGCGGTCGAT CACCAGGCCG CTCAGCGGGC TGAACGCCGG CATCCGCCGC
CTCGGCGACG GCGAACTCGA CCTGGTGCTC CCGGGTCTGC AACGAACCGA CGAGATCGGC
GACATGGCGC GCGCGGTGGA GTCCTGCAAG CTGAAGGCCG AGGAGCGCGC CGCAGCAGAA
GCCGCCGCCA AGGCGGATCA GGACCGGCTG GCCGCGCAGC AGCGCAAGGG CGAGATGATC
GCGCTCGCCG CCAAATTCGA AGACGCGGTC GGCGAGATCG TCGAGACCGT GTCATCGGCC
TCGACCGAGC TGGAAGCGTC GGCAACCACC CTGACCTCGA CCGCCGATCA CGCCCAGCAG
TTCACCACCC TGGTCGCGGC CGCCTCCGAG GAGGCGTCCA CCAATGTGCA GTCGGTGGCA
TCGGCCAGCG AAGAGATGGC ATCCTCGGTC AACGAGATCA GCCGCCAGGT CCAGGAGTCG
GCGCGGATCG CCAGCGAAGC GGTCACGCAA GCACAGGTCA CCAACGAGCG CGTCAGCCAC
CTGTCCGAAG CCGCATCGCG GATCGGCGAC GTGGTCGAAC TGATCAACAC CATCGCGGCG
CAAACCAACC TGCTGGCGCT GAACGCCACG ATCGAAGCAG CGCGCGCTGG CGAAGCCGGC
CGCGGCTTCG CCGTGGTGGC GGCGGAGGTG AAGCAGCTCG CCGAACAGAC CGCCAAAGCC
ACCGACCAGA TCAGCCAGCA GGTCGGCGGC ATCCAGAGCG CCACCGACCA GTCGGTGAGC
GCGATCCGGC AGATCGGCGA AACCATCGCG CGGATGTCGG AGATCGCCGC GACCATCGCG
TCCGCGGTGG AAGAACAGGG CGCCGCGACC CAGGAGATCT CACGCAACGT CCACCATGCC
GCCGAAGGTG CGCACCAGGT CTCGGTGAAC ATTGTCGAGG TCCAGCGCGG CGCCTCGGCG
ACCGGCTCGG CATCGGCGCA GGTGCTGTCC GCGGCGCAGT CGCTGGCGCA CGACAGCACC
CGGCTGAAGG ACGAAGTCGG CCGCTTCCTC CGAACAGTGC GGGCAGCGTA G
 
Protein sequence
MSLRLGLTSK INSIALVGII GVLAFGALYL IGTSSQDAAR LIDDRARALG DSNAKLQIAM 
LEQRRAEKNF MLRKDEQYLG MFQQSGRVAT EMLADMIRQT EATGQTDLTR SLKSVQDGFE
NYQSQFGRFA EATVKLGLKE DLGLEGSLRA SVHGVEKSIS SFDAPALMVQ MLMMRRHEKD
FMLRRHPKYG EAMKKQSAEF AKLLAASDLP QTAKTEITQK LDAYQRDFSA WMENALALDR
AEKDMVTTYR ALQPALDELS STVRQQADLA KTMAATARQA TEQRMQIAII AIILTVMVLG
ISIARSITRP LSGLNAGIRR LGDGELDLVL PGLQRTDEIG DMARAVESCK LKAEERAAAE
AAAKADQDRL AAQQRKGEMI ALAAKFEDAV GEIVETVSSA STELEASATT LTSTADHAQQ
FTTLVAAASE EASTNVQSVA SASEEMASSV NEISRQVQES ARIASEAVTQ AQVTNERVSH
LSEAASRIGD VVELINTIAA QTNLLALNAT IEAARAGEAG RGFAVVAAEV KQLAEQTAKA
TDQISQQVGG IQSATDQSVS AIRQIGETIA RMSEIAATIA SAVEEQGAAT QEISRNVHHA
AEGAHQVSVN IVEVQRGASA TGSASAQVLS AAQSLAHDST RLKDEVGRFL RTVRAA