Gene RPB_0189 details

Gene Information

Locus tag: RPB_0189
Symbol:
ID: 3907794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 208007
End bp: 209626
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 66%
IMG OID: 637882070
Product: hypothetical protein
Protein accession: YP_483811
Protein GI: 86747315
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
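
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(208007, 209626)
    print(length)                  # 1620
    print(protein_length(length))  # 539
```

Under these assumptions the computed values match the reported Gene Length (1620 bp) and Protein Length (539 aa).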


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATG TCGAACGCGA TCCCCCGATG ACGGCCGAAA CACGGGCGTC CCCCTGGATC 
GCGTTCCGCC ACACCGCCTT TACGGTGGTC TGGACCGCAA CCGTCGTCGC CAACGTCGGC
ACCTGGATGT ACAACGCGGC ATCCGGCTGG CTGATGACGA GCCTGGAAGC CGATCCACTG
ACCGTTTCGC TCGTTCAAGT CGCGTCCAGC CTTCCGATGT TCCTGTTCGC GATCCCGGCC
GGCGCGCTGG CGGACATCGT CGACAAGCGG CGCTTCCTGA TCCTGATCGA GATCGTGCTC
ACCGTGTTTG CGGCCGCGAG CGCGGTGCTG GTCTGGCTCG GGCTGATGAA CCCGTTCCAG
CTGTTGCTGT TCACGTTTCT GCTCGGCGCG GGTGCGGCGT TCGCGGCGCC GGCCTGGCAA
TCGATCGTGC CGGATCTCGT GCCCAAGGAG CACCTCGCAT CGGCGGTGGC GAGCAATGGC
GTCGGCATCA ATGTCAGCCG GGCGATCGGC CCGGCGCTGG GCGGCGTGGT GATCGGCGTC
GCGGGCATCG CGGCGCCGTT CTGGATCAAC GCGCTGAGCA ATTTCGCGGT GATCGGCGCG
CTGCTGTGGT GGCGCCCCGC CGCCAAGCGC GCGGCGACGC TGCCTCCGGA ACGGTTGTTC
AGCGCGATCG TGATCGGCTT TCGCCACGCG CGATACAATC TCGATCTGCG CGCCACACTG
GTGCGCGCGG TCGCGTTCTT CTTTTTCGCC AGCGCGTATT GGGCGCTGCT GCCGCTTGTC
GCCCGCTCGC GCATCGCCGG GGGACCGGAA CTGTACGGCA TTTTGCTCGG CGCAATCGGC
CTCGGCGCGA TTGTCGGCGC GTTCCTGTTG CAGGGGTTGA AATCGGCGCT GGGGCCGGAT
CGCTTGGTCG CGGCCGGCAC GCTCGGGACA GCGGTCAGCC TCGTTCTGCT CGGCAGCGTC
CAACGCGTCG AACTCGCGAC CGCGGCCTGT TTCATCGCGG GCGTTTCGTG GATCGCGGTT
CTCGCCAACC TCAACGTTTC CGTTCAGGTC GCGCTACCGG ACTGGGTGCG CGGCCGCGGC
CTGGCGATGT TCGTCACGGT GTTCTTCGGC GCGATGACGG CCGGCAGCGC GCTCTGGGGT
CAGTTGGCGT CGTCGTTCGG ATTGCCGGCA GCGCATTTCG CCGCGGCCGT CGGCGCCGTC
GTCGGCATCG CCGTGACGTG GCGCTGGAAA CTGCGCGGCA GCGCCGAGCA CGATCTCGCA
CCCTCGATGC ATTGGCCCGC GCCGGTGCTG GCCATCGATG CCGATGCCGA TCAAGGCCCG
GTGCTGATTA CGGTCGAGTA CCATGTCGCA GCGGACAGGC GCGACGCCTT CCTGCTGGCT
ATGCGAAAAT TGAGCCGACA GCGACGGCGT GACGGCGCAT ATGCGTGGGA CGTGTTCGAA
GATACCTCGG AGCGCGGACG ATTCGTCGAG GTGTTCAAAG TTGCGTCATG GCTCGAGCAT
CTTCGGCAAC ATGATCGCGT CACCAATGCG GATCGGATCG ATCAGAATGC GATCCGCCAT
TTCCACGCGA GCGCAGAGCC TCGCGTGACG CACTTGCTCG CTGCGAAGTT CCCGGCATGA
 
Protein sequence
MTNVERDPPM TAETRASPWI AFRHTAFTVV WTATVVANVG TWMYNAASGW LMTSLEADPL 
TVSLVQVASS LPMFLFAIPA GALADIVDKR RFLILIEIVL TVFAAASAVL VWLGLMNPFQ
LLLFTFLLGA GAAFAAPAWQ SIVPDLVPKE HLASAVASNG VGINVSRAIG PALGGVVIGV
AGIAAPFWIN ALSNFAVIGA LLWWRPAAKR AATLPPERLF SAIVIGFRHA RYNLDLRATL
VRAVAFFFFA SAYWALLPLV ARSRIAGGPE LYGILLGAIG LGAIVGAFLL QGLKSALGPD
RLVAAGTLGT AVSLVLLGSV QRVELATAAC FIAGVSWIAV LANLNVSVQV ALPDWVRGRG
LAMFVTVFFG AMTAGSALWG QLASSFGLPA AHFAAAVGAV VGIAVTWRWK LRGSAEHDLA
PSMHWPAPVL AIDADADQGP VLITVEYHVA ADRRDAFLLA MRKLSRQRRR DGAYAWDVFE
DTSERGRFVE VFKVASWLEH LRQHDRVTNA DRIDQNAIRH FHASAEPRVT HLLAAKFPA
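
The sequences above are printed in blocks of 10 characters, 60 per line. To work with them programmatically, the blocks first need to be joined. The following is a minimal Python sketch, assuming spaces and line breaks are the only formatting in the pasted text; the helper names are hypothetical. It also shows how a GC fraction such as the reported GC content could be recomputed from the cleaned gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGACGAATG TCGAACGCGA TCCCCCGATG "
        "ACGGCCGAAA CACGGGCGTC CCCCTGGATC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))  # 60
    print(f"GC fraction of this line: {gc_fraction(seq):.0%}")
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.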