Gene RPB_4368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4368 
Symbol 
ID3912183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4951999 
End bp4953276 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content66% 
IMG OID637886274 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_487966 
Protein GI86751470 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGA AGCCCCCCTC CGACATCCTC GATCGCCGCC GGTTTCTCGG CGCTGCCGGC 
CTCGCCGGCG CCGGCGCGCT GCTGCCGCTC GCCGCCAAGG CCGGCGAGGC GGCGAAGCCC
GACCCGATGA TCACCGAGCT GCAGGACTGG AATCGCTTTC TCGGCGACGG CGTCGACAAG
AAGCCCTATG GCGTGCCGTC GAAATTCGAG AAGGACGTGA TCCGCCGCGA CGTGTCGTGG
CTCACGGCGT CGCCGGAATC CTCGGTCAAT TTCACGCCGC TGCACGCGCT CGACGGCATC
ATCACGCCGT CCGGCCTGTG CTTCGAGCGC CATCACGGCG GCGTCGCCGA GATCGATCCG
GCGCAGCACC GGCTGATGAT CAACGGCCTG GTCGACACCC CGATGGTGTT CACCATGGAC
GACATCCGGC GGATGCCGCG GGTCAACAAG GTGTACTTCC TGGAATGCGC GGCGAATTCC
GGCATGGAGT GGCGCGGTGC GCAGCTCAAC GGCTGCCAGT TCACCCACGG CATGATCCAC
AATGTGATGT ACACCGGCGT GACGCTGAAG ACGCTGCTGG ATCAGGCCGG GCTGAAGCCG
AACGCCAAAT GGCTGATGCT GGAGGGCGCG GACTCCGCCG GCATGAACCG CTCGCTGCCG
GTGTCGAAAG CGCTCGACGA CGTGCTGATC GCGTTCGCGA TGAACGGCGA GGCGCTGCGT
CCGGAAAACG GCTATCCGCT GCGCGCGGTG ATTCCCGGCT GGCAGGGCAA TCTCTGGGTC
AAATGGCTGC GCCGCATCGA GGCCGGCGAC ATGCCGTGGC AGGCCCGCGA GGAGACCTCG
AAATACACCG ACCTGATGCC GGACGGCCGC GCCCGCAAAT ACACCTTCGT GATGGACGCG
AAAAGCGTGA TCACCAATCC GTCGCCGCAG GCGCCGCTGA AATTCAAGGG CCGCAACGTG
CTGAGCGGCG TCGCCTGGTC GGGGCGCGGC ACCGTCAAGC GCGTCGACGT CACCATGGAC
GGCGGCCGCA ACTGGCGCGA GGCGCGGATC GACGGGCCGG TGCTCGACAA GTCGATGGTG
CGTTTCTACG TCGATTTCGA CTGGAACGGC GAAGAGTTGA TGTTGCAGTC GCGCGCCATC
GACGAGACCG GCTACGTGCA GCCGAGCAAG GCCGAGCTGC GCAAGGTCCG CGGCGTCAAT
TCGATCTACC ACAACAACGG CATCCAGACC TGGCTCGTGC ATCCGGACGG AGTGACTGAA
AATGTCGAGA TCGCTTAG
 
Protein sequence
MSEKPPSDIL DRRRFLGAAG LAGAGALLPL AAKAGEAAKP DPMITELQDW NRFLGDGVDK 
KPYGVPSKFE KDVIRRDVSW LTASPESSVN FTPLHALDGI ITPSGLCFER HHGGVAEIDP
AQHRLMINGL VDTPMVFTMD DIRRMPRVNK VYFLECAANS GMEWRGAQLN GCQFTHGMIH
NVMYTGVTLK TLLDQAGLKP NAKWLMLEGA DSAGMNRSLP VSKALDDVLI AFAMNGEALR
PENGYPLRAV IPGWQGNLWV KWLRRIEAGD MPWQAREETS KYTDLMPDGR ARKYTFVMDA
KSVITNPSPQ APLKFKGRNV LSGVAWSGRG TVKRVDVTMD GGRNWREARI DGPVLDKSMV
RFYVDFDWNG EELMLQSRAI DETGYVQPSK AELRKVRGVN SIYHNNGIQT WLVHPDGVTE
NVEIA