Gene RPB_3133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3133 
Symbol 
ID3910934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3579560 
End bp3581149 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content63% 
IMG OID637885035 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_486740 
Protein GI86750244 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.362143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAC GCGAATTCAC CAAGCTCGGC CTGCTGGCCG GCGCGGCCGG CATCGGCGGC 
ATTCCGCTCG GCATCACCCG GGCGGTGGGG CAAACCCGCG GCGGCACGCT CAACACCATC
ATTCAGCCGG AGCCGCCGAT CCTGGTCACC GCGCTCAACC AGCAGCAGCC GACGCTGACG
CTCGGCGGCA AGATCTACGA AAGCCTGCTG CGCTACGATT TCGATCTCAA GCCGCTGCCC
GGCCTGGCGC AGTCCTGGGA AGTGTCGCCC GACAAGCTGA CCTATACATT CAAGCTGCAT
CCCAACATCA CCTTCCACGA CGGCGCGCCG CTGACGTCCG AAGACGTGGT GTTCTCGATC
ATGAAGGTGC TGATCGAGAA CCACGCCCGC GCGCGCAACA CGTTCTCGCG CGTCGAGAAG
GCCGAGGCGC CGGATCCGCT GACCGTGGTG TTCAAACTGA AGGCGCCGTT CGCGCCGTTC
CTCACCGCGT TCGACTGCAC CACGGCCCCG ATCGTGCCGA AGCATATTTA CGAAGGCACG
GACTATCGCA AGAACCCGGC CAATGCGAAG GCGATCGGCT CGGGTCCGTT CAAGCTGAAG
GAATGGGTGC GCGGTTCGCA CGTCCATCTG GTCAAGCACG AAGGCTATTA TCGCCCGGGC
GAGCCGGTTC TCGACGAGAT TATCTATCGG GTCATCCCGG ACTCGGCGTC GCGCTCGGTG
GCGCTGGAGC AGGGGACCGT GCAGCTCACG CAATGGACCG ACGTCGAACT GTTCGAGGTG
CCGCGGCTGT CCAAGCTGCC GCATCTGACG ATGACCACCA AGGGCTACGA ATTCTTCGCC
CCCCATACGT GGCTCGAGTT CAACACCCGG ATCGCGCCGA TGAACGACAA GCGGTTCCGG
CAGGCGGTGA TGTATGCGAT CGACCGCAAG GCGTTGCTGG CCCGCATCTA TTTCGGTCTC
GGCAAGGTCG CGACCGGCCC GGTGTCGTCG AAGACCAAAT TCTACGAGAA GAACGTCAAG
GCCTACGACT TCTCGCCCGA GAAGGCGAAG GCTCTGCTCG ACGAGATGGG GCTGAAGCCG
GGTGCCGACG GCAAGCGCGT GACGATTCCC TATCTCGTGC CGCCCTACGG TGAAACGCAT
CAACGGACCT CCGAATTCAT TCGCCAGTCG CTCGCCCGCG TCGGCATCGA CCTGCAACTC
CAGGGGATCG ATGTCGCCGG ATGGGCCGAG AAGTTCAGCA ACTGGGACTT CTCGATGACG
GCGACCACGG TCTATCAGTT CGGCGATCCG GCGCTCGGCG TGTCACGGAG TTATGTCTCC
TCCAACATCC GCAAGGGCAT TCTGTTCTCC AACACCTGCG GCTACTCCAA TCCGGAGGTC
GACCGGCTGT TCGAGGAGGC CGCGGTCGCG ACCTCGGACG ACAAGCGCCA GGAGTTCTAC
AGCGAGGTTC AGAAGATCAT GGTCGAGGAC GTGCCGGTCG CCTGGCTGCT CGAGATCGAC
TATCCGAACT TCATGGACAA ACGTCTGAAG AACGTGGTGA CGACGGCGAT CGGCGTGCAC
GACACGTTCG GGACGGTTTC GTTCGGATGA
 
Protein sequence
MNRREFTKLG LLAGAAGIGG IPLGITRAVG QTRGGTLNTI IQPEPPILVT ALNQQQPTLT 
LGGKIYESLL RYDFDLKPLP GLAQSWEVSP DKLTYTFKLH PNITFHDGAP LTSEDVVFSI
MKVLIENHAR ARNTFSRVEK AEAPDPLTVV FKLKAPFAPF LTAFDCTTAP IVPKHIYEGT
DYRKNPANAK AIGSGPFKLK EWVRGSHVHL VKHEGYYRPG EPVLDEIIYR VIPDSASRSV
ALEQGTVQLT QWTDVELFEV PRLSKLPHLT MTTKGYEFFA PHTWLEFNTR IAPMNDKRFR
QAVMYAIDRK ALLARIYFGL GKVATGPVSS KTKFYEKNVK AYDFSPEKAK ALLDEMGLKP
GADGKRVTIP YLVPPYGETH QRTSEFIRQS LARVGIDLQL QGIDVAGWAE KFSNWDFSMT
ATTVYQFGDP ALGVSRSYVS SNIRKGILFS NTCGYSNPEV DRLFEEAAVA TSDDKRQEFY
SEVQKIMVED VPVAWLLEID YPNFMDKRLK NVVTTAIGVH DTFGTVSFG