Gene RPB_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2047 
Symbol 
ID3909862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2325976 
End bp2327271 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content62% 
IMG OID637883940 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_485665 
Protein GI86749169 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGCA AGGTCTCGCG TCGAAAACTG CTTCACATGG CGTCAGCAGG CACGGCCGCT 
GCCGCTTTTC CCGCCCCATT CGTGTCCGGC GTCACGCGTG CGGCATCGGC AGATCCGATC
CTGCTCGGGG TTCCGACGGC TCAAACCGCC CAGGCGGGCG TCGCGGATCA TCAGGACTAT
CTGAACGGGA CGACGTTGGC CCTGGAAGAA ATCAACGGCG CCGGCGGCGT GCTCGGGCGT
CAGGTCAAAG CCGTCGTGGT CGATATCGAC CCGCTGTCCC CGGAAAGCGG GCAGGTTGCA
ATCAACAAGC TGATCGACGC CAAGGTGCAC GCGATGTCCT GCGCCTTCGT GTTCACGCCG
GTCCCGGTGG CGGACGTGTC GGCGCGCTAC AAGGCGCCGT TTCTGTGGGG CCTCACTCAG
CGCAACATGA CCGATCTCGT CGCCAAGCAG CCCGACAAAT ACTCGCACGT GTTTCAGACT
GACCCGTCCG AGGTCCACTA CGGGCACACG TTCCCGGTGT TTTTGAAAGC GATGAAGGAC
CAGGGGGTGT GGAAGCCGCT GAATAACGGC GTGCACATCG TCCAGGAACA GATCGCCTAC
AACCAGACGA TCTCGAAGGC GCTGCAGGCG TCGCTCCCCA AGAGCGAGTT CAAGCTCGCC
GGCATCACCG ACATCCAGTA TCCGGTGCAG GACTGGGGCA CAGTCATCCA GGAGATCAAG
AAGGTCGGGG CCGGGGCGGT GATGATCGAC CATTGGGTCG CCGCCGAATA CGCCGCCTTC
GTCAAACAGT ACAGCGCCGA TCCGTTGAAG GGCGCGCTCG TCTATCTGCA ATACGGACCG
TCGCAGCCCG AGTTCCTCGA ACTGTCGGGG CCCGCCGCTG AAGGCTTCGT CTGGAGCACC
GTGCTCGGCG TCTATGCGGA CGAGAAGGGC AAGGCATTCC GCGCCAAATA CAAGAAGCGG
TTTCCCGGCA TCATGGGGCT TTGCTACACC GGCAACGGTT ACGACACGAC GTATTATCTC
AAGGCCGCCT GGGAGGCCGT CGGCGATCCG TCGAACTTCA AGGGCGTCAG TGACTGGATC
CGCAAGAATT CCTATCGCGG CGTCTGCGGC TTCATGAGCA TGGACAATCC CTATCAGGAA
TGTGCGCACT ATCCGGACAC GGGTGATGCG ATCGGAGCCG CTGAGCTCGA GAAGGGCATG
GCGCAACTGT TCTTCCAGGT CCAGAACAAC GAGCACAAGA TCATCTATCC GGACGTGCTC
GTCGAGAACA AGCTGCAGAA GGCGCCGTGG TGGTGA
 
Protein sequence
MVSKVSRRKL LHMASAGTAA AAFPAPFVSG VTRAASADPI LLGVPTAQTA QAGVADHQDY 
LNGTTLALEE INGAGGVLGR QVKAVVVDID PLSPESGQVA INKLIDAKVH AMSCAFVFTP
VPVADVSARY KAPFLWGLTQ RNMTDLVAKQ PDKYSHVFQT DPSEVHYGHT FPVFLKAMKD
QGVWKPLNNG VHIVQEQIAY NQTISKALQA SLPKSEFKLA GITDIQYPVQ DWGTVIQEIK
KVGAGAVMID HWVAAEYAAF VKQYSADPLK GALVYLQYGP SQPEFLELSG PAAEGFVWST
VLGVYADEKG KAFRAKYKKR FPGIMGLCYT GNGYDTTYYL KAAWEAVGDP SNFKGVSDWI
RKNSYRGVCG FMSMDNPYQE CAHYPDTGDA IGAAELEKGM AQLFFQVQNN EHKIIYPDVL
VENKLQKAPW W