Gene RPB_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3147 
Symbol 
ID3910948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3599915 
End bp3600949 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content70% 
IMG OID637885049 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_486754 
Protein GI86750258 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCCC GCTCCCTCAC GCGCCGTCGT TTGATCACCA TCGCCGCCGC GGCTGCCGCA 
GGATCGCTGT GGCCCGGCCG ATCACGCGCG GCAGCAGGGC CCGAGCCGGT GCGCTGGCAA
GGCGCCGCGC TCGGCGCGCA GGTGTCGATC GAGATTCACC ATCCGGATCG CGCCGCCGCC
GCGCGACTGG TTGAGCGCTC GATCGCCGAA GTGCGGCGGC TGGAGCGGCA GTTCAGCCTG
TATCAGCCGG ACTCCGCGAT CTGCGAACTT AACCGCAGCG GCGTGCTGAT TGCGCCTGAT
CCCGACATGG TGACGCTGCT GCAGGCCTCG CTCGGCTATG CCGATCTGAC CGGCGGCGCG
TTCGATCCGA CGGTGCAGCC GTTGTGGCGC CTGTATCAGC AGCACTTCTC ATCCGACCGG
ACCGATCCCG CAGGCCCCTC CTCGGCATGG CTCGAACAGG CGCTGGAGAA GGTCGGATAT
GATGGACTGC GCGTCACGCC CGACCGCATC GTGTTGCTCA AGCGCGGCGC CGCGATCACG
CTGAACGGCA TCGCCCAAGG TTATGCGACC GATCGCGTCG TCGAATTGCT CCGGAATGCG
GGGCTGTCGA CGACGCTGGT CGATATCGGC GAAGTCCGCG CGCTCGGCGG GCGGCCGGAC
GGCACGCCCT GGCGCGTCGG CCTCGCCGAT CCGGACCAGC CCGGCCGATC CGGCGAGATC
GTCGAAATCG CCGACCGGGC CGTGGCGACG TCTGCGGGCG CCGGCTTCCG GTTCGATCCG
GCGGGCCGCT TCACCCATTT GCTCGACCCG CGGACCGGCC GCAGCCCGCG CTCGTACAAT
TCGGTCAGCG TCATCGCGCC GACGGCGACC GCAGCGGACG CGCTGTCGAC CGGCTTCAGC
CTGATGCCGC TGCCGATGAT CCAGCGCATC GTCGATCAAT CCCATGGCGT GGAGGCGCGC
ATTCTCGACC TCGCCGGTCA GAGGCTCCAC CTCCAGGCGG CGTCCGGACG GAGGGGACAC
CGCTCCGCGT CGTGA
 
Protein sequence
MMPRSLTRRR LITIAAAAAA GSLWPGRSRA AAGPEPVRWQ GAALGAQVSI EIHHPDRAAA 
ARLVERSIAE VRRLERQFSL YQPDSAICEL NRSGVLIAPD PDMVTLLQAS LGYADLTGGA
FDPTVQPLWR LYQQHFSSDR TDPAGPSSAW LEQALEKVGY DGLRVTPDRI VLLKRGAAIT
LNGIAQGYAT DRVVELLRNA GLSTTLVDIG EVRALGGRPD GTPWRVGLAD PDQPGRSGEI
VEIADRAVAT SAGAGFRFDP AGRFTHLLDP RTGRSPRSYN SVSVIAPTAT AADALSTGFS
LMPLPMIQRI VDQSHGVEAR ILDLAGQRLH LQAASGRRGH RSAS