Gene RPB_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0101 
Symbol 
ID3909687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp108459 
End bp109907 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content69% 
IMG OID637881982 
ProductPepSY-associated TM helix 
Protein accessionYP_483724 
Protein GI86747228 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCA TCACCAGGCG GCTGAAGCGG TGGCTGTATC TCGGCCACCG CTGGCTCGGG 
ATCGCGTGCT GTCTGTTGTT CGCGATCTGG TTCATCTCCG GCGTGGTGAT GATGTATGTG
GCGTTCCCGC AATTCGACGC TAAGGAGCGC CGGGCAGCGC TGCCGGATCT CGCCTTCAGC
GAAATCCGGC TCGCCCCCGA TCAGGCGATG GCGGCGGCCG GGCTGACGAC CTATCCGCGT
GAACTCCGTC TTGCGATGCA GGACGGCGAG CCGGTGTACC GACTCGCCGG CACGAACAGA
CGGCGGCAGG CGATCTCCGC CGTCGACGGC CGGGCGCTCG GCGACATATC GCCCGAACAG
GCGCTGGCGG TGGCGCGGCA TCATCCCGCC GCTGTGGCGC CGGCGCTGCT CGACATCGTC
GATCGCGACC AATGGAGCGT CACCGCGCGG TTCGATCCGT TGCGGCCGCT GTTTCTGATC
GGGCTCGGCG ACGATGCCGG CACCGAGCTT TACGTCTCGC AGAAGACCGG CGAGATCGTG
CTCGACACCA ATCGTCACGA GCGGGTCTGG AACTGGCTCG GCGCGATTCC GCACTGGATC
TATCTCACCT TGCTGCGCCA GGACGCGCCG CTGTGGCGAC AGGTGGTGAT GTGGACCTCT
GGGATCTGTT TGCTGGTCGC GATCAGCGGG ATCTGGATCG GGCTGCTGCG CGCCGGATTG
CGGCGGCGCT ATGCGTCGGG CCGGATCACG CCCTATCGCG GCTGGATGGC GTGGCATCAC
CTCACCGGCC TCGTCGCCGG CGTACTGGTG CTGACCTGGA TGGCCTCGGG CTGGCTGTCG
GTCAATCCGT TTGAGCTGTT CGCGCGCCGC GGCGACAGTC GCGAGGCGCT GCAGCGCTAT
GCCGGCCACG ACGCGCCGAC GATCGCGTCC ACACTCCCCG CACGCGACCG GCCCGGCGTC
GTCGAAGCGC GCTTCATCTG GGTCGGCGGC GCGCCGCTGA TGCTGCTGGC GCATCGCGAC
GGATCGCAGA GCGTGGCCGA TCCGGCGAGC GGCGCATCGC GGACGCTGTC ACCGGAGCGG
ATCTTCGACG CGGCGGCGCG GCTGCTCCCT GACGCCACGA TGACGCTGCG GCAGCGGCTG
GAGGAGCCCG ACGCCTATTG GTACTCGCAT CATCATCAGC GCGTGCTGCC GGTGCTGCGC
GCCGGCTTCG ACGATGCCGC CGCTACCTGG TATCATCTCG ACCCGTCGAC CGGCGAAATT
CTCGGGCGCA GCGATCGCAG CCGCCGGGTC TATCGCTGGC TGTTCAACGC GCTGCACAGC
TTCGATTTCC CGGTGCTGAT CGCGCACCGG CCGGCGTGGG ACATCGTGGT GATTGTATTG
TCGCTGGCGG GGCTGGTGAT TTCGGTCAGC GGCATCCTGC TCGGCTGGCG GCGGCTGCGC
CGCGCGTGA
 
Protein sequence
MRRITRRLKR WLYLGHRWLG IACCLLFAIW FISGVVMMYV AFPQFDAKER RAALPDLAFS 
EIRLAPDQAM AAAGLTTYPR ELRLAMQDGE PVYRLAGTNR RRQAISAVDG RALGDISPEQ
ALAVARHHPA AVAPALLDIV DRDQWSVTAR FDPLRPLFLI GLGDDAGTEL YVSQKTGEIV
LDTNRHERVW NWLGAIPHWI YLTLLRQDAP LWRQVVMWTS GICLLVAISG IWIGLLRAGL
RRRYASGRIT PYRGWMAWHH LTGLVAGVLV LTWMASGWLS VNPFELFARR GDSREALQRY
AGHDAPTIAS TLPARDRPGV VEARFIWVGG APLMLLAHRD GSQSVADPAS GASRTLSPER
IFDAAARLLP DATMTLRQRL EEPDAYWYSH HHQRVLPVLR AGFDDAAATW YHLDPSTGEI
LGRSDRSRRV YRWLFNALHS FDFPVLIAHR PAWDIVVIVL SLAGLVISVS GILLGWRRLR
RA