Gene RPB_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4238 
Symbol 
ID3912046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4814010 
End bp4815203 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content69% 
IMG OID637886141 
ProductPepSY-associated TM helix 
Protein accessionYP_487840 
Protein GI86751344 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.593429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAAGGT CGTCCGCGAT CAAACCGGCG CTGCTGCAAC TGCATTCGAT CGCCGGCCTG 
ATCCTGTCGC TGTTCCTCGC CGTGATCGCG CTGAGCGGCG CGGTGCTGAG TTTCGAGGAC
GAGATCCAGG CCGCGCTGAA TGCCGATCGC ACAACGGTCG AGCTCCGCGC GACGCCGCGG
CTGCAGCCCG ACGAATTGAT CGCCCGGCTG CAGGCCGCCA CCGGCGCCGG CAAGATCGCC
TCGCTCACGC TGGCGCGCGA TCCGGCGGCG GCGGTTCACA TCCGCTTCGC GCGCAATGAC
GACGCCTCGC GGCCGTCGTC GCTCTATGTC GATCCCTATG ACGCCCGCGT GCTCGGCCAT
CCGGTCGGCG AGGGCTTCTT CGCCACGGTG CGCAAGCTGC ATCGCTGGCT GCTGCTGCCG
GGCGATGCCA AGGGTTGGGG CCGGCCGATC GGCGGCATCG TCGCAATGGG CCTGATCGCG
ATGCTGATCA CCGGGCTGGT GCTGCGCTGG CCGCATCGCG CCGGCAGCGT GAAGGTCTGG
CTGAAGCCGA ATTGGCGGCT GCGCGGCCGC GGCCTGCACC GCTCGTTGCA CGCGGTGATC
GGCACTTGGG CGATGCTGAT CTATCTGGTG ATGGTGCTGA CCGGATTGTG GTGGTCGTTC
GACTGGTACA GAGACGGCGC AATCTGGCTT TTGTCCAGCG CGCCCCCGCG CGCCGAACCG
ATGCAGCCAG CGCCGAAGCG CATGGCCGCC GCATCCGACA AGTCCGACAG CAAGCGCGAG
GCTGCATCGG CGCTGCCGCT CGATCGTGTG TGGTCGGCCT TCCTGCAGCA GCAGGGCGAG
CGCTTCGTCA CCGCGCGCCT GACGCCGCCG GCCGGCGGCG GCACGCTGGT GCGCGTCCGA
TCCTGGAGCG CCGCCGCCGA CGGCGTCCGC GACGAATTCC GCATCGATGC CGCGAGCGGC
AAGATCGTCT CCGCCGACCT CTACGCCGCC AAGCCGCTCG GCGACCGCAT CCTCGCCCGC
GTGCTCGACA TCCACCGCGG CGCGATCCTC GGCTGGCCGG GCAAGCTGCT GTTCATGCTC
GCCGCGCTTG CAATGCCGCT GTTCGTGATC ACCGGATTGC TGCTGTATCT GTCACGCCGA
CGCCACAACC GCCTCACGCG CGCGCCGGTC GGAGAACTGG CGGCAGGAAA ATAG
 
Protein sequence
MARSSAIKPA LLQLHSIAGL ILSLFLAVIA LSGAVLSFED EIQAALNADR TTVELRATPR 
LQPDELIARL QAATGAGKIA SLTLARDPAA AVHIRFARND DASRPSSLYV DPYDARVLGH
PVGEGFFATV RKLHRWLLLP GDAKGWGRPI GGIVAMGLIA MLITGLVLRW PHRAGSVKVW
LKPNWRLRGR GLHRSLHAVI GTWAMLIYLV MVLTGLWWSF DWYRDGAIWL LSSAPPRAEP
MQPAPKRMAA ASDKSDSKRE AASALPLDRV WSAFLQQQGE RFVTARLTPP AGGGTLVRVR
SWSAAADGVR DEFRIDAASG KIVSADLYAA KPLGDRILAR VLDIHRGAIL GWPGKLLFML
AALAMPLFVI TGLLLYLSRR RHNRLTRAPV GELAAGK