Gene RPB_4400 details

Gene Information

Locus tag: RPB_4400
Symbol:
ID: 3912215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4984989
End bp: 4986041
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 72%
IMG OID: 637886305
Product: putative FecR
Protein accession: YP_487997
Protein GI: 86751501
COG category: [P] Inorganic ion transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG3712] Fe2+-dicitrate sensor, membrane component
TIGRFAM ID:
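
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for RPB_4400.
length = gene_length_bp(4984989, 4986041)
print(length)                     # 1053
print(protein_length_aa(length))  # 350
```

Both results agree with the Gene Length (1053 bp) and Protein Length (350 aa) fields.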


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0568232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.23003
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCGTG GAAGAGCGGC GCTTGCCATA ACAGCAGACG CTTTTGCGAG AGAATGCGCG 
GCGATGGAAT TCGCATCCGA CGACAGCAAG GACCGGGAGC GGGCGTCGCG AGAGGCGACC
GAGTGGTTCG TGCGTCTGCA GAACCCCCTC GCCACCGACG ACACGCGGCG GGCTTACCGG
GATTGGCTGA TGGCCGACCC CGCGCATCGC GAAGCGATCC GCGACGTCTC CGAATTGTGG
GGCGCGCTCG ATCGGCCCGC CGCGCAGCTC GCCAGCACCG GCTGGCACCG CTCCGCCGAC
GAACCGGCAC CCCGGCCGCG CCGATGGTTC GCGACGGGAT CGAGGTTCGC GACCGCCGCC
GTCGTCGTCG TCGCGCTGGC GGGCGGCCTT GCCGTGTGGC GCGATCCAGG GCTGCTCGAC
CGGGCTTTCG CCGATGTCGC GACGCATCCC GGCGAGCGGC GCGAAGTGAG TCTCGCCGAC
GGGACGCTTG CTGTTCTCGA TGGCGACACG GCCCTCAAGA GCCACATGAG CGGCCCCCGC
CGCGACGTGA CCGTGTTGCG CGGCCGGGTC TGGCTCGATG TGGCCCGCGA TCCAGCGCGG
CCGTTCACGG TGCATGCCGG AGGCGTCGAT GCCCGGGTGC TCGGCACCGC CTTCGAGGTC
AATCGCGAGG CCGCCGCGGT CACCGTCGAG CGCGGCGAGG TCGCGGTGTC CGGCGTCGAC
AGTCGGCTCG GCCCGGTCAA GCTGACGGCC TGGCAGCGCG TTGCGCTTCA GGACGGCACG
CTGGGCGCGC CGGTCACGGT CGACCCGGAG CAGATGTTCG CGTGGCGGCG GGGGCTGATC
ATTCTCGATC GTGCGCCGCT GTCGCAGGTC GTCGAAGAAC TCGACAAAAT GGCGCCCGGC
CGCGTGCTGA TCGCCGATCC GGAGCTGAAG CGCCTGACGC TCTCCGGCGC CTTTCGCACC
GACGAGCCCG GCGCCGTGCT GGAAGCCCTG CGGAGCGCGC TCGGGCTCCG CACCGTCTCC
GTCCCGGGCT TCGCGACGCT GATCTACCGC TGA
 
Protein sequence
MRRGRAALAI TADAFARECA AMEFASDDSK DRERASREAT EWFVRLQNPL ATDDTRRAYR 
DWLMADPAHR EAIRDVSELW GALDRPAAQL ASTGWHRSAD EPAPRPRRWF ATGSRFATAA
VVVVALAGGL AVWRDPGLLD RAFADVATHP GERREVSLAD GTLAVLDGDT ALKSHMSGPR
RDVTVLRGRV WLDVARDPAR PFTVHAGGVD ARVLGTAFEV NREAAAVTVE RGEVAVSGVD
SRLGPVKLTA WQRVALQDGT LGAPVTVDPE QMFAWRRGLI ILDRAPLSQV VEELDKMAPG
RVLIADPELK RLTLSGAFRT DEPGAVLEAL RSALGLRTVS VPGFATLIYR
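
The protein sequence begins with M although the gene sequence begins with GTG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that translation, not the database's own pipeline: `translate_cds` is a hypothetical helper, the codon table is built from the NCBI-ordered amino acid string for table 11, and the input is assumed to be an in-frame coding sequence.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G with the first base
# varying slowest. Table 11 shares its amino acid assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence above.
print(translate_cds("GTGCGGCGTG GAAGAGCGGC GCTTGCCATA"))  # MRRGRAALAI
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence, spaces and line breaks included, should work the same way because whitespace is stripped before translation.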