Gene Rsph17029_3600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3600 
Symbol 
ID4898786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp689644 
End bp691299 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content69% 
IMG OID640114208 
ProductPepSY-associated TM helix domain-containing protein 
Protein accessionYP_001045462 
Protein GI126464349 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.270401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGCT CGTTCCGCCA ATCGATGGCA TGGCTTCATA CCTGGACGGG TCTCGTGGTC 
GGCTGGGTGC TGTTTCTGGT CTTCGTGACC GGGACGGCGG GCTATGTGCA GTACGAGATT
TCGCGCTGGA TGCGCCCCGA GCTGCCGATG GAGGCGCGGG GCGATCTGCC CTCCGTGGAT
GTGATGATCG GCCATGCGCT GGCGCGGCTC GAGGCGAACG CGCCCGAGGC GAAGTCATGG
CAGATCGTTC TGCCGCATGC GGCGCGCCAG CCGCGCGGCT GGCAGCCGCT GTCGATCCGC
TGGGAAGAGA TGCCGCCCGA CCGGCAGGAC TTCGGGCGCA CCGGATCCGA GGTGCTGGAT
GCGGTCACGG GTCAGGCGCT GGCCTCTCCC GAGCCGCGGG ACACATGGGG CGGGGCGGGT
CTCTACCGGA TGCACTATGC GCTGCATTAT GTGCCCTACT GGGTCGGCTA TTACATCGTC
GGCATCTGCA CGATGCTGAT GCTGATGGCG GTCTTTTCCG GCGTCATCAC GCACAAGAAG
ATCATCGCCG ACTTCTTCAC CTTCCGCCCC GGCAAGGGTC AGCGGTCGTG GCTCGACGCG
CATAACGTCA TCAGCGTCAT GTGCCTGCCG TTCTTCACCA TGATCACCTA CAGCGGGCTG
GTCTTCTTCA CCACATGGTA TGCGCCGGCG CCGGTCGCGG CGGTCTATGG CACGGGCGAT
GCGGCGATGA ACCGCTATTG GGACGACCGT TCCCCCGTGC ACAGGGCCGG CTATCAGCCC
GCCCGCGCCG AGGCCGCGCT TCTGGCCCGG CTGGTGACGC AGGCCGAGGC GGACTGGGGC
ACCGGCCGCG TGGCCGAGCT ACGGATCGAG CATCCGCGCG GCGAGCCGCC CTTCGTCGAG
CTGTCGGGCG TCGCCGGAGA CCGGATGGGC GGGCTGCGGC CCGCGCTTCT GCGCTTCGAC
GCGGTGGAGG GCGCGCCACT GCCGCCGGAC GACCGCGACG GCGCCGCCGC CCGCACCGAG
CGCCTCCTCT TCGATCTGCA CGAGGGGATC TTCGCCGGCT GGGTGCTGCG CTGGCTCTAT
GTGGTCTCGG GGCTCCTCGG CTGCGGGGTG ATCGCGACGG GGCTGGTGCT CTGGACCGTC
AAGCGCCGGC AGAAGCACAT CAAGGGCGCC TCGGCGGGCG CACGCTTCGG CCTGCGTCTG
GTCGAAGTGC TGAACGCCGG CACCGTCATC GGCCTGCCCT TCGGCATCGC GCTGTTCTTT
CTGGCCAACC GTCTCTTGCC CCTGCAGATG GAGAGCCGCG CGGAATGGGA GTTTCACGCG
CTGTTTCTCG GCTGGGGCTG GGCCCTGCTG TGGGCCTCGG TCCGGCCCCT GAAGCGCGCA
TGGATCGACC TGTGCCGGCT CGCGGCAGCC GCCTGTCTGG CGATCCCGCT GGTCAATGCG
CTGACCACCG ACCGGCATCT GGGCGCGAGC CTTCCGGCGG GAGACTGGGC GCTGGCAGGC
TTCGACCTGT CGATGCTGGG CTTTGCCGCC TTCTTTGCGC TGATGGCCGG GAAGCTGCGG
CGCAAATGGG CGGTGCCGGA TGAGGACGGC GCATCCGCCG GAGCCCCGGC ACTTTCAGCC
GGCCATGGGA GGGTCCGCCA TGAACCTGCC GAGTGA
 
Protein sequence
MHSSFRQSMA WLHTWTGLVV GWVLFLVFVT GTAGYVQYEI SRWMRPELPM EARGDLPSVD 
VMIGHALARL EANAPEAKSW QIVLPHAARQ PRGWQPLSIR WEEMPPDRQD FGRTGSEVLD
AVTGQALASP EPRDTWGGAG LYRMHYALHY VPYWVGYYIV GICTMLMLMA VFSGVITHKK
IIADFFTFRP GKGQRSWLDA HNVISVMCLP FFTMITYSGL VFFTTWYAPA PVAAVYGTGD
AAMNRYWDDR SPVHRAGYQP ARAEAALLAR LVTQAEADWG TGRVAELRIE HPRGEPPFVE
LSGVAGDRMG GLRPALLRFD AVEGAPLPPD DRDGAAARTE RLLFDLHEGI FAGWVLRWLY
VVSGLLGCGV IATGLVLWTV KRRQKHIKGA SAGARFGLRL VEVLNAGTVI GLPFGIALFF
LANRLLPLQM ESRAEWEFHA LFLGWGWALL WASVRPLKRA WIDLCRLAAA ACLAIPLVNA
LTTDRHLGAS LPAGDWALAG FDLSMLGFAA FFALMAGKLR RKWAVPDEDG ASAGAPALSA
GHGRVRHEPA E