Gene Rsph17029_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0158 
Symbol 
ID4896381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp173191 
End bp174639 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content69% 
IMG OID640110741 
ProductHemY domain-containing protein 
Protein accessionYP_001042049 
Protein GI126460935 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0856182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.993204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTGGT CCTTGATCAA GATCCTGATT TTCGTGGCCC TCGTGGTCGC GCTCACCTTC 
GGCGCCTCGC AGCTCATGGA AAGCGGCGGC GCGCTGCGGC TGGCGGTGGG CGATCTCGAG
CTCAACCTCG GGCCGCTGCA GGCGGTGATC GCGGCGTTGC TCCTGATCCT CGCGGTCTGG
CTGTTCCTGA AGATCGTGGG CTTCCTCTTC GCGGTGCTGC GCTTCCTGAA CGGGGACGAG
ACCGCCGTCT CGCGCTACTT CGACCGCTCG CGCGAGCAGA AGGGCCTGCG CGCGCTTTCG
GAAGGGATGA TGGCGCTGGC CGCGGGCGAG CCGCGCACCG CCATGTCCCG CGCGGCGAAG
GCGCGGAAAT ACCTCGGCCA GAACGCGATG ACCACGCTTC TGAACGCGCA GGCCGCGCAG
CAGGCGGGCG ATTCGCGCCG GGCGCAGGAA TCCTACAAGC TGCTGCTCGA GGATGAGCGC
ACCCGCTTCG TGGGCGTCCG GGGGCTGCTG AAGCAGAAGC TGGACGAGGG CGACACCGAG
ACGGCGCTGG CACTTGCCCA GAAGGCCTTC GAGATCAACC CCAAGCACTC CGAGACGCAG
GACATCCTGC TGAAGCTGCA GGCCGATCTG CACGACTGGT CGGGCGCGCG CAGCACGCTC
TCCGCCAAGA TGAAGTCGGG CGCGCTGCCG AAGGCCGTCT ACAAGCGGCG CGATGCGGTG
CTGGCGCTGC AGACCGCCAA GGACGTGTTC GACGAGAATG CCTCGATCGA GGCGCGCGAG
GCGGCGATCC TCGCCAACAA GCAGTCGCCC GACCTGATCC CGGCCGCGGC GATGGCGGCG
CGCAGCTATC TCGCGCAGGG CAACAAGAAA TATGCCACGC GCGTTCTGAA GAAGGCCTGG
GAGGCCGAGC CGCATCCCGA TCTCGCCGCG GCCTTCGCCG AGATCGAGCC CGACGAGACG
CCGGCCGAAC GGCTCAAGCG GTTCCGCACC CTGACCGCGA TCCATCCCGA TCACGACGAG
ACGCGGATGC TGATCGCCGA ACTGTCGCTG GCGGCCGAGG ATTTCCCGGG TGCCCGGCGC
GCTCTGGGCG ACATCGTGGC GCGCCATCCG ACGCAGCGGG CGCTGACGAT CATGGCGGCG
GTCGAGCGCG GCGAGGGCGG AGACGAGGCC GTCGTGCGCG GCTGGCTGGC GCGGGCGCTG
ACGGCGCCGC GCGGCCCGCA ATGGTGCTGC GACAACTGCC AGACGGTCCA TGCGACCTGG
GCGCCGATCT GCGACAACTG CGGCGGCTTC GACACGCTGA GCTGGCGCGA ACCCACGCAG
AAGTCGACGC CGAGCGCCAC GGGAACCGAG CTTCTGCCGC TGATCGTGGG CGCGCCCGCG
GCTCGGCCGG CCGATCCCAT GGACGAGGAC GTGATCGACG AAAAAGCGGT TGAGCCCACC
TCAAAATAG
 
Protein sequence
MLWSLIKILI FVALVVALTF GASQLMESGG ALRLAVGDLE LNLGPLQAVI AALLLILAVW 
LFLKIVGFLF AVLRFLNGDE TAVSRYFDRS REQKGLRALS EGMMALAAGE PRTAMSRAAK
ARKYLGQNAM TTLLNAQAAQ QAGDSRRAQE SYKLLLEDER TRFVGVRGLL KQKLDEGDTE
TALALAQKAF EINPKHSETQ DILLKLQADL HDWSGARSTL SAKMKSGALP KAVYKRRDAV
LALQTAKDVF DENASIEARE AAILANKQSP DLIPAAAMAA RSYLAQGNKK YATRVLKKAW
EAEPHPDLAA AFAEIEPDET PAERLKRFRT LTAIHPDHDE TRMLIAELSL AAEDFPGARR
ALGDIVARHP TQRALTIMAA VERGEGGDEA VVRGWLARAL TAPRGPQWCC DNCQTVHATW
APICDNCGGF DTLSWREPTQ KSTPSATGTE LLPLIVGAPA ARPADPMDED VIDEKAVEPT
SK