Gene RPD_0467 details


Gene Information

Locus tag: RPD_0467
Symbol:
ID: 4020935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 537900
End bp: 539666
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 71%
IMG OID: 637960654
Product: HemY-like
Protein accession: YP_567606
Protein GI: 91974947
COG category: [S] Function unknown
COG ID: [COG3898] Uncharacterized membrane-bound protein
TIGRFAM ID:
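
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(537900, 539666)
print(length_bp)                  # 1767
print(protein_length(length_bp))  # 588
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.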


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.270683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCGCA TCATCCTGTT TCTCGTGATC ATCGCGCTCG CGGCCGCGGG CGCAGCCTGG 
GTGGCCGAAC AGCCCGGCGA TGTCGTGCTG TCGTGGAACG ACTGGCGCGC CGAGATGCGT
CTGCCTGTGT TCGTCCTCGG GCTCGGCGCC GCGATCGTCT CGATCGTGCT CGCCTGGGCG
ATCATCACCG GGCTGTGGCG TGCGCCGAGC CGGATGAAGC GCGGCCGCTT CGAACGTCGC
AGCGGCCGGG CCCGCCACGC CATCACCCAG GGACTGCTCG CGGTCGGCCA TGGCGACGCC
GCGGCGGCGC GCAACCACGC CAGCGCCGCG CGGCGGCACG CGCCGCACGA TCCGCTGGCG
CTGCTACTGC AGGCGCAATC GGCGCAGCTC GAAGGCGACC GCGACGGCGC CCGCCGCGCT
TTCCTGGCGA TGGCCGGGCG CGACGACACC AAATCGCTCG GGATGCGCGG CCTCTATATC
GAGGCGCAGC GCGCCGACGA TCCCTACGGC GCGCTGGCGA TCGCCGAGGA AGCGCTGCGG
CTGCAGCCGA ATTCGACCTG GGCGTCGCAG GCGGTGCTCG GCTTCCGCTG CGCCCGCGCC
GACTGGTCCG GGGCGCTGGA TATTCTCGAA ACCAACCTGT CCTCCGGGCT GGTCGACAAG
AAGGTCTTTC GCCGGCAGCG CGCGGTGCTG CTGACGGCGC GCGCGATCGA CCTCGAGGAC
AGCGACGAGA GCCTGGCACG CGACAGCGCG CTCGAGGCCA ACAAGCTGGC GCCGACGCTG
ATCCCGGCCG CGGTGCTGGC CGCGAAATGT CTCGCCGAGA CCCATCAGGT GCGCCGCGCC
ATGAAGGTGA TCGAGGCGGC CTGGCAGGCC CAGCCGCATC CCGATCTGGC CGCCGCCTAT
GCGAATATCA AACCGGGCGA TCCGGCCAAT ATCCGGCTGG CGCGGGTGCA GAACCTGATC
GCCAAGAACC CGGCCGATTT CGAAAGCGCG CTGGCGATCG CCCGCGCCGG GATCGACGCC
GGCGAATTTT CGCGCGCGCG CCGGGCGCTG CAGCCGTTCA TCGACAATCC GACCCAGCGC
GTCGCGATGC TGATGGCCGA GATCGAGCAC GGCGAACGCG GCGACACCGC GAAAGCGCGC
GCCTGGACGC TGCGCGCGGT GCGCGCGCTG CCGGATGCGA TGTGGACCGC CGACGGCTAT
ACATCGGATC ATTGGCGCCC GGTGTCGCCC GTGACCGGCC GGCTCGATGC GTTCCAGTGG
CAGGTGCCGA TCGCCGCGCT GCCGGCGAGG AAGGCGGTGG TGATCGAGGA CAACCCGTTT
CACGACGCCC TGATCGCCTC CTCGGCGACC GAGGCGCTGC CGGCGGCCAA CGCCCATGAT
CCGGTGACGG TGACGATCGA GTCGGTCGTC GAGACCACGG TGGTGGCGCC GAAGCAGGCG
GAGGCAACCG TGGTGACGGT CGAGCCCGAG GCCGCCGCCG ACAAGCCGAA GGACCAGAAG
GGCAGCTCCC GGGGAGGCGC AACCAAGGAT GCGGGCAAGA GCGCGCCGGA AGCGCCGGTC
GCCGCGTCGG AGACCGTGAT CGCGATGCCG TCAACGCCGC TGTTCCATCG CCGCCCGAGC
CAGGCCACGC CGCCGGTGAT CCCGATCGTC CGCGCCCCGG ACGATCCGGG CGTCGACGAA
GAGGCCGCGC CGGGGGATTT TACCGAACAA TCGGCCGCGC CCGCCGGCCA GACCGGCAAC
TGGCGCGGCT ACCGACCGCC GCGATAG
 
Protein sequence
MLRIILFLVI IALAAAGAAW VAEQPGDVVL SWNDWRAEMR LPVFVLGLGA AIVSIVLAWA 
IITGLWRAPS RMKRGRFERR SGRARHAITQ GLLAVGHGDA AAARNHASAA RRHAPHDPLA
LLLQAQSAQL EGDRDGARRA FLAMAGRDDT KSLGMRGLYI EAQRADDPYG ALAIAEEALR
LQPNSTWASQ AVLGFRCARA DWSGALDILE TNLSSGLVDK KVFRRQRAVL LTARAIDLED
SDESLARDSA LEANKLAPTL IPAAVLAAKC LAETHQVRRA MKVIEAAWQA QPHPDLAAAY
ANIKPGDPAN IRLARVQNLI AKNPADFESA LAIARAGIDA GEFSRARRAL QPFIDNPTQR
VAMLMAEIEH GERGDTAKAR AWTLRAVRAL PDAMWTADGY TSDHWRPVSP VTGRLDAFQW
QVPIAALPAR KAVVIEDNPF HDALIASSAT EALPAANAHD PVTVTIESVV ETTVVAPKQA
EATVVTVEPE AAADKPKDQK GSSRGGATKD AGKSAPEAPV AASETVIAMP STPLFHRRPS
QATPPVIPIV RAPDDPGVDE EAAPGDFTEQ SAAPAGQTGN WRGYRPPR
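
The sequences above are laid out in space-separated blocks of ten residues. A minimal sketch of how such blocks might be joined, checked for GC content, and translated is given below. It assumes plain Python with no external libraries; `join_blocks`, `gc_fraction`, and `translate` are hypothetical helper names. The codon table is the standard genetic code, whose sense-codon assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG).

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
dna = join_blocks("ATGCTGCGCA TCATCCTGTT TCTCGTGATC")
print(translate(dna))  # MLRIILFLVI
```

The printed peptide matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.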