Gene RPC_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0049 
Symbol 
ID3971436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp56432 
End bp58117 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content71% 
IMG OID637923163 
ProductHemY-like 
Protein accessionYP_529947 
Protein GI90421577 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGTA TCATTCTGTT TCTTGTCGTG ATTGCCGCGG CTGCGGCGGG AGCCGGTTGG 
CTTGCCGAGC AGCCGGGCAA TGTGGTGCTG TCCTGGCGCG GCTGGCAGGC CGAGATGACG
CTGGCGGTGG CCGCGCTGGC GCTGCTTTCC GCCATCGTCG CGGTCGCGCT CGGTTGGACG
ATCCTCGCCG GGGTGTTGCG CTCGCCCGGC CGGCTCAGGC GCAGCCGCCG TGCGCGCCGC
GAGGCCCGCG CCCGCCGCGC CATCACCCAG GGGCTGCTCG CGGTCGGCCA TGGCGACGCC
GCCATCGCCC GCGCCCACGC CAATGCCGCC AAGCGGCATG CGCCGCAGGA TCCGCTGGCG
CTGTTGCTGC AGGCGCAATC CGCCCAGCTC GACGGCGATC GTGACGGCGC CAAGCGCGCC
TTCCTGGCGA TGGCCGGGCG CGACGACACC AAGTCGCTCG GTATGCGCGG GCTGTTCATC
GAGGCGCAAC GCGCCGAAGA CCCCTACGCC GCGCTGACCA TCGCCGAAGA GGCGCTGCGG
CTATCGCCGG CGTCCAGTTG GGCCTCGCAG GCGGTGTTGG GCTTCCGTTG CGCCCGCGGC
GACTGGTCCG GGGCGCTGGA AATCCTGGAA ACCAACCTGA CCTCCGGGCT GATCGACAAG
AAGACCTATC GGCGTTTGCG CGGCGTGCTG CTGACCGCGC GGGCGATCGA ATGCGAGGAG
ACCGACGTCA GCCTGTCGCG CGACAGCGCG CTGGAAGCGG TCAAGCTGGC GCCGACACTG
GTGCCGGCCG CGGTGCTGGC CAGCAAATAT CTCAGCGAGG CGCATCAGAT CCGCCGCGCC
ATGAAGACCA TCGAGACCGC GTGGTTGGCG CATCCGCATC CGGATCTGGC CGAGGCCTAT
GCCCACATCA AGCCGGGCGA TAGCGCGCAG GTGCGGCTGC AACGGGTGGA AGCGCTGGCC
GCCAAGGCGT CCGGCGATAC CGAAGGCGCG GTCGCGGTGG CCCGCGCGGC GATCGACGCC
GGCGACTTCA ATCGCGCCCG CGGCGCGCTG ATGCCGTTCG TCGACGCGCC GACCCAGCGG
GTCGCCATGC TGATGGCCGA GATCGAGCAC ACCGAGCGCA AGGATTCGCG CAGCGCGCGG
GCCTGGACGC TGCGGGCGGT GCGGGCGTTG CGCGACCCGG TGTGGACCGC CGACGGCTGC
GTCTCCGACC GCTGGCGGCC GGTGTCGCCG GTGACCGGCC GGCTCGACGC CTTCCAATGG
ACGACGCCGC TCGCCGAACT GCCGACCAAC AAGGCGGTGG TGCTGGAATC CGATCTGTTC
GACGAGACGC TGATCGAATC GCCGACCGAG GAGGTCACCG AGGGCGTGAC GTCCGAGCCG
GCCACGCCGG AGGTCAGCAA AGCCGAACCG AGCAAGCCGG AGACCTCCGG CACCCCGGTC
GAAGTGGTGA TGGAGAGCAA GCCCGCGGCC GAAGCGCCGG TGGTCGGCCC CACCGAATCA
TCGCCGCCGC TGTTCCATCG GCCGCAGCGC TCCGCCGCAG CGCCAGTGAT CCCGATCGTC
CGCGCCCCCG ACGACCCGGG GATCGATGAA GACGAGGCCG CGACCGGCGA TTTCGACGAC
AAGGCCAATC CGGCTGCCAG CCAGGCCGGC AATTGGCGGG GCTATCGGCC GCGCCGCGAC
AATTGA
 
Protein sequence
MLRIILFLVV IAAAAAGAGW LAEQPGNVVL SWRGWQAEMT LAVAALALLS AIVAVALGWT 
ILAGVLRSPG RLRRSRRARR EARARRAITQ GLLAVGHGDA AIARAHANAA KRHAPQDPLA
LLLQAQSAQL DGDRDGAKRA FLAMAGRDDT KSLGMRGLFI EAQRAEDPYA ALTIAEEALR
LSPASSWASQ AVLGFRCARG DWSGALEILE TNLTSGLIDK KTYRRLRGVL LTARAIECEE
TDVSLSRDSA LEAVKLAPTL VPAAVLASKY LSEAHQIRRA MKTIETAWLA HPHPDLAEAY
AHIKPGDSAQ VRLQRVEALA AKASGDTEGA VAVARAAIDA GDFNRARGAL MPFVDAPTQR
VAMLMAEIEH TERKDSRSAR AWTLRAVRAL RDPVWTADGC VSDRWRPVSP VTGRLDAFQW
TTPLAELPTN KAVVLESDLF DETLIESPTE EVTEGVTSEP ATPEVSKAEP SKPETSGTPV
EVVMESKPAA EAPVVGPTES SPPLFHRPQR SAAAPVIPIV RAPDDPGIDE DEAATGDFDD
KANPAASQAG NWRGYRPRRD N