Gene RPC_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0471 
Symbol 
ID3970233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp507582 
End bp508844 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content68% 
IMG OID637923587 
ProductCBS 
Protein accessionYP_530365 
Protein GI90421995 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.603908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGATC CCGACCCAGC GCAAGACAAT CCCGTGAGCG ACACGATGCC AAGTAGTAGC 
AGTCTTCCCG CGGTGGTGCA TCAGGGCGAC GTGATGCGGC CGGCCGCCGA CAATTGGCTG
CTGCGGGCGA TCCGCACGCT GTTCGGCTGG AAGCCGGGCT CGGTGCGCGA GGACCTGCAG
GTGGTACTCG ATGCTACCAC GCCGGACGAC ACCGGCTTCT CGGCGATCGA GCGCACCATG
CTGCGCAACA TTCTCGGTCT GCACGAACGG CGGATCGCCG ACGTCATGGT GCACCGCGCC
GACATCGTCG CGATCAAGCA GGACATCACG CTCGGCGAAC TGATGGGGCT GTTCGAGAGC
GCCGCGCATT CCCGCCTGGT GGTTTACAAC GAAACCCTCG ACGATCCGGT CGGCATCGTC
CACATCCGCG ATCTGTTGGC CTACATGACG GCGCGGGCGC GCGACGAACT ACCGAGCAAG
AGCAAGGCGT CGAGCAAGGC GGCGGCCGCT GCGACCCCTG CCGCAGCCGT CAAGACTTCG
CCGGCCAAGA CGTCGCCGGC CAAGACGTCG CCCGGCAAAT CCTCGCGCAA GAAACCCTCG
CCCAATAGCC TCGATCTGCG CGCCATCGAC CTCAAGATCC CGCTCACCGA GACCGGGATC
ATCCGCAAGC TGCTCTACGT GCCCCCCTCG ATGCGGGCGA TCGATCTGTT GGCGCAGATG
CAGGCGTCGC GGATTCATCT GGCGCTGGTG GTCGACGAAT ATGGCGGCAC CGACGGCCTG
GTTTCGATCG AAGACATCGT CGAGCAGATC GTCGGCGAGA TCGACGACGA GCACGACAGC
GCCGAGCCGC CGTCGATCGT GCGGCAGGCC GACAACTCCT TCATCGCCGA CGCCCGCGCC
AGCCTCGAGG ACGTCCGCCA GGTGATCGGC GAGGACTTCG TCACCGGCGA GGCCGGCGAG
GAGGTCGAGA CGCTGGGCGG CTATCTGGTC ACCCATGTCG GACGCCTGCC GGTGCGCGGC
GAAGTGATCT CCGGCCCCGG CAACTACGAG ATCGAAGTGC TCGACGCCGA CCCGCGCCGG
GTCAAGCGGC TGCGCATCGG CGTCCGCAAG GAACGCCCGG CCCCGCGGCA ACGCGAATTG
CGCCGGCGCG ACGCGCCGAA CGAGCCCGGT CCAGCGCAGG GCAACGAGCC CGGTCCGCCT
CAGGGCAACG ACAATGCCAA TGTCGGCCCG GCCACGCCGG GCGACGGAGT CGGCTCGCCG
TGA
 
Protein sequence
MPDPDPAQDN PVSDTMPSSS SLPAVVHQGD VMRPAADNWL LRAIRTLFGW KPGSVREDLQ 
VVLDATTPDD TGFSAIERTM LRNILGLHER RIADVMVHRA DIVAIKQDIT LGELMGLFES
AAHSRLVVYN ETLDDPVGIV HIRDLLAYMT ARARDELPSK SKASSKAAAA ATPAAAVKTS
PAKTSPAKTS PGKSSRKKPS PNSLDLRAID LKIPLTETGI IRKLLYVPPS MRAIDLLAQM
QASRIHLALV VDEYGGTDGL VSIEDIVEQI VGEIDDEHDS AEPPSIVRQA DNSFIADARA
SLEDVRQVIG EDFVTGEAGE EVETLGGYLV THVGRLPVRG EVISGPGNYE IEVLDADPRR
VKRLRIGVRK ERPAPRQREL RRRDAPNEPG PAQGNEPGPP QGNDNANVGP ATPGDGVGSP