Gene RPC_4047 details

Gene Information

Locus tag: RPC_4047
Symbol:
ID: 3969296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4496181
End bp: 4497761
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 66%
IMG OID: 637927151
Product: hypothetical protein
Protein accession: YP_533892
Protein GI: 90425522
COG category: [S] Function unknown
COG ID: [COG2187] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
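
The gene and protein lengths in this record can be cross-checked from the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4496181, 4497761)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both values agree with the Gene Length and Protein Length fields above.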


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.237674
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTACCGGT CCATGCCTTT TTCCTGCGCA CCCGACGATC TGCAACAGAA AGTCTTCGCC 
TTCTTGGCGA ATTCGGCGAA CCATGGCGAT CGACCGGTGC ATGTCGTCAC CACGCATGGC
GCAGCGGTGT TTCTGGCGGG TGACCGCGCG CTGAAGATCA AGCGCGCGGT GCGGTTTCCC
TATCTGGACT ATTCCACGCT GGACAAGCGC AAAGCCGCTT GTGACCAGGA AGTGAGCATC
AATCGGCTGT TCGCGCCGCA GATCTATCGC GGCGTCGTTC CGATCACGCA GCGGGCCGAC
GGCCGGTTCG AGATCGCCGG AGACGGCCGG GTGGTCGAAT GGGCGATCGA TATGACGCGG
TTCGACGAGC GGCAAACCGT CGATCTTCTG GCCGAAGCCG CGCCGCCGGA AACCGCGCTG
CTGCTCGACA TCGCCGAGGT CATCGCAGCC TCGCACGCGG CGGCGCCGAT CGTCGCACGC
GGTCCTTGGA TCGATTCTAT CTTGCGGATC GTTGCCGGTA ACACCAAAGC GTTCCGCGCC
GGCGGCTTCG ACGGAGCGGC GATCGCGGCG CTGGATGCCG CCAGCCGCGC CGGCTTTGCG
CGGCTGCAAC CGCTGCTCGA TCGACGCGGC GAGCAGGGGT ACGTGCGACG CTGCCACGGC
GACCTGCACC TGGCGAACAT CGTGGTGATC GACGGCAAGC CGGTGCTGTT CGACGCCATC
GAATTCGATC CCTCGATCGC GTCGACCGAT GTGCTGTACG ATCTCGCCTT CGTGGTGATG
GATTTCATCC ACTACGACCG CGCAGCGGCC GCCAGCGTTG TTCTCAACCG TTACCTCGCC
ATCACCTCCG ACCAGCATCT CGACGCGCTG TCGGCGCTTC CCTTGCTGAT GTCGATGCGC
GCGGCGATCC GCGCCAATGT GATGCTGTCG CGGCCCGCAC AGGACGCCGC GCATCTGGCC
GAGATCCGAC GCACCGCCGA GAGCTATTTC ACGCTGGCCT GCCGGCTGAT CGCGCCGCCG
CAGCCGCGCT TGATCGCGAT CGGCGGATTG TCGGGGACCG GCAAATCGGT GCTGGCGCGC
AGTCTGGCGA GCACCATCGC GCCGCTGCCG GGCGCGATCG TGCTGCGCTC CGACGTGACG
CGCAAGCAGC AGTTCAACGT CAAGGACACC GATCGATTGC CGGCCGAGGC CTATCGGCCG
CAAGTGACCG CTGAGGTTTA TCGGACGCTC TGCCAACGCG CGGCAAGAAT CCTCGCCCAG
GGACACTCGG CGATGGTCGA TGCGGTCTTC GCACGTGAGG ACGAGCGTCG CGCGATCAGT
GAGGTTGCCG AGCGGGCCCA GGTTCCGTTC GATGGCTTGT TTCTGGTCGC TGACTTGGCG
ACCAGAATCG CGCGAGTCAG CAGCAGGATC GGCGACGCCT CCGATGCAAC GGCCGAGATC
GCCAAAGCGC AGCAGGCCTA CGACGCCGGT GTCATCGATT GGACGATCGT CGATGCCGCG
GGCACACCCG ACCAGACGCT GCTGCGGGCA ACCGAGGCGC TCACCACTCC TCGCGCAATA
CCCCGCGGCG CCGGGGCATA A
 
Protein sequence
MYRSMPFSCA PDDLQQKVFA FLANSANHGD RPVHVVTTHG AAVFLAGDRA LKIKRAVRFP 
YLDYSTLDKR KAACDQEVSI NRLFAPQIYR GVVPITQRAD GRFEIAGDGR VVEWAIDMTR
FDERQTVDLL AEAAPPETAL LLDIAEVIAA SHAAAPIVAR GPWIDSILRI VAGNTKAFRA
GGFDGAAIAA LDAASRAGFA RLQPLLDRRG EQGYVRRCHG DLHLANIVVI DGKPVLFDAI
EFDPSIASTD VLYDLAFVVM DFIHYDRAAA ASVVLNRYLA ITSDQHLDAL SALPLLMSMR
AAIRANVMLS RPAQDAAHLA EIRRTAESYF TLACRLIAPP QPRLIAIGGL SGTGKSVLAR
SLASTIAPLP GAIVLRSDVT RKQQFNVKDT DRLPAEAYRP QVTAEVYRTL CQRAARILAQ
GHSAMVDAVF AREDERRAIS EVAERAQVPF DGLFLVADLA TRIARVSSRI GDASDATAEI
AKAQQAYDAG VIDWTIVDAA GTPDQTLLRA TEALTTPRAI PRGAGA
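
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: it assumes the sequence is pasted in the 10-base blocks shown here, contains only A, C, G and T, and is read in frame from its first base. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which codons may act as starts), so the standard table is used. The names `translate` and `gc_content` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGTACCGGT CCATGCCTTT TTCCTGCGCA"
print(translate(first_blocks))  # MYRSMPFSCA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.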