Gene RPC_0078 details


Gene Information

Locus tag: RPC_0078
Symbol:
ID: 3971335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 86951
End bp: 87991
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 67%
IMG OID: 637923194
Product: hypothetical protein
Protein accession: YP_529976
Protein GI: 90421606
COG category: [R] General function prediction only
COG ID: [COG5621] Predicted secreted hydrolase
TIGRFAM ID:
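
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 86951
END_BP = 87991


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # 1041 346
```

Under these assumptions the result agrees with the Gene Length (1041 bp) and Protein Length (346 aa) fields.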


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTGC TGGCGCTCGG CGGCAATGCA CGCGCGCAAG GCTTTGCCGG GCTCGGCCGC 
GACGCCGGCG AATTTGCGCC TGTGCTGCCG GGACGCCAAC TGAGCTTTCC GCTCGATCAT
GGCGCGCATG CGGAGTTTCG CATCGAGTGG TGGTACCTCA CCGCGAATTT GCAGGACGCA
GCAGGCCAAG CCTATGGCGT GCAGTGGACC TTGTTCCGGC AGGCGATGCG GCCGGGCGCG
CAGCAGGAGG GCTGGGCCAA TCAGCAGATC TGGATGGCGC ATGCCGCCCT GACCCGCGCT
GATACGCACC GCAGCGCCGA GCGCTTTGCC CGCGGCGGCA TCGGCCAGGC CGGCGTCACC
GCCACGCCGT TCCGCGCCTG GATCGACAAT TGGCAGATGC AAGGCGGGGA GGCGATGGCG
CCGGCGACAC TGTCGCCGCT CGACCTCACC GCATCGGGCG CGGATTTCAG CTACGCGCTG
CGGCTCGCCG CGCCACAGCC CTTGGTGCTG CAGGGCGACA ACGGCTACAG CAAGAAATCC
GAGCGCGGCC AGGCGTCGTA TTACTACAGC CAACCGTATT TTGCAGCGAC CGGCAGCATC
ACGCTCGACG GCAACGCGGT CGAAGTCAAC GGCCAAGCGT GGATGGACCG CGAATGGTCG
AGCCAGCCGC TGGCCTCCGA CCAGACCGGC TGGGACTGGT TCTCGCTGCA TCTCGACAGC
GGCGACAAGG TGATGCTGTT CCGGCTGCGG CAGAGCGACG GCGCGAATTA TTTCGCCGGC
AACTGGATCG GCACCGACGG CCAATCGGTG CAGCTTGCGC CCGACGCGAT CGCCATTACC
CCGACCGGCT TGACGCAGAT CGGCAAGCGC CAACTGCCGA CCTCGTGGCG GATCGCGATC
GCGCCGCGCG GGCTTGCGAT CGACACCACG CCGCTGAACG CGCAGAGCTG GATGGGCACC
AGCTTTCCCT ATTGGGAGGG GCCGATCGCG CTCCGCGGCA GTCACGCCGG CGTCGGTTAT
CTTGAGATGA CGGGCTATTG A
 
Protein sequence
MALLALGGNA RAQGFAGLGR DAGEFAPVLP GRQLSFPLDH GAHAEFRIEW WYLTANLQDA 
AGQAYGVQWT LFRQAMRPGA QQEGWANQQI WMAHAALTRA DTHRSAERFA RGGIGQAGVT
ATPFRAWIDN WQMQGGEAMA PATLSPLDLT ASGADFSYAL RLAAPQPLVL QGDNGYSKKS
ERGQASYYYS QPYFAATGSI TLDGNAVEVN GQAWMDREWS SQPLASDQTG WDWFSLHLDS
GDKVMLFRLR QSDGANYFAG NWIGTDGQSV QLAPDAIAIT PTGLTQIGKR QLPTSWRIAI
APRGLAIDTT PLNAQSWMGT SFPYWEGPIA LRGSHAGVGY LEMTGY