Gene RPB_1112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1112 
Symbol 
ID3910198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1282739 
End bp1283746 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID637883005 
Producthypothetical protein 
Protein accessionYP_484733 
Protein GI86748237 
COG category[R] General function prediction only 
COG ID[COG4111] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0529668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.319534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AGGCTTTGAG TGGCAACAAG GCTTGGAGCG GCGACAAGCC CCCGATTCCA 
ATCGAGATCG GACTGACCGC GGCGATCGTC GCGATCGAGA ACAACGAGCC GCTGATCCTG
ACGTCGTCGG GCGGCAACGA CCTGATCGGC CTGCCCTACG GGCCATTCGA CGCGATCTCG
CATCGCACGC TCGACATCGG GCTGCGCGCC TGGGTGGAAG AGCAGACCGG ACTGCGTCTC
GGCTATGTCG AACAGCTCTA CACGTTCGGC GATCGCGGCC GCCATGCGCG GGTCGGCGAC
ACCGACGTCC ACGTCGCCTC GATCGGCTAT CTGGCGCTGA CCCGCGCGGT CGACAACGCC
GCCCGTGCGG CCGGCGCGAC GTTCGAGCCC TGGTATCGCT TCTTCCCGTG GGAGGACTGG
CGCCAGCAGC GCCCCGAGAT CATCGCGCGC GACATCATCC CCGAACTCAC CGCCTGGGCC
AGCCAGGCCG AACAGCCCGA CACGACGCGC GCCCTCGCCC GCAAGGATCG CGTCCGGCTG
TATTTCGGCA TCGACGGCGC GCAATGGGAC GAGGAGCGCG TGCTCGACCG CTACGAACTG
CTGTACGAAG CCGGCCTGAT CGAAGAGGCG CGGCGCGACG GGCGGCCCGC TGCGCTCGCG
CGCGCCAAGG TGCCGCCGCT CGGCGTGGCG ATGCGGTTCG ATCACCGCCG GATTCTCGCC
ACAGCGATCG CGCGGCTGCG CGCCAAGCTG AAATACCGGC CGGTGGTGTT TGAACTTCTG
CCGCCGGAGT TCACACTCAC CGAGTTGCAG CATACCGTGG AAGCGATCTC GGGCCGGCAT
CTGCACAAGC AGAATTTCCG GCGGCTGGTC GAAGCCGGCG CGCTGGTCGA ACCGACCGGC
GAGATGTCGA CACGAACAGG CGGACGTCCC GCCGCGTTGT TTCGCTTCCG CCGCGAGGTG
CTGCAGGAGC GCCCCGCGCC CGGCCTGCGC GTGCGCGGTC GGCGCTGA
 
Protein sequence
MTDKALSGNK AWSGDKPPIP IEIGLTAAIV AIENNEPLIL TSSGGNDLIG LPYGPFDAIS 
HRTLDIGLRA WVEEQTGLRL GYVEQLYTFG DRGRHARVGD TDVHVASIGY LALTRAVDNA
ARAAGATFEP WYRFFPWEDW RQQRPEIIAR DIIPELTAWA SQAEQPDTTR ALARKDRVRL
YFGIDGAQWD EERVLDRYEL LYEAGLIEEA RRDGRPAALA RAKVPPLGVA MRFDHRRILA
TAIARLRAKL KYRPVVFELL PPEFTLTELQ HTVEAISGRH LHKQNFRRLV EAGALVEPTG
EMSTRTGGRP AALFRFRREV LQERPAPGLR VRGRR