Gene RPB_4691 details


Gene Information

Locus tag: RPB_4691
Symbol:
ID: 3912509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5307665
End bp: 5309353
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 66%
IMG OID: 637886596
Product: hypothetical protein
Protein accession: YP_488285
Protein GI: 86751789
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3637] Opacity protein and related surface antigens
TIGRFAM ID:
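
The start and end coordinates, the gene length, and the protein length listed above agree with one another if the coordinates are read as inclusive and the coding sequence ends in a single stop codon. The Python sketch below reproduces that arithmetic. The function names are illustrative only and do not belong to any database API, and the inclusive-coordinate convention is an assumption that happens to match the listed values.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(5307665, 5309353)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```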


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTTT TCGCTTCGAC CACGGCCATG CTGATACTGG CCGCCACAAC CGCAGCCGCC 
GCCGATCTGC CGCGGTTGCC TTTGTCCCCG GCCGCGCCAG TGCAGTGGAG CTGGACCGGC
CTGTATTGGG GCGCGCATCT CGGCGGCAGC TTCGGGCAGA CCAGCTTCAG CGATCCCGCC
GGTCCCGGCC TCTATGGCGG CCAGGTCCGC AGCCCGGCGG CGCTTGCCGG CATTCAGCTC
GGCTACAATT ATCAACCCAA CAGGAACTGG CTGATCGGTG TCGAGGCCGA TGTCAGCGCG
ATGAACGGCA ACGGCACGCA GAGCTGCCTG GTCTCCTCCG GCCTGTTCAT CTCGGCGAAC
TGCCGCGTCC GCCAGGACGC ACTGGCCACG CTCACGGGCC GCGCCGGCTT CGTGACCGGC
CCCGGCGGCC GAACGTTGCT CTACGCCAAA GCCGGCGCCG CCTTTCTCAG TGAACGGCTC
GACATCACCA TCGGCAATCC GATCGGATCG TCGACCGAGA GCACTGACGG CCGCTGGGGC
TGGACCGCGG GCGCCGGCAT CGAGCGGGCG CTGGCGCCGG CCTGGTCCGT CAAGTTCGAA
TACGACTACG CGAATTTCGG CAGCCGCGAC ATGGCGACGC CGTCGAGCTA CCGGCTGGTG
CCGGGCGTCG ACTATTTCGC CACGCCGCAG GGCGTCAGCA AGGTCAGCCA GGACCTGCAC
GCCGTGAAGG TCGGCCTCAA TCTCAAATTC GGCGGCGACG TCGACGCCCG CTTCGACGAC
TATCATCTGC GCGGCACGCA GGCGGCGGAC GATCGTGTCG AGCGCGGCGC GGTCGAGGTC
GGCGGCCGGG TCTGGTACTC TTCGGGCCGG TTCCAGAAAG ACCTCGGCGC GACCGTCAAT
CAGGGCCAGC AGAACATCCT GATCTCGCGG CTGACCTATC AGAGCACGGC GGCGTCGGGC
GAAATGTTCG GCCGCGTCGA CGGACCCTAC GACACCTTCC TCAAGGGCTT CGCCGGCGGC
GGTACGCTCG TGAGCGGCAA CATGCATGAC GAGGACTGGA TCGCCAATGA CGGCATCCCG
TATTCGAACA CGCTGCACGA CCCGGTGAAG GGCAGCATCG CCTATGCGAC GCTCGACGTC
GGCTACAATC TGCTGCGCGG GTCGGACTAC AAATTCGGCG GCTTCGTCGG CTACAACTAC
TATCGCGAGA ACAAATCGGC CTATGGCTGC GTGCAGACCG CCGGCGCGAC CGCGTCCCAG
ATCTGCGGCT CGCCGATCTC GAATGCCGTT CTCGCCATGA CCGAAAACGA CACCTGGCAT
TCGCTGCGGG TCGGCTTCAA CGGCGAGCTC GGACTCGGCC GCGGATCGAA ACTCTCCGCC
GACGCGGCCT ATCTGCCCTA TGTGAAGATG TTCGGAACCG ACAATCACGT GATGCGCACC
GACGTCACCG ACACCGTCTC GCCGGAACAG GGAACCGGGC AGGGCGTGCA GCTCGAGGCG
ATCCTGTCGT ATCAGGTCAC GAACGCCTTC AGCGTCGGCG CCGGCGCGCG CTACTGGGCG
ATGTGGGCGA CCACCAACGC CTACACCAAC ATCTTCGGCT CGGAGTGTCC CTGCCAGACC
CTGCCGACGC GTACCGAACG CTACGGAACC TTCCTGCAGG CGGCCTACAA GTTCGATGCG
CCGCGATAG
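
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in 10-base blocks like the one above. The Python sketch below shows one way to do it. The helper name `gc_percent` is hypothetical, and the example call uses only the final short line of the gene sequence, so its result is not the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is ignored, so sequence text printed
    in 10-base blocks can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Last line of the gene sequence only: 6 of its 9 bases are G or C.
print(round(gc_percent("CCGCGATAG"), 1))  # 66.7
```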
 
Protein sequence
MRLFASTTAM LILAATTAAA ADLPRLPLSP AAPVQWSWTG LYWGAHLGGS FGQTSFSDPA 
GPGLYGGQVR SPAALAGIQL GYNYQPNRNW LIGVEADVSA MNGNGTQSCL VSSGLFISAN
CRVRQDALAT LTGRAGFVTG PGGRTLLYAK AGAAFLSERL DITIGNPIGS STESTDGRWG
WTAGAGIERA LAPAWSVKFE YDYANFGSRD MATPSSYRLV PGVDYFATPQ GVSKVSQDLH
AVKVGLNLKF GGDVDARFDD YHLRGTQAAD DRVERGAVEV GGRVWYSSGR FQKDLGATVN
QGQQNILISR LTYQSTAASG EMFGRVDGPY DTFLKGFAGG GTLVSGNMHD EDWIANDGIP
YSNTLHDPVK GSIAYATLDV GYNLLRGSDY KFGGFVGYNY YRENKSAYGC VQTAGATASQ
ICGSPISNAV LAMTENDTWH SLRVGFNGEL GLGRGSKLSA DAAYLPYVKM FGTDNHVMRT
DVTDTVSPEQ GTGQGVQLEA ILSYQVTNAF SVGAGARYWA MWATTNAYTN IFGSECPCQT
LPTRTERYGT FLQAAYKFDA PR
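
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The Python sketch below translates short stretches of the gene sequence to illustrate this. The names `CODON_TABLE` and `translate` are illustrative, and the sketch assumes an ATG start codon, so it does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI translation table 11
# assigns the same amino acids as the standard table; the two differ only
# in which codons may serve as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence codon by codon; '*' marks a stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCGTCTTT TCGCTTCGAC CACGGCCATG"))  # MRLFASTTAM
# Last 9 bases give the final two residues followed by the stop codon.
print(translate("CCGCGATAG"))  # PR*
```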