Gene RPB_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3050 
Symbol 
ID3910851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3476888 
End bp3479014 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content62% 
IMG OID637884958 
Producthypothetical protein 
Protein accessionYP_486663 
Protein GI86750167 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGCAG GCGCAACCCG AGGTTCCGGC GGCCCCGCCC TTGCTCGACA TCTCATGTCC 
CGCAAAGGCG GCCAAACGGT TGAAGTCATG CCCGCACGCG GGCTGGCGGC GGAGCATCTG
CGCGATCAGC TTCGGGAGCT TGTAGCGTCA TCCGCTCACG GCCGGACGGA TCGTCCGGTG
CACCACGTGC ATATCGATCC TCCGCCGGAC TGCTCCGATC CGGACGCGGT TATCGCGACA
TTTCTCGATC GTTACGAAGC CGAATTCGGG CTACAAAACT CGCAGCGCGC GGGTGTTTAT
CACGTCAAGG CCGGTCGCAA ACACGCTCAC GTCGTCTGGT CTCTCGTTCG TGACGACGGC
TCCGTTGTCT CGCTCGCGCA CGATCACGCG CGAAGAGAAA AAGTCAGCAG AATCGTGGAG
TTTGAATGCG GTTTGCCGTT CACGAAGGGC AAACACAATC GCAGTGCGGC GGCAGCGCTC
CTGAAAGAGG GCCGCGCCGA CGTTTCCGAC GCAATGCTTG CTGCCGGTTT GCTCGATGGT
CGCCGACCGG TGGCGCATTC GACGCCGCGC CAACGCGCCC AAGCCGAGCG CACTGCCGTG
CCGCTCGACG AAATCCGGAC GCAAGCTCTC GCAGCGTGGC AAGCATCGGA CGACGCGCGG
TCGTTCGCCG TCGCGCTCCA CTCCTTTGAT TTTTCGGTCG CGACCGGCGA GAGCGGCTAC
GTCCTGATCG ACCGGTCCGG GAGCGTGCAT AGCCTGAATC GCGTCCTCGC TGCAGCTGCA
CGTGCAGAAG GCGTGGAAAA GATCACCTCC GCGGCTGTCC GCTCACGCTT GCGCGGCATC
AACTTTCAAA ATGTGGAGGA AGCCAAAAAT GCCCGATCAG AACGACGCAA TACTAACTCA
GGCGACCGGC GCGCGGGAGG CATCGGAGCG CCTGTTACCG CTCCTGAGCC TCCTCGACGA
GCAGACCGGC GAGACGAGCC CGTTGGACGA GCTCAAGGGC TTGTTGCAGG CGATCGTCGA
GATTTTGGGT CATCACACGA GCGCGTTGCA GCGGCTCGAA AGCGCATCCG TGACCACGCC
GCCGCGCAAC GCATAGGCGA CATCGATCTG CAAGACTTAA TTCGTAACAA GGGGGAAGTG
ATGGCAAAGA TAAGAGCTCA GAACTTTAAG GCGAAAATCC TCGCAGAAGT CGCCCCCGCG
GGGTTCAACG CGCATGCATT TTCCGATGAT CTGCGCATGA TTCAGAAACC GACGCCCGCG
CGACCGGCGG CTCGGATTAT GATGATCGAT GGCGGCTGGC TTGAGTACGA TGCAGCCGGG
CGTAGCATCC GAACATGGGG TCCCACCGGC CGCGCGCAAA TACTCGCCGC CGCGCTCGCC
GATAAGCTCG GCGTCGAGCC GGAGCATCTG GCGAAAACCG CATCAGTCGG AGCCGATGTC
GACGCTCTGA AAGTGACAAA AGTGTCGGAA GACACAGTCA ACTCGCTGGT TTTGTGGTGG
ACCGCTCGTG GCTATTCGGC GACCGACGGC CCAGACGGAT GCTGGGTCAC GGCAGGCTAC
GCTCGCATCC GAGACACTGG CGATCAGCTT GAGATTCACG GCGGCCTGAC GGTCAAAGTC
ATCGATGCAA CGATCCTGAA AGCAAAGGAA GCTTGGGGCG GCGGCGTGTA TCTCGACGGC
GACTGGACTC AAGATGAGCA AGATAAGCTC TGGATCGCCG CCCAGCGCGC GGGCGTAAAA
GTCGAAAATT GCCAGCCGTC GCTGTCGATT CAAGATGCGT GGCAGCGGGA GCAAGCCGCA
TCCGCGAAAT CCATAAAAAC GATTTGCGCC GCGCGATCCG CAATCGCAGA AGCTACAGAC
GTTCGCGATG CCGCCGCCGG TGATCTCGAA GCTATGCACC GGCTTCCGCA GCCGTTGCAG
GCCTTCGTCG TCTCACACCT CGACGACGAT CAGCGCAGGC ATCTGTCTGG GCAGTCCGTC
GCCGACATCA CCGCCGCCTT GCCGCGCTTC CGCGACCTCG GTCAGTCTGA ACTCGAGGAA
TACGAGCGCA CCGGCCGCAC GTTCACGCCG CCGAAGCCGC GTCGCGATGA TCGCGACCGC
AACGCCGCGC ACACGTATTC GCAATGA
 
Protein sequence
MIAGATRGSG GPALARHLMS RKGGQTVEVM PARGLAAEHL RDQLRELVAS SAHGRTDRPV 
HHVHIDPPPD CSDPDAVIAT FLDRYEAEFG LQNSQRAGVY HVKAGRKHAH VVWSLVRDDG
SVVSLAHDHA RREKVSRIVE FECGLPFTKG KHNRSAAAAL LKEGRADVSD AMLAAGLLDG
RRPVAHSTPR QRAQAERTAV PLDEIRTQAL AAWQASDDAR SFAVALHSFD FSVATGESGY
VLIDRSGSVH SLNRVLAAAA RAEGVEKITS AAVRSRLRGI NFQNVEEAKN ARSERRNTNS
GDRRAGGIGA PVTAPEPPRR ADRRDEPVGR AQGLVAGDRR DFGSSHERVA AARKRIRDHA
AAQRIGDIDL QDLIRNKGEV MAKIRAQNFK AKILAEVAPA GFNAHAFSDD LRMIQKPTPA
RPAARIMMID GGWLEYDAAG RSIRTWGPTG RAQILAAALA DKLGVEPEHL AKTASVGADV
DALKVTKVSE DTVNSLVLWW TARGYSATDG PDGCWVTAGY ARIRDTGDQL EIHGGLTVKV
IDATILKAKE AWGGGVYLDG DWTQDEQDKL WIAAQRAGVK VENCQPSLSI QDAWQREQAA
SAKSIKTICA ARSAIAEATD VRDAAAGDLE AMHRLPQPLQ AFVVSHLDDD QRRHLSGQSV
ADITAALPRF RDLGQSELEE YERTGRTFTP PKPRRDDRDR NAAHTYSQ