Gene RPB_2533 details

Gene Information

Locus tag: RPB_2533
Symbol:
ID: 3910322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2896598
End bp: 2898997
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 63%
IMG OID: 637884432
Product: hypothetical protein
Protein accession: YP_486149
Protein GI: 86749653
COG category:
COG ID:
TIGRFAM ID:
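
The coordinate and length fields above are related by simple arithmetic, which makes for a quick consistency check. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2896598
END_BP = 2898997


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))                  # 2400
print(protein_length(gene_length(START_BP, END_BP)))  # 799
```

Both results agree with the Gene Length (2400 bp) and Protein Length (799 aa) fields in the record.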


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCTT GGCCGGCCCG CCCCGGCGGC GCGTTCGTGG CGACCTGCGC GCTTCTCAGG 
GAAGCCCGGT CTTCCGGCCG CCTCGCCCAC GCGACCATCG CCTATTGGCC CTGGCGCGAG
GGAGCCAAGC AGGCGGCCCG GTCGATCCTC GTTAATCCGC AGGACATCGC GGACGTCTCC
CTGGCCGTCT GCAACGCTCC GGACACTTCC TGGCAGGAAC GTACCCTGGC GCACATGTCG
CTCTGCATGA TCGAGATGCG CCTTCGCGAT CTGAAGCCCA GGGCTCGGGC GTCTGGCGAG
ATCGTCGTAC GAAATCCGAC CTTGCTTGAG ACAACTTCGG TTTTCGCTCC TGCGAACTCC
AAGGGTGAAG GGGCCTATCT TCCGGACTCC GAGCAAGTGC TTCGCCGAGT TCGGGACTAC
ACATCAATGG GAGAGCCGCA CGCCGGGCTG ACCGAGCATG TGAGAGCTGT GGGTGACCCG
GACTCAACCC CGTTCGCCCT GTTCGGTCTT CCACCGGCGA CGGCGCCCGA AGGCCTCGCT
CGCTATCTGG GAGCTCCACG CATCAAGGAC TTGGGACTCG ATCTTGTTGT CGCCGATCTG
ACCAGCACCG GTCGATCCGA GATCCGTAAC TCCTGGGAAA AGAAGCTGGG CGTCTTGTTG
ACTGCTCTCG GTGCGATCCA GGGTCGGCGC CCGGCGCTAT TCGTCCTGAC GGACGATAGC
TTCACCCATC GGCGCGCATT CAGTCTTCTG CGCATGCATG CCGAGAAGCG GCGTCCGAAG
ATCAGGGCGC AACAACTGGG ATTGTTTCTG GAGAAACCGA CGCTGCGCGG GCCGGCGGCC
GAGCCGCCGA GGGATACTGC GCCGCTCTCC GTCCAGGCGG ACATCAAGGA TGCCGCCCTG
GCTCCTCTTC GACAGGAATT GCTGGCGATC GGCGGCAAGC TGCGCGATTA TGGAGCTGAC
GACGACGCGG ACCAGATCAA GCGTGCCCTG GCTTTTGTCA GACGCTCAGC CTCGCTTCCC
CTCGGCATGC GCGAAGCTCG CGACATCACC GATGTCCTCT ACGACGAAGT CGGAGAGTTC
GACGATGCCC TCAAGAAGCT TTTCCGTCCC AAGATGGCCC TCAGCGACCT CCTCGCGGTC
GGCCTGCGTC AGCCCACGTT CGCACCAGCC ATCGACGCGG TCGTCCAGCA GATCGAACGC
AAAGTCGCGA ATTGGGAAGA GGATACCCCG GTCGCGGCAA AGCTCGCCGA ACTACTCACA
TCCGCCGATA TCAATTCTGG TAAGACCTCG ATCGCACTGC CCGGCCGACG CATCAGCGAG
GTCTACCTGG CTTCCGATCG GGCGGTCCAT TGCAACTGCG CCATTGTCGA CCATCACAGT
CTGCTCGATC ATCTCGAAGG TCAGGATCCG GAGAGGCTGA TCGTCATCGG TCCGACGCCG
GAATCCATCC GCGCCTTGCT GACCGCCCGC AAGGTCCCCA GTACGGTCTA TCTCCTCGGC
GACGCTGCAG GAAGCTCCCT GCTCTCATCC GAATTGGCCG CGATCGAAAC TATTCCGGAA
TTCTCCCAGT TCGCCGTCAG AGCCAAAGCC TTGACGACGG CGCTGCGACG CGGCGGGGCC
GACGAGTCTC TAGATCAAGC GGAGGCAGAA TTCCATGCCG CACCGCTCGT CAAGGAAAGA
GGAGTCGATT TTACGCAATC CGACGGCAGG TATCGCGGTG ACGTCGTTCA TCTCATGATG
CAGAGCGGGA TCCGGCTGGA TTATCGCCCG GGCGGCGAAG TCCTCAAACA GTCCCCGGGC
GAATTGAGAC CATTCGAACG GGCACCCGCG CGGGAGATCA GGAAGGATGA CCGCATCCTC
GTCCTTGATG CTTCGATCCG CGAGCCGCTC CGGCTCGCAC TCGCGACGTC CCGCACCAGT
CAAGCAGGGC TCTGCGTCTA TCATGGCGAG ATCGAAAGGA TCCGCACCAG ACTACCGGGG
GGAACGATAG CCGAAAAGGC GCGGCACGTC CTTGCGATCA TGAAACGGAT CGACGCGACC
GTCGGCGATG AGCAGTACAA CATTCAACGG TGGCTGAGAG CCGACATCGC GCCAGCGACG
GCCATTGGGA CGCGAGCGCC GGGGGCCGCC CGTGACTGGA ATCGCTTCCG GATATTCATG
GAGGCGGTTG GCGTCGATAG CCAGATGGCC GAAGTGTACT GGAAAGCCGC CGTCCTTCCG
ACGCGTTCGT ACCGAGCCCA CGAGGGACAC CAATTCAATC AGCGCGTCGT GAGCTTCGTG
CTGGACAAGG AGGCCGAAGA GGCCTGGAAG ACGAAGCAAG GACTGTGGCA GCAGGTGCTG
GAGTCCGTCG ACGTCGTCAT CGACGTCGAG AAAAAATCCG TCGGAGCGAG CAATGGCTGA
 
Protein sequence
MLSWPARPGG AFVATCALLR EARSSGRLAH ATIAYWPWRE GAKQAARSIL VNPQDIADVS 
LAVCNAPDTS WQERTLAHMS LCMIEMRLRD LKPRARASGE IVVRNPTLLE TTSVFAPANS
KGEGAYLPDS EQVLRRVRDY TSMGEPHAGL TEHVRAVGDP DSTPFALFGL PPATAPEGLA
RYLGAPRIKD LGLDLVVADL TSTGRSEIRN SWEKKLGVLL TALGAIQGRR PALFVLTDDS
FTHRRAFSLL RMHAEKRRPK IRAQQLGLFL EKPTLRGPAA EPPRDTAPLS VQADIKDAAL
APLRQELLAI GGKLRDYGAD DDADQIKRAL AFVRRSASLP LGMREARDIT DVLYDEVGEF
DDALKKLFRP KMALSDLLAV GLRQPTFAPA IDAVVQQIER KVANWEEDTP VAAKLAELLT
SADINSGKTS IALPGRRISE VYLASDRAVH CNCAIVDHHS LLDHLEGQDP ERLIVIGPTP
ESIRALLTAR KVPSTVYLLG DAAGSSLLSS ELAAIETIPE FSQFAVRAKA LTTALRRGGA
DESLDQAEAE FHAAPLVKER GVDFTQSDGR YRGDVVHLMM QSGIRLDYRP GGEVLKQSPG
ELRPFERAPA REIRKDDRIL VLDASIREPL RLALATSRTS QAGLCVYHGE IERIRTRLPG
GTIAEKARHV LAIMKRIDAT VGDEQYNIQR WLRADIAPAT AIGTRAPGAA RDWNRFRIFM
EAVGVDSQMA EVYWKAAVLP TRSYRAHEGH QFNQRVVSFV LDKEAEEAWK TKQGLWQQVL
ESVDVVIDVE KKSVGASNG
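
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be stripped before they can be used for anything. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGCTTTCTT GGCCGGCCCG CCCCGGCGGC GCGTTCGTGG CGACCTGCGC GCTTCTCAGG"

print(len(clean_sequence(first_line)))   # 60
print(f"{gc_fraction(first_line):.0%}")  # GC percentage of this line only
```

Passing all 40 lines of the gene sequence to `gc_fraction` would give the value to compare with the reported GC content.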