Gene RPB_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2034 
Symbol 
ID3909849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2313277 
End bp2314794 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content65% 
IMG OID637883927 
Producthypothetical protein 
Protein accessionYP_485652 
Protein GI86749156 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.32397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.23521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATC TGTTCTCCAA TCTCGCGCTC GGCTTCCAGG TCGCGGCCTC GCCGATGAAT 
CTCGGGCTCT GTCTCGTCGG CGCCCTGGTC GGCACGCTGA TCGGCGTGCT GCCCGGCATC
GGCACCATCG CCACCGTGGC GATGCTGCTT CCGATCACCT TCGGCCTGCC GCCGATCGGC
GCGCTGATCA TGCTCGCCGG TATCTATTAC GGCGCGCAAT ATGGCGGCTC GACCACATCG
ATCCTGGTCA ACATTCCGGG CGAGGCGACC TCGGTGGTCA CGACCCTCGA CGGCTTCCAG
ATGGCCAAGC AGGGCAGGGC CGGTCCGGCG CTGGCGATCG CTGCGATCGG CTCCTTCGCC
GCCGGCTGTT TCGCCACCGT GCTGATCGCG GTGCTGGGCG CGCCGCTGAC CAAGCTGGCG
CTGGAGTTCG GGCCGGCAGA ATATTTTTCC CTGATGGTGC TCGGTTTGAT CTTCGCGGTG
GTGCTGGCGA AGGGGTCGGT GCTCAAGGCG GTCGCGATGA TCGCGCTCGG CCTGCTGCTG
TCGATGATCG GCTCCGACAT CGAAACCGGC GCGTCCCGCA TGACCTTCGG CATTCCCGAA
CTCGCCGACG GTCTGGGCTT CGCCACCGTA GCGATGGGGG TGTTCGGCTT CGCCGAGATC
ATTCGCAACC TCGACGGCGG CACCGAGGCC GATCGGCAAT TGGTGCAGCA GAAGATCACC
GGCCTGATGC CGACCCGGAA GGATCTGCGC GACGCCGCGC CCGCGATCGC CCGCGGGACC
GTGCTCGGCT CGATTCTCGG CATCCTGCCG GGCGGCGGCG CCGTGATCGC GTCGTTCGCG
GCCTATACGC TGGAAAAGAA GATCTCCCGG ACGCCCTACC GGTTCGGCCG GGGCGCGATC
GAAGGCGTGG CGGGGCCGGA AAGCGCCAAC AATGCCGCTG CGCAAACGTC TTTCATCCCG
CTGCTGACGC TCGGCATTCC GCCGAACGCG GTGATGGCGC TGATGGTCGG CGCAATGACC
ATTCACGGCA TCGTGCCGGG ACCGCAGGTG ATGCAGAATC AGCCGGAACT GGTGTGGGGC
ATGATCGCCT CGATGTGGAT CGGCAATCTG ATGCTGCTGA TCATCAACCT GCCGCTGGTC
GGAGTCTGGG TACGGTTGCT GCGCGTGCCG TACCGGCTGA TGTTTCCGGC GATCGTGGTA
TTCTGCGCCA TCGGCATCTA TTCGGTGAAC AATGCGCCGA TCGACGTCGT GATGGCGGGT
ATTTTCGGAC TGATCGGCTA TTGGCTGGTC AAGCACGATT TCGAACCGGC TCCGCTGCTG
CTCGGAATGG TGCTCGGACC GCTGATGGAG GAGAATCTGC GCCGGGCGCT GCTGATTTCG
CGCGGCGACG CGACGATCTT CGTGACCCAG CCGCTGTCGG CGACGTTGCT CGCGGTAGCC
GCAGGACTTC TGGTGCTCGC GGTGCTTCCG TCGCTGCGCA GCAAGCGCGA CGAGGTTTTC
GTCGAGTCCG AGAACTGA
 
Protein sequence
MLDLFSNLAL GFQVAASPMN LGLCLVGALV GTLIGVLPGI GTIATVAMLL PITFGLPPIG 
ALIMLAGIYY GAQYGGSTTS ILVNIPGEAT SVVTTLDGFQ MAKQGRAGPA LAIAAIGSFA
AGCFATVLIA VLGAPLTKLA LEFGPAEYFS LMVLGLIFAV VLAKGSVLKA VAMIALGLLL
SMIGSDIETG ASRMTFGIPE LADGLGFATV AMGVFGFAEI IRNLDGGTEA DRQLVQQKIT
GLMPTRKDLR DAAPAIARGT VLGSILGILP GGGAVIASFA AYTLEKKISR TPYRFGRGAI
EGVAGPESAN NAAAQTSFIP LLTLGIPPNA VMALMVGAMT IHGIVPGPQV MQNQPELVWG
MIASMWIGNL MLLIINLPLV GVWVRLLRVP YRLMFPAIVV FCAIGIYSVN NAPIDVVMAG
IFGLIGYWLV KHDFEPAPLL LGMVLGPLME ENLRRALLIS RGDATIFVTQ PLSATLLAVA
AGLLVLAVLP SLRSKRDEVF VESEN