Gene RPB_0795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0795 
Symbol 
ID3909609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp892288 
End bp893631 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content70% 
IMG OID637882687 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_484417 
Protein GI86747921 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.796858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGAT CCTTCACAAC CAGCCGCCTC GCATTCTGCG CCGCCATGGC GCTGCCGCTG 
GCGGCCTGCG GTGACCAGGC CAATTTCACC CAGGGCGAGG ATTACGGCCC GACGCCGAAG
CTGGCCGAGC CGGCCAACAA TCTGCTGCCG ACCGTCAACA TCGCCAAGGC GGTGGGATGG
CCGCAGGGCG CCAAGCCGAA GCCCGCCGAG GGCATGGCGG TGACCGCCTT CGCCACCGGG
CTGGATCATC CGCGCACCCT GCATGTGGCG CCGAATGGCG ACGTGCTGGT GGCCGAGACC
AACGCGCCGG AGCGGCCCGA GCAGGGCAAG GGCCTGAAGC GGATGGCGGC CGAATTCGTG
ATGGGACAGG CCGGCGCCAA GACCCCGAGC GCCAACCGCA TCACGCTGCT GCGCGACGCC
GACGGCGACG GCGTGGCGGA AGTCCGCTCC ACTTTCCTCG ACGGGCTGAA TTCGCCGTTC
GGCATGGCGC TGGTCGGCAA CGACCTCTAC GTCGCCAACA CCGACGCGAT CATGCGCTTC
CCCTACAACC CCGGCGACAC GCGGATCACC GCCAAGGGCG AAAAGCTCGC CGATCTGCCG
GCGGGGCCGC GCAACCATCA CTGGACCAAG GACCTGATCG CCAGCCGCGA CGGCGCTAAG
CTGTACGCGA CCTCCGGCTC CAACAGCAAC GCCGCCGAGC ACGGCATGGA CGAGGAGGTC
AACCGCGCCG CGATCCTCGA AGTCGACCGC GCCACCGGCG CGACGCGGCT GTTCGCCTCC
GGCCTGCGCA ATCCGAACGG CCCGGCCTGG GAGCCCGAGA CCGGCGCGCT GTGGGTCGTC
GTCAACGAGC GCGACGAGAT CGGCAGCGAT CTGGTGCCGG ACTACATGAC CTCGGTGCAG
GACGGCGGCT TCTACGGCTG GCCCTACAGC TATTACGGCC AGAACGTCGA CACCCGCGTC
GAGCCGCCGC GGCCGGACCT GGTGGCGAAG GCGATCGTGC CGGACTACGC GCTCGGCAAC
CACACCGCCT CGCTCGGCCT CGCCTTCAAC ACCGGCGATC TTTTCCCGGC CGAGATGAAA
GGCGGCGCCT TCGTCGGCCA GCACGGCTCG TGGAACCGCA AGCCGCATTC CGGCTACAAG
GTGATCTACG TGCCGTTCCA GGGCGGCAAG CCGTCGGGCC CGCCGCGTGA CGTCCTCACC
CGCTTCCTCA ACGACAAGGA CGAGGCGCAG GGCCGTCCGG TCGGCGTTGC GATCGACGGA
CGCGGCGCCT TGCTGGTCGC CGACGACGTC GGAAAATCGG TGTGGCGCGT CACGCCGGCG
GGGCGCGAGC AAGCGGCGCG GTAG
 
Protein sequence
MRRSFTTSRL AFCAAMALPL AACGDQANFT QGEDYGPTPK LAEPANNLLP TVNIAKAVGW 
PQGAKPKPAE GMAVTAFATG LDHPRTLHVA PNGDVLVAET NAPERPEQGK GLKRMAAEFV
MGQAGAKTPS ANRITLLRDA DGDGVAEVRS TFLDGLNSPF GMALVGNDLY VANTDAIMRF
PYNPGDTRIT AKGEKLADLP AGPRNHHWTK DLIASRDGAK LYATSGSNSN AAEHGMDEEV
NRAAILEVDR ATGATRLFAS GLRNPNGPAW EPETGALWVV VNERDEIGSD LVPDYMTSVQ
DGGFYGWPYS YYGQNVDTRV EPPRPDLVAK AIVPDYALGN HTASLGLAFN TGDLFPAEMK
GGAFVGQHGS WNRKPHSGYK VIYVPFQGGK PSGPPRDVLT RFLNDKDEAQ GRPVGVAIDG
RGALLVADDV GKSVWRVTPA GREQAAR