Gene RPC_4899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4899 
Symbol 
ID3973722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5468884 
End bp5470227 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content67% 
IMG OID637928012 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_534740 
Protein GI90426370 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.569758 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCC AATCTGTCAT TGCCGGCTGT CTCACCGCGC TAACGCTGCC GCTCGCCGCC 
TGCAATGAGC AGGCCACCGT CACCGGCGGC GAGGATTTCG GCCCCTCGCC GACGCTGGTG
GCGCCGAAGT CGTCGCTGAT CCCCACCATC AACATCGCCA CCGTCAACCG CTGGTCCGAC
GGCGCCAAGC CGACGCCCGC GAGCGGCATG GCGGTGAACG CCTTCGCCAC CGGGCTCGAC
CATCCGCGCC AGCTCTACGT GCTGCCGAAC GGCGACGTGC TGGTCGCCGA GAGCAACGCG
CCGCCGAAGC CGGAGGACGG CAAGGGCATC CGCGCCTGGG CGCAGCGGCT GTTCCAGAAA
CGCGCCGGCG CGGTGACGCC CAGCGCCAAC CGCATCACCC TGTTGCGCGA TGCCGACGGC
GACGGTACAG CCGAAACCCG CAGCGCATTT CTCGAGGGAC TAAAGTCGCC GTTCGGCATG
GTGCTGGTCG GCGACGAGCT TTATGTCGCC AACGCCGATG CGGTGGTGAA ATTTCCCTAT
CGCAGCGGCG ACACCAGGAT CACCGCTGCC GCGGTGAAGG TCGCCGATCT GCCCGGCGGG
CCAATCAATC ATCACTGGAC CAAGGACATC ACCGCGAGCG CCGACGGAGC CAAGCTCTAC
GCCACCGTCG GCTCCAACAG CAACGTCGGC GAGAACGGCA TCGAAGCGGA AACCGACCGT
GCCGCGGTTC TGGAAATCGA CCGCGTCAGC GGCGCAAAGC GAGTGTTCGC CTCGGGCTTG
CGCAATCCGA ACAGTCCGTC CTGGCAGCCG CAGAGCGGCG CGCTGTGGGT CACGGTGAAC
GAACGCGACG AGATCGGCAG CGATCTGGTG CCGGACTACA TGACCTCGGT GCAGGACGGC
GGCTTCTACG GCTGGCCGTA CAGTTACTAC GGCCAGCACG TCGATGTGCG GGTCGAACCG
CAGCGGCCGG ATCTGGTGGC CAAGGCGATC GCGCCGGACT ACGCGCTCGG CGCCCATACC
GCCTCGCTGG GGCTGACCTT CAACACCGGC GAATTGTTTC CGGAAGCCAT GAAGGGCGGC
GCCTTCGTCG GCCAGCACGG CTCGTGGAAT CGCAACCCGC TTTCCGGTTA CAAGGTGATC
TTCGTGCCGT TCAAGGACGG CAAGCCCGCG GGCAAACCGC AGGACGTGCT GACCGGCTTT
CTCAACGACA AGGACGAAGC CCAGGGCCGC CCGGTCGGGG TCAAGATCGA CAAGCGCGGC
GCGCTATTGG TCGCCGACGA CGTCGGCAAT GTGGTGTGGC GGGTGACGCC GGACAGCGCG
CCGAAGGCGG CAGAGGCGAA GTAA
 
Protein sequence
MRRQSVIAGC LTALTLPLAA CNEQATVTGG EDFGPSPTLV APKSSLIPTI NIATVNRWSD 
GAKPTPASGM AVNAFATGLD HPRQLYVLPN GDVLVAESNA PPKPEDGKGI RAWAQRLFQK
RAGAVTPSAN RITLLRDADG DGTAETRSAF LEGLKSPFGM VLVGDELYVA NADAVVKFPY
RSGDTRITAA AVKVADLPGG PINHHWTKDI TASADGAKLY ATVGSNSNVG ENGIEAETDR
AAVLEIDRVS GAKRVFASGL RNPNSPSWQP QSGALWVTVN ERDEIGSDLV PDYMTSVQDG
GFYGWPYSYY GQHVDVRVEP QRPDLVAKAI APDYALGAHT ASLGLTFNTG ELFPEAMKGG
AFVGQHGSWN RNPLSGYKVI FVPFKDGKPA GKPQDVLTGF LNDKDEAQGR PVGVKIDKRG
ALLVADDVGN VVWRVTPDSA PKAAEAK