Gene RPB_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1886 
Symbol 
ID3908081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2158605 
End bp2160410 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content65% 
IMG OID637883780 
Productglycoside hydrolase 15-like protein 
Protein accessionYP_485505 
Protein GI86749009 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.177282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGGTA GGATCGAAGA CTATGCGCTG ATCGGCGATT GCGAAACCTC GGCACTGGTC 
GGTCGCAATG GATCGATCGA CTGGCTGTGC TGGCCGTCGT TCGATTCCGA CGCCTGCTTC
GCAGCTTTAT TGGGCGATGA AAATCACGGC CACTGGCAGA TCGGCCCGTC GCAGGAGCTC
AGGACGACGA CGCGGCGCTA TCGCGGCGAC AGTCTGATCC TCGAGACGCA ATTCGAGACC
GACACCGGAA CCGTCACGCT GATCGATTTC ATGCCGATCC GCGGCAAGGC GTCGGACATC
GTGCGCCTGG TCCGCGGCGA CGCCGGAAAA GTCGAGATGC GCATGCATCT GGTGCTGCGC
TTCGGCTTCG GCGTCAACAT TCCGTGGGTC AAGCGCGGCG AGAAGCCGGG TGAATTGCTG
GCGATCTGCG GTCCGGACAT GACGGTGCTG CGCTCGCCCG TGCGCACCCA CGGCGAAGGC
CTGACGACGG TCGCTGAATT CGTGGTGAGC GAAGGCGACA CGGTGCCGTT CGTGATGACC
TACGGCGCGT CGCACCTGCC GTTGCCCGGC ACGATCGATC CGATCGCCGC ACTGCGCGAC
ACCGAGGAGT TCTGGTCGGA CTGGACCGGC CAATGCACGT ACACCGGCGA GTATCGCGAT
TTGGTGATGC GGTCGCTGAT CACGCTGAAG GCGCTGACCT TCGCGCCCAC CGGCGGCATC
GTCGCGGCGC CGACGACGTC GCTGCCGGAG CAACTGGGCG GCGCGCGCAA TTGGGACTAT
CGGTTCTGCT GGCTGCGCGA TGCGACCTTC ACGCTGTTCG CGCTGATGAA CAACGGCTAC
ACCGCGGAGG CTGCGGCCTG GCACGGCTGG CTGCTGCGCG CAGCCGCCGG CGCGCCGGCC
AAATTGCAGA TCATGTACTC GATCGACGGC CATCGCCGGC TGCTGGAATG GCAGGCCGAC
TGGCTCCCCG GCTACGAGGG GGCGCAGCCG GTGCGGATCG GCAATGCGGC CCACGCGCAA
CTGCAGCTCG ACGTCTACGG CGAGCTGATC GACGCCTTCC ACCAATGGCG GATGGCTGGG
ATCGAACTCG ACGGCGATTC CTGGGCGATG GAATGCGCCG TACTCAAGCA TCTGTCGACG
ATCTGGAGTC AGCCGGACAG CGGCATCTGG GAGCGCCGCG GCCCGGCCAA ACATTACGTC
TTCTCGAAGG TGATGACCTG GGTCGCTTTC GATCGCGGCA TCAAAAGCGC GGAAAAATTC
GGCCTCAAGG CGCCGCTGGC GGCATGGCGC GCGCTGCGCG ACGAGATTCA TCGCGACGTC
TGCGCCAAGG GCTTCGATGC GAAACAGAAT GCCTTCGTCG ATCACTACGG CGCCGACGTG
CTGGACGCCA GCGTGCTGTT GATCCCCGCG GTCGGCTTCC TGCCGCCGGA CGACCCGCGC
GTGCGCGGCA CGGTCGCGGC GATCGAGGCC CACATGATTC ATGATGGATT CGTGCTGCGC
CACGATCCGC GCGAAACCCC CGACGAACCG CTTCCGGTCG AAGGCGCGTT CCTCGCCTGC
AGCCTGTGGC TGGCCGACGC CTATGTGTTC GACGGCAGGA TCGACCAAGC CAAGGTGCTG
TTCGATCGCG TCGTCGGCAT CGCCAACGAC GTCGGCCTTC TCGCCGAGGA ATATGATTCC
GTTGCCGGCC GGCAGACGGG CAATTTCCCG CAGGCGCTCA CTCACATCGC GCTGATCATC
ACCGCCAATA ATCTCAGCGC GGCGAAGGCC GCAACCGGCA AGCCGGCGGT GCAGCGCTCG
AAATAG
 
Protein sequence
MPGRIEDYAL IGDCETSALV GRNGSIDWLC WPSFDSDACF AALLGDENHG HWQIGPSQEL 
RTTTRRYRGD SLILETQFET DTGTVTLIDF MPIRGKASDI VRLVRGDAGK VEMRMHLVLR
FGFGVNIPWV KRGEKPGELL AICGPDMTVL RSPVRTHGEG LTTVAEFVVS EGDTVPFVMT
YGASHLPLPG TIDPIAALRD TEEFWSDWTG QCTYTGEYRD LVMRSLITLK ALTFAPTGGI
VAAPTTSLPE QLGGARNWDY RFCWLRDATF TLFALMNNGY TAEAAAWHGW LLRAAAGAPA
KLQIMYSIDG HRRLLEWQAD WLPGYEGAQP VRIGNAAHAQ LQLDVYGELI DAFHQWRMAG
IELDGDSWAM ECAVLKHLST IWSQPDSGIW ERRGPAKHYV FSKVMTWVAF DRGIKSAEKF
GLKAPLAAWR ALRDEIHRDV CAKGFDAKQN AFVDHYGADV LDASVLLIPA VGFLPPDDPR
VRGTVAAIEA HMIHDGFVLR HDPRETPDEP LPVEGAFLAC SLWLADAYVF DGRIDQAKVL
FDRVVGIAND VGLLAEEYDS VAGRQTGNFP QALTHIALII TANNLSAAKA ATGKPAVQRS
K