Gene RPB_1891 details


Gene Information

Locus tag: RPB_1891
Symbol:
ID: 3907970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2164010
End bp: 2165047
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 67%
IMG OID: 637883785
Product: 6-phosphogluconate dehydrogenase-like protein
Protein accession: YP_485510
Protein GI: 86749014
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1023] Predicted 6-phosphogluconate dehydrogenase
TIGRFAM ID: [TIGR00872] 6-phosphogluconate dehydrogenase (decarboxylating)
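
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2164010
end_bp = 2165047
gene_length_bp = 1038
protein_length_aa = 345

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks print True for this record.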


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTCG GCATGGTCGG CCTCGGACGA ATGGGCGGCA ACATCGTTCG CCGCCTGATG 
AAAGATGGCC ACCACGCCGT GGTGTATGAT CGGGACCCCC AAGCGGTCGA CGCCCTGACG
CGCGAAGGCG CGACGGGCGC CCAGGGCCTG GAAGATCTGG TCCGCAAGCT CGACGCGCCG
CGCGCGGTGT GGGTGATGCT GCCGGCCGGC CACATCACCG AGACCACCAT CGAAGCGCTG
GGCAAGCTGC TGGCGCCGGG TGATGTGATC ATCGATGGCG GCAACACCTT CTGGCAGGAC
GACATCCGCC GCGCCAAAAC GCTCGAGGAA AACAGCATCA GCTACGTCGA CGTCGGCACC
TCCGGCGGCA TCTGGGGCTA TGAGCGCGGC TATTGCATGA TGATCGGCGG CGACAAACCG
GTCTTCGACC GGCTCGATCC GATCTTCGCC ACGCTCGCCC CGGGGATCGG CGACATCCCG
CGCACGCCGG GCCGCGACGA CCGCGACCCG CGCGTCGAGC AGGGCTACAT CCATGCCGGC
CCGGTCGGCG CCGGGCACTT CGTCAAAATG GTGCACAACG GCATCGAATA CGGCCTGATG
CAGGCCTATG CCGAAGGCTT CGACATTCTC AAGAACGCCA ATATCGACGC GCTGCCGAGC
GAGCACCGGT TCGATCTCGA CATCGCCGAC ATCGCCGAAG TGTGGCGGCG CGGCAGCGTG
ATCCCGTCCT GGCTGCTCGA CCTCACCGCC TCCGCGCTCG CGCGCAACGG CGAGCTCGAC
ACTTACTCCG GCTTCGTCGA GGATTCCGGC GAGGGCCGCT GGACCATCAA CGCGGCAATC
GAGGAAGCCG TGCCCGCCGA AGTGCTGACC TCGGCGCTGT ATGCGCGCTT CCGCTCGCGC
AAGCAGCACA CCTTCGCGGA GAAGATCCTG TCGGCGATGC GCGCGGGATT CGGCGGGCAC
AAGGAGCCGC AACAGCACCC GGACGCCGCG CATCAGGCCG CGCCGGAAAT CCTCAAGCCG
AAAGCGGAGC GCGCGTGA
 
Protein sequence
MQLGMVGLGR MGGNIVRRLM KDGHHAVVYD RDPQAVDALT REGATGAQGL EDLVRKLDAP 
RAVWVMLPAG HITETTIEAL GKLLAPGDVI IDGGNTFWQD DIRRAKTLEE NSISYVDVGT
SGGIWGYERG YCMMIGGDKP VFDRLDPIFA TLAPGIGDIP RTPGRDDRDP RVEQGYIHAG
PVGAGHFVKM VHNGIEYGLM QAYAEGFDIL KNANIDALPS EHRFDLDIAD IAEVWRRGSV
IPSWLLDLTA SALARNGELD TYSGFVEDSG EGRWTINAAI EEAVPAEVLT SALYARFRSR
KQHTFAEKIL SAMRAGFGGH KEPQQHPDAA HQAAPEILKP KAERA
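
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first printed line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, as printed on the page.
first_line = "ATGCAGCTCG GCATGGTCGG CCTCGGACGA ATGGGCGGCA ACATCGTTCG CCGCCTGATG"
dna = clean_sequence(first_line)

print(len(dna))                    # number of bases after joining the blocks
print(dna[:3])                     # first codon
print(round(gc_fraction(dna), 2))  # GC fraction of this line only
```

Passing the complete gene sequence to `clean_sequence` instead of a single line gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.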