Gene Rpal_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4156 
Symbol 
ID6411840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4450168 
End bp4451208 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content66% 
IMG OID642714038 
Product6-phosphogluconate dehydrogenase-like protein 
Protein accessionYP_001993127 
Protein GI192292522 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1023] Predicted 6-phosphogluconate dehydrogenase 
TIGRFAM ID[TIGR00872] 6-phosphogluconate dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.685432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATCG GCATGATCGG CCTCGGCCGG ATGGGTGGCA ATATCGTCCG GCGCCTGATG 
AAGGATGGCC ATCATGCCGT GGTGTACGAC AGGGACCCGC AAGCGATCGA GGCGCTGACG
CGCGAAGGCG CAACCGGAGC CGGCGGGCTC GAAGACCTGG TGCGCAAGCT CGACGCGCCG
CGCGCGGTGT GGGTGATGCT GCCCGCCGGA CAGATCACCG AGACCACCAT CGAACAGCTC
GCCAAGCTGC TCGCCGCCGG CGACGTCGTC ATCGATGGCG GCAACACCTT CTGGCAGGAC
GATATCCGCC GCGCCAAGAC GCTGAAGGAA ACCAGCATCG ACTACGTCGA TGTCGGCACC
TCCGGTGGCA TCTGGGGCTT CGAGCGCGGC TACTGCATGA TGATCGGCGG CGACAAAGCC
GTCGTCGACC GGCTCGATCC GATCTTCGCC ACACTGGCGC CGGGCATCGG CGACATCCCG
CGCACGCCGG GCCGCGACGA TCGCGATCCC CGCGTCGAGC AGGGCTATCT GCACGCCGGC
CCGGTTGGCG CCGGCCATTT CGTCAAAATG GTTCACAACG GCATCGAATA CGGCCTGATG
CAGGCCTATG CCGAAGGCTT CGACATTCTC AAGAACGCCA GCAGCGACTC CCTGCCCGAA
GCGCACCGCT TCGATCTCGA CATCGCCGAC ATCGCCGAGG TCTGGCGCCG CGGCAGCGTG
ATCCCGTCCT GGCTGCTCGA CCTGACAGCA ACGGCGCTGG CGAAAAACGA TCAGCTCGAC
AACTACTCGG GCTTCGTCGA GGACTCCGGC GAAGGCCGCT GGACCATCAA CGCCGCGATC
GAAGAAGCGG TGCCGGCCGA AGTGCTCACC GCCGCGCTGT TCGCGCGTTT CCGCTCGCGG
CGGGACCATA CGTTTGCGGA GAAGATTCTC TCGGCGATGC GGGCGGGCTT CGGCGGCCAC
AAAGAGCCGC AGCAGCATCC TGAGCCGGAG CAGCAAGCCG CTCCGCAGCA GAAACTGAAA
CCGAAAGCGG AGCGCGCGTG A
 
Protein sequence
MQIGMIGLGR MGGNIVRRLM KDGHHAVVYD RDPQAIEALT REGATGAGGL EDLVRKLDAP 
RAVWVMLPAG QITETTIEQL AKLLAAGDVV IDGGNTFWQD DIRRAKTLKE TSIDYVDVGT
SGGIWGFERG YCMMIGGDKA VVDRLDPIFA TLAPGIGDIP RTPGRDDRDP RVEQGYLHAG
PVGAGHFVKM VHNGIEYGLM QAYAEGFDIL KNASSDSLPE AHRFDLDIAD IAEVWRRGSV
IPSWLLDLTA TALAKNDQLD NYSGFVEDSG EGRWTINAAI EEAVPAEVLT AALFARFRSR
RDHTFAEKIL SAMRAGFGGH KEPQQHPEPE QQAAPQQKLK PKAERA