Gene RPB_4539 details

Gene Information

Locus tag: RPB_4539
Symbol:
ID: 3912356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5133173
End bp: 5134183
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 67%
IMG OID: 637886443
Product: KpsF/GutQ family protein
Protein accession: YP_488133
Protein GI: 86751637
COG category: [M] Cell wall/membrane/envelope biogenesis
              [T] Signal transduction mechanisms
COG ID: [COG0794] Predicted sugar phosphate isomerase involved in capsule formation
        [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains
TIGRFAM ID: [TIGR00393] KpsF/GutQ family protein
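
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(5133173, 5134183)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Both values agree with the Gene Length and Protein Length fields of this record.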


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.937702
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTTT CAAAACCCCG TACGACAAAA CCCGCGATGA CAGATTCAGC CGCCGCGATC 
CCCTCGGCCC TGCGCACGCT GGAGGCCGAA GCCGACGGCG TCACCGCGCT CGCCGCAGCG
CTGCGATCCG ACCTCGGCAG CGCCTTCGCG GCGGCGATCG AGACCATCCG CAACGCCAAG
GGCCGGCTGA TCATCACCGG GCTCGGCAAA TCCGGACATA TCGGCCGCAA GATCGCCGCG
ACCTTCGCCT CGACCGGCAC GCCGGCGTTC TTCGTCCACG CCTCCGAAGC CAGCCACGGT
GATCTCGGCA TGATCACCGC CGACGACATT ATTCTCGCGA TGTCGTGGTC CGGCGAGCAG
CCGGAAATGA AGAACCTGAT CTCTTACGCC AAGCGGTTCA GGATCGCGCT GATCGCGATG
ACCTCCGACT CGGGCTCGAC GCTGGCCAAG GCCGCGGATA TCTCGCTGAC GCTGCCGAAG
GCGCGCGAGG CCTGCCCGCA CAATCTGGCG CCGACCACCT CGTCGCTGAT GATGCTGGCG
CTCGGCGACG CGATCGCGAT CGCGCTGCTG GAGAGCCGCG GCTTCACCTC GACCGATTTC
AGCGTGCTGC ATCCCGGCGG CAAACTCGGC GCGATGCTGA AATACGCCCG CGACCTGATG
CACACCGGTG ACGCCGTGCC GCTGAAGCCG CTCGGCACCA AGATGTCGGA TGCACTGGTC
GAGATGTCGG CCAAGGGCTT CGGCTGCGTC GGCATCGTCG ACGCCAGCGG CGCCGTCGCC
GGCATCGTCA CCGACGGCGA TCTGCGCCGC CACATGCGCC CCGATCTGAT GACCGCGACC
GTCGACGAGG TGATGACCAA GCGGCCGAAG ACGATCAGCC CCGGCCTGCT CGCCGGCGAG
ACGCTGGAAT TGCTGAACTC CTCGAAGATC ACCGCGCTCC TGGTGACCGA AGGCAACAAG
CCGGTCGGCA TCGTGCATCT GCACGACCTG CTGCGGGCGG GCGTGGCGTA G
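
A GC content figure like the one in the Gene Information block is conventionally the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence printed in space-separated blocks, as it is here; the function name is hypothetical, and the record does not state how its own percentage was calculated or rounded.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()  # drop block spacing and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above, as a small usage example.
print(round(gc_content("ATGGCGCTTT CAAAACCCCG"), 2))
```

Passing the full gene sequence, with its line breaks and spacing left in place, gives the value for the whole CDS.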
 
Protein sequence
MALSKPRTTK PAMTDSAAAI PSALRTLEAE ADGVTALAAA LRSDLGSAFA AAIETIRNAK 
GRLIITGLGK SGHIGRKIAA TFASTGTPAF FVHASEASHG DLGMITADDI ILAMSWSGEQ
PEMKNLISYA KRFRIALIAM TSDSGSTLAK AADISLTLPK AREACPHNLA PTTSSLMMLA
LGDAIAIALL ESRGFTSTDF SVLHPGGKLG AMLKYARDLM HTGDAVPLKP LGTKMSDALV
EMSAKGFGCV GIVDASGAVA GIVTDGDLRR HMRPDLMTAT VDEVMTKRPK TISPGLLAGE
TLELLNSSKI TALLVTEGNK PVGIVHLHDL LRAGVA