Gene RPB_1703 details

Gene Information

Locus tag: RPB_1703
Symbol:
ID: 3908228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1937547
End bp: 1939223
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 68%
IMG OID: 637883597
Product: hypothetical protein
Protein accession: YP_485322
Protein GI: 86748826
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
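
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1937547
END_BP = 1939223
GENE_LENGTH_BP = 1677
PROTEIN_LENGTH_AA = 558


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coded_residues(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coded_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1677 bp span corresponds to 559 codons, of which 558 encode residues and the last is the stop codon.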


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.603628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCGT CGACGAAGCC GACTAGCTCC AATCCAACAA CGAGACCCAA AATGAGCATC 
ATGACCGGCG GCGAAGCGAT CGTGCAGACC CTCGTCGCGC ACGGCGTCGA CACCGTGTTC
GGCCTGCCCG GCGCGCAGAT CTACGGGTTG TTCGACGGCT TCGCAAAGGC GCAACTCCGC
GTCGTCGGTG CGCGCCATGA GCAGGCCTGC GGCTACATGG CGTTCGGCTA TGCCCGCGCC
TCGGGCCGCC CCGGCGTGTT CAGCGTCGTG CCCGGCCCCG GCGTGCTGAA TGCCGGCGCG
GCGCTGCTCA CCGCCTTCGG CTGCAACGAG CCGGTGCTGT GCCTCACCGG TCAGGTGCCG
AGCGCCTATC TCGGCAAGGG CCGTGGTCAC CTGCATGAGA TGCCGGATCA GCTCGCGACG
TTGCGCAGCT TCATCAAATG GGCCGAGCGC ATGGAGTATC CCGGCAACGC GCCGGCGCTG
GTCGCGCGCG CGTTCCAGGA GATGATGAGC GGCCGGCGCG GCCCGGTGGC GCTGGAAATG
CCCTGGGAGG TGTTCACCCA GCGCGCCGAG ACCGCAGCGG CGATCAAGCT CGATCCGGTC
GTACCGCCGC AGCCCGACCC GGACCGCGTC GCCGCCGCCG CGAAGCTGAT CGCCGCGAGC
AAGACGCCGA TGATCTTCGT CGGCTCCGGT GCGCTCGACG CTGGCGACGA GATCCTCGAA
CTGGCCGAAG CGATCGACGC GCCGGTCGTC GCATTTCGCA GCGGCCGCGG CATTGTCAGT
AACCGGCACG ACCTCGGCCT GACCTTCGCC GCCGCCTATC GGCTGTGGCC GCAGACCGAT
CTGATCATCG GCATCGGCTC GCGGATGGAA CTGCCGACGA CGTTCCGCTG GCCGTTCCGG
CCGGACGGCC AGAAGTCGGT GCGGATCGAC ATCGATCCCG CCGAGATGCG CCGCTTTTCG
CCGGACGCTT CGATCGTCGC CGATGCCAAG GCCGGCACTC GTGCGCTGGT CGACGCGGTG
AGCAAGCGTG GCTACAACAA GACCCAAGGG CGGCGCGCGA CCATTCGCGA GGCGACCGCG
CTCACGCTGG AAGCGATCCA GTCGGTGCAG CCGCAGATGG CCTATTTGAA GATCCTGCGC
GAGGTGCTGC CGGACGACGC CATCGTCACC GACGAGCTGT CGCAGGTTGG ATTCGCCTCG
TGGTACGGCT TCCCGATCTA CCAGCCCCGC ACCTTTCTCA CATCGGGCTA TCAGGGCACG
CTCGGCTCCG GCTTCCCGAC CGCGCTCGGC GCCAAGGTCG CCTGCCCCGA CAAGCCGGTC
GTCGCCATCA CCGGCGACGG CGGTTTCATG TTCGCTGTGC AGGAGCTCGC CACCGCGGTG
CAGTTCAACA TCGGCGTGGT GACGCTGGTG TTCGACAATT CGGCCTATGG CAACGTCCGG
CGCGACCAGG TCACCCAGTT CGAAGGCCGC GTGGTGGCGT CCGATCTGGT CAATCCGGAT
TTCGTCAAGC TCGCGGAATC CTTCGGCGTC GCGGCGTCGC GGGTCGGCTC GCCCGATCAC
TTCCGCGCCG CGCTGGAGAA GGCGCTGGCG CATGGCGGGC CGTATCTGAT CGCGATCGAC
GTTCCGCGCG ACAGCGAAGC CAGCCCCTGG CCGTTCATCC ATCCGGCGAA GCCGTGA
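
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could be compared with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence, copied from above.
    first_line = (
        "ATGGCGTCGT CGACGAAGCC GACTAGCTCC "
        "AATCCAACAA CGAGACCCAA AATGAGCATC"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_fraction(first_line), 3))
```

Passing the full sequence text to `gc_fraction` gives the whole-gene value.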
 
Protein sequence
MASSTKPTSS NPTTRPKMSI MTGGEAIVQT LVAHGVDTVF GLPGAQIYGL FDGFAKAQLR 
VVGARHEQAC GYMAFGYARA SGRPGVFSVV PGPGVLNAGA ALLTAFGCNE PVLCLTGQVP
SAYLGKGRGH LHEMPDQLAT LRSFIKWAER MEYPGNAPAL VARAFQEMMS GRRGPVALEM
PWEVFTQRAE TAAAIKLDPV VPPQPDPDRV AAAAKLIAAS KTPMIFVGSG ALDAGDEILE
LAEAIDAPVV AFRSGRGIVS NRHDLGLTFA AAYRLWPQTD LIIGIGSRME LPTTFRWPFR
PDGQKSVRID IDPAEMRRFS PDASIVADAK AGTRALVDAV SKRGYNKTQG RRATIREATA
LTLEAIQSVQ PQMAYLKILR EVLPDDAIVT DELSQVGFAS WYGFPIYQPR TFLTSGYQGT
LGSGFPTALG AKVACPDKPV VAITGDGGFM FAVQELATAV QFNIGVVTLV FDNSAYGNVR
RDQVTQFEGR VVASDLVNPD FVKLAESFGV AASRVGSPDH FRAALEKALA HGGPYLIAID
VPRDSEASPW PFIHPAKP
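
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the displayed sequence is already in coding orientation; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons). The `translate` function is a hypothetical helper, not the database's own tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence shown above.
    print(translate("ATGGCGTCGT CGACGAAGCC G"))  # MASSTKP
```

The first seven codons give MASSTKP, matching the start of the protein sequence listed above.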