Gene RPB_4511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4511 
Symbol 
ID3912327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5097853 
End bp5100096 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content69% 
IMG OID637886414 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_488105 
Protein GI86751609 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.850162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.518502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCG GCCAGCCCGA TCCGCTGATC CTCAGCAAGC ACGGCCAGAT CGGCGCCTCG 
GTGCCGCGCG TCGACGGTCC GCTGAAAGTC CGCGGCGCGG CACCTTTCGC GGCCGAGTTC
GCGCTCGACG GCATGGTCTA TGCCGCGCTG AAATTCAGCA CCGTGCCGAA AGGCCGGATC
GCGTCGCTCG AGACCACGCA GGCGGAGGCG GCCCCCGGCG TCGTCGCGGT GATGACCCAT
CGCAACGCCC CGCGCATGGC GCCGATGCCG ATGTTCATGA CCGCCGAAAA AGCCGGCGGG
AGCGACGAAC TGCCGATCAT GCAGGACGAT CGGATCTACT GGAACGGGCA GCCGGTCGCG
GTGGTGGTCG CCGAGACTCA GGAGCAGGCG GACTACGCCG TTTCGCTGAT CCGCGCCACT
TACGAATCCG ACGCGGCCAT CACCAGCTTC GCGACCGCCA AACGCAAAGG CACCGAGCCT
GCCCTGTTCA TGGGGCAGCC GCTGAAGGTG GAAAAGGGAT CGGCCGACGC GGCGTTCAAG
GCCTCGGCCG CCAAGGTCGA CGAGACCTAC ACCACGCCGC GGCACAATCA CAATGCGATC
GAGCCGCACG CCGCCACGGT GACGTGGGAC GGCGATCGGC TGATCGTGCA CGATGCCTCG
CAGGCGGTGT CGCACACCGC GTGGTCGCTG GGCCAAGTGT TCGGCATCGC CGAGGATCAG
GTGCGTGTGA CGTCGCCCTT CGTCGGCGGC GGTTTCGGCG GCAAGTGCCT GTGGCAGCAT
CAGGTGCTGG GCGCGGCAGC CTCGAAGCTT GCCGGCCGCC CGGTCCGGAT CGCGCTTTCG
CGCGAGGGCG TCTATCGCCT GATCGGCGGC CGCACGCTGA CCGAGCAGCG CGTCGCCCTC
GGCGCCGATC CGGACGGCCG CTTCAACGCC ATCATCCACA CCGGCGCGGT GGCGATGAGC
AACCACAGCG TGATGCCTGA GCCGTTCATC CTGCCGACGA TGTCGAGCTA TGGTTCGCCG
AACATCAAGC TGGACGTCCA GGTGGCGCGG CTCGACATGC TCGCCAACAC CTTCATGCGG
GCGCCTGGCG AATCCGTCGG CACCTTCGCG CTGGAATCCG CGATCGATGA GCTGGCGGTG
GCGCTCGGCA TGGACCCGGT CGAACTGCGC ATCCTGAACC AGCCGGACGA GGACCCATTG
AAAGGCACGC CGTTCTCGTC GCGACACATC GCCGAGGCGT GGCGCGCCGG CGCCGAGCGG
TTCGGCTGGT CGAAACGCAA CCCGACCGCC GCCAGCGTGC GCGACGGCGA ATGGCTGGTC
GGCACGGGCT GCGCCACCGC GACCTATCCG TATCACCGGA TGCCGGGCGG GGCGGCGCGG
ATCACGCTGA CGCGTGACGG CGCGGCCAAG GTCGAGGTCG CGGCGCACGA GATGGGAATG
GGCACCGCCA CCGCCCACAC CCAGGTCGTT GCCGAACGCC TTGGGCTCAC GCGCGATCAG
GTGAGCTTCG CCTATGGCGA CTCGCTGATG CCCGGCGTCG TGCTCGCCGG CGGCTCGCAG
CAGACCGCCT CGATCGGCGC CTCGGTGATC GCCGCGCATC ACGTGCTGAT CGCCGAGCTG
CTCAAGCTCG CCGGCAACGA CTCGCCGCTG GCGGGGCTGG GCGCCGACGA GGTCGGCACG
GTGAATGGCG GCCTCGCCAA GCTCGACGAT CCGTCGCGGC ACGAGAGCTA CGTCTCGATC
CTCACCCGCT CGGGCCGCGA TCACGTCGCC GTCGAAGGCA GCGCCTCGGC GCCGCTCGAG
ACCCAGCATT GGTCGATGCA TTCGTTCGGC GCGCTGTTCT GCGAGGTCGG CGTCAACAGC
GTCACCGGCG AAGTCCGGGT CCGGCGTTTT CTCGGATCGT ATGATTGCGG CCGCATCCTC
AATCCGAAGA CCGCGGCGAG CCAGTTCCGC GGCGGCATCA TCATGGGCCT CGGCCTGGCG
CTGATGGAGG AGACCCAGCT CGACGACCGC AACGGCCGGG TGATGAATCC GAGCTTCGGC
GATTATCACG TCCCGGTGCA TCTCGATGTG CCGGCGATCG ACGTGATCTG GACCGACATT
CCGGATCCGC GCGCCCCGAT GGGCGCCCGC GGCATCGGCG AGATCGGCAT CACCGGCGTC
GGCGCCGCCG TCGCCAACGC GGTCTTCAAC GCCACCGGCA AGCGCGTGCG CGATCTGCCG
GTCACGCTCG ACAAGTTGCT GTGA
 
Protein sequence
MPGGQPDPLI LSKHGQIGAS VPRVDGPLKV RGAAPFAAEF ALDGMVYAAL KFSTVPKGRI 
ASLETTQAEA APGVVAVMTH RNAPRMAPMP MFMTAEKAGG SDELPIMQDD RIYWNGQPVA
VVVAETQEQA DYAVSLIRAT YESDAAITSF ATAKRKGTEP ALFMGQPLKV EKGSADAAFK
ASAAKVDETY TTPRHNHNAI EPHAATVTWD GDRLIVHDAS QAVSHTAWSL GQVFGIAEDQ
VRVTSPFVGG GFGGKCLWQH QVLGAAASKL AGRPVRIALS REGVYRLIGG RTLTEQRVAL
GADPDGRFNA IIHTGAVAMS NHSVMPEPFI LPTMSSYGSP NIKLDVQVAR LDMLANTFMR
APGESVGTFA LESAIDELAV ALGMDPVELR ILNQPDEDPL KGTPFSSRHI AEAWRAGAER
FGWSKRNPTA ASVRDGEWLV GTGCATATYP YHRMPGGAAR ITLTRDGAAK VEVAAHEMGM
GTATAHTQVV AERLGLTRDQ VSFAYGDSLM PGVVLAGGSQ QTASIGASVI AAHHVLIAEL
LKLAGNDSPL AGLGADEVGT VNGGLAKLDD PSRHESYVSI LTRSGRDHVA VEGSASAPLE
TQHWSMHSFG ALFCEVGVNS VTGEVRVRRF LGSYDCGRIL NPKTAASQFR GGIIMGLGLA
LMEETQLDDR NGRVMNPSFG DYHVPVHLDV PAIDVIWTDI PDPRAPMGAR GIGEIGITGV
GAAVANAVFN ATGKRVRDLP VTLDKLL