Gene RPC_3464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3464 
Symbol 
ID3971749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3847802 
End bp3850009 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content67% 
IMG OID637926575 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_533323 
Protein GI90424953 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCATG ACCGCGCCAT CCACGACCGG ACCACGCAGG CCAAATCCGC TCCCACGCTG 
TCGCGCCGCG CGCTGCTGCA GGCCGGCGCT GGCCTGGCGA TCGGCGTCTA TATCGCCGGC
CGCACGCCGT CGTTCGCGCA GAACCGGCCG GCGCCGCAAA GCGTCAACAT CGTGCCGAAC
ACCTTTTTGA TCATCGCGCC GGATGACACC GTGACGGTGC TGTGCAAGGC GATCGAATTC
GGCCAGGGGC CGTTCACCGG CTTTGCCACG CTGGTCGCCG AAGAACTCGA CGCCGACTGG
TCGCAGATGC GCGCCGCGCA CGCGCCGTCC AATCCGGCGC TGTACAAAAA TCTGTTGTTC
GGCGTGCAGG GCACCGGCGG CTCCAGCGCG ATCGCCAATT CCTTCGAGCA GATGCGCAAG
GTCGGCGCCG CCGCGCGGCA GATGTTGGTC GCCACGGCGG CCGAGCGCTG GCGGGTCGAG
GCTTCCGAGA TCAGCGTCGA GAACGGCGTG ATCAAGCACG CGTCCGGCAA GCAGGGCCGG
TTCGGCGAAT TCGCAACTGA AGCGATGCAG CGGCCGGTGC CGGGCGATCC GAAGTTGAAG
GATCCGGCCG ACTTCAAGCT GATCGGCAAA GAGGGCGCGG TGCAGCGGCT CGACAGTGCC
GCGAAATCGA ACGGCTCGGC GATCTTCACG CTCGACCTCG ACGAGCCCGA CATGCTCACC
GTGCTGATCG CGCGCTCGCC GAAATTCGGT GGCGTGGTGA GTTCGTTCGA TGCCGGCGCC
GCCAAGCAGA TCGCCGGCGT GGTCGACGTC AAACAGGTGC CGACCGGCGT CGCGGTCTAT
GCCAAGGGTT TTTGGCCGGC GAAGACTGGT CGCGACGCGT TGAAGATCGT CTGGGACGAC
AGCAAGGCGG AGCAGCGCGG CACGCCCGAG ATTCTGTCGC AGTTTCGCGC GCTGGCGAAG
ACGCCGGGCA AGACCGTGAA GCAGCACGGC GACGTCGACG CCGAATTCGC CAAGGGCGGC
CGGCTGATCG AAGCGGAATA TGTGTTTCCC TATCTGGCGC ATGCGGCGAT GGAGCCGCTC
AACGGCTTCA TCAAATGGGA CGGCGACACC GCGCTGGCGC GCTACGGCTG CCAGTTTCCC
ACCCCCGATC ACGGCGCCAT TGCGCAGGTC CTAGGGATCG GCGTCGACAA GGTGAAGCTG
GAAGTGCTGC TGGCCGGCGG CAGCTTCGGC CGCCGCGCGC AGCAGACCGT GCACGCCGCG
ATTGAACTCG CCGAAGTCGC CAAGGCGATC GGGCCGGGCA AGCCGCTGAA ACTGGTGTGG
ACCCGCGAGG ACGACATGCG CGGCGGTTAT TACCGGCCGT TCGGCGTGCA TCGGATGCGC
GGCGTGGTGC GCGACGGCAA GATCGAGGGC TGGACCGACA CCATCGTCGG GCAGTCGATC
ATGAAGGGCA CGCCGTTCGA GGCGATGACC TTCAAGGACG GCATGGATTC CACCACCTAT
GAGGGCTCCA ACGAGATCCC CTACGAGGTG GCGAATTTCC GCTGCGATCT GCATCAGGTC
GATGTCGGCG TCCCGGTGCT GTGGTGGCGC TCGGTCGGCC ACACCCACAC CGGCTACGCG
GTCGAAGCTT TTATCGACGA GTTGCTGGAG GTCGCCGGGC AGGACCCGGT CGACGGCCGG
CTGGCGCTGA TGGGCGATCG GAAGCCGCGG CATGCTGGCG TGCTGAAGGC GGTCGCCGAA
TTGGCGAACT GGAAGGGCGC CAAGATTGAA GCCGGACGCG CCCGCGGCGT CGCGGTGGTC
GAGAGCTTCA ACACTTTCGT GGCGCAGGTG GTCGAGCTGT CGATGACCGC GGAGGGGCCG
AAGCTGCACA AGGTGTGGTG CGCGGTGGAT TGCGGCGTCG CGGTCAATCC GGACATCATC
CGCGCCCAGA TGGAGGGCGG CATCGGCTTT GCGCTCGGCC ACATCCTCTA TGCCGAGCAG
ACCATCGAGG CGGGCGCGCC GGTGGCCGGC AATTTCGACA AATATCGCTC GCTGCGCATC
AACGAGATGC CCGAAGTCGA AGTGGTGATC GTCAACTCCG GCGAAAAGCC GACCGGGGTC
GGCGAGCCCG GCGTGCCGCC GCTCGGACCG GCAGTGGCGA ATGCGATGGC GAAACTGGGA
CTGCCGCGGC CGCGGCAATT GCCGATCGTG CCGGGAGCCA CCGCATGA
 
Protein sequence
MIHDRAIHDR TTQAKSAPTL SRRALLQAGA GLAIGVYIAG RTPSFAQNRP APQSVNIVPN 
TFLIIAPDDT VTVLCKAIEF GQGPFTGFAT LVAEELDADW SQMRAAHAPS NPALYKNLLF
GVQGTGGSSA IANSFEQMRK VGAAARQMLV ATAAERWRVE ASEISVENGV IKHASGKQGR
FGEFATEAMQ RPVPGDPKLK DPADFKLIGK EGAVQRLDSA AKSNGSAIFT LDLDEPDMLT
VLIARSPKFG GVVSSFDAGA AKQIAGVVDV KQVPTGVAVY AKGFWPAKTG RDALKIVWDD
SKAEQRGTPE ILSQFRALAK TPGKTVKQHG DVDAEFAKGG RLIEAEYVFP YLAHAAMEPL
NGFIKWDGDT ALARYGCQFP TPDHGAIAQV LGIGVDKVKL EVLLAGGSFG RRAQQTVHAA
IELAEVAKAI GPGKPLKLVW TREDDMRGGY YRPFGVHRMR GVVRDGKIEG WTDTIVGQSI
MKGTPFEAMT FKDGMDSTTY EGSNEIPYEV ANFRCDLHQV DVGVPVLWWR SVGHTHTGYA
VEAFIDELLE VAGQDPVDGR LALMGDRKPR HAGVLKAVAE LANWKGAKIE AGRARGVAVV
ESFNTFVAQV VELSMTAEGP KLHKVWCAVD CGVAVNPDII RAQMEGGIGF ALGHILYAEQ
TIEAGAPVAG NFDKYRSLRI NEMPEVEVVI VNSGEKPTGV GEPGVPPLGP AVANAMAKLG
LPRPRQLPIV PGATA