Gene RPB_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0017 
Symbol 
ID3910222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp16792 
End bp18438 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content64% 
IMG OID637881898 
Productcbb3-type cytochrome c oxidase subunit I 
Protein accessionYP_483640 
Protein GI86747144 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3278] Cbb3-type cytochrome oxidase, subunit 1 
TIGRFAM ID[TIGR00780] cytochrome c oxidase, cbb3-type, subunit I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAG CTCTCACATC CAAACGGATC ACCTTCGGCG AGGGTGGTCT GACGCTCGTG 
TTCACGCTGA CCACGGCGCT TTGCATCTAC GCGTCGATCT TCGCCAAGGA CGCGCCGTTC
GCCTTCCACG CCGCTCTCGG CGCTGCCGTC AGCGCCTGGG CCGTGTTCGC GATCATCACC
CGCTACAGCC GCCGCACCGG CGTGCCGCCG CAGGAAATCA ACGGCGTGCC GAACTACAAT
CTCGGCCCGA TCAAATTCCT GTCCTTCATG GCGATGTTCT GGGGCATCGC CGGGTTCTCG
GTCGGCCTCT ACATCGCGTT GGAGCTGGCC TATCCGGGGC TCAACGTCGC GCAATGGGTC
AATTTCGGCC GGCTGCGCCC GCTGCACACC TCCGCGGTGG TGTTCGCGTT CGGCGGCAAC
GTGCTGCTGG CGACCTCGTT CTACGTGGTG CAGCGGACGA CGCGGGCGCG GCTCGCGGGC
GATCTGGCGC CGTGGTTCGT GGCGATCGGC TACAACTTCT TCATCGTCAT CGCCGGCACC
GGCTATCTGC TCGGCGTGAC GCAATCCAAG GAATACGCCG AACCCGAATG GTACGCCGAC
CTGTGGCTGA CCATCGTCTG GGTGACCTAT CTGCTCGTGT TCGTCGCGAC ATTGATGAAG
CGCAAAGAGC CGCACATCTA CGTCGCGAAC TGGTTCTACC TCGCGTTCAT CCTGACCATC
GCCGTGCTGC ATCTCGGCAA CAATCCGACG CTGCCGGTGT CGCTGCTCGG CTCGAAGTCG
TACATCGCCT GGGCCGGCGT GCAGGACGCG ATGTTCCAGT GGTGGTACGG CCACAACGCG
GTCGGCTTCT TCCTCACCGC CGGCTTCCTC GCCATCATGT ACTACTTCAT CCCGAAGCGG
GCGGAACGGC CGGTGTATTC CTATCGGCTG TCGATCATCC ACTTCTGGGC GCTGATCTTC
CTCTACATCT GGGCCGGCCC GCACCATCTG CACTACACCG CGCTGCCGGA CTGGACGCAG
ACGCTCGGCA TGACCTTCTC GATCATGCTG TGGATGCCCT CCTGGGGCGG CATGATCAAC
GGCCTGATGA CGCTGTCGGG CGCCTGGGAC AAGCTGCGCA CCGACCCGGT GCTCCGCATG
ATGGTGGTGT CGGTCGCGTT CTACGGCATG TCGACCTTCG AAGGCCCGAT GATGGCGATC
AAGGCCGTCA ACTCGCTCAG CCACTACACC GACTGGACCG TCGGCCACGT CCACTCCGGC
GCGCTCGGCT GGGTCGGCTT CGTCTCGTTC GGCGCGCTGT ACTGCCTGGT GCCGTGGATC
TGGGGCCGCA AGGAGCTCTA CAGCCTCCGG CTGGTGAACT GGCACTTCTG GATCGCGACG
CTCGGCATCG TGCTCTACAT CTCGGCGATG TGGGTGTCGG GGATCCTGCA GGGCCTGATG
TGGCGCGCCT ACACCTCGCT CGGCTTCCTC GAATATTCGT TCATCGAGTC CGTCGAGGCG
ATGCATCCCT TCTATGCCAT CCGCGCCGCG GGCGGCGGAC TGTTCCTAAT CGGCGCGCTG
ATCATGGCCT ACAACCTCTG GATGACGGTT CGCGTCGGCG AATCGTCGGA GGCCCAGCCC
CGCGTCGCCC TGCAGGCCGC CGAGTAA
 
Protein sequence
MNQALTSKRI TFGEGGLTLV FTLTTALCIY ASIFAKDAPF AFHAALGAAV SAWAVFAIIT 
RYSRRTGVPP QEINGVPNYN LGPIKFLSFM AMFWGIAGFS VGLYIALELA YPGLNVAQWV
NFGRLRPLHT SAVVFAFGGN VLLATSFYVV QRTTRARLAG DLAPWFVAIG YNFFIVIAGT
GYLLGVTQSK EYAEPEWYAD LWLTIVWVTY LLVFVATLMK RKEPHIYVAN WFYLAFILTI
AVLHLGNNPT LPVSLLGSKS YIAWAGVQDA MFQWWYGHNA VGFFLTAGFL AIMYYFIPKR
AERPVYSYRL SIIHFWALIF LYIWAGPHHL HYTALPDWTQ TLGMTFSIML WMPSWGGMIN
GLMTLSGAWD KLRTDPVLRM MVVSVAFYGM STFEGPMMAI KAVNSLSHYT DWTVGHVHSG
ALGWVGFVSF GALYCLVPWI WGRKELYSLR LVNWHFWIAT LGIVLYISAM WVSGILQGLM
WRAYTSLGFL EYSFIESVEA MHPFYAIRAA GGGLFLIGAL IMAYNLWMTV RVGESSEAQP
RVALQAAE