Gene Rpal_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0020 
Symbol 
ID6407661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp20369 
End bp22015 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content62% 
IMG OID642709927 
Productcbb3-type cytochrome c oxidase subunit I 
Protein accessionYP_001989058 
Protein GI192288453 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3278] Cbb3-type cytochrome oxidase, subunit 1 
TIGRFAM ID[TIGR00780] cytochrome c oxidase, cbb3-type, subunit I 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAG CCCTGAATTC GAAGCGGATC ACCCTTGGCG AGGGTGGTCT CACGGTCGTG 
TTCGCATTGA CCACGATCCT GTCCATCTAT GCGGCGATTT ACGCCAAGGA TGCTCCCTTC
GCATTTCATG CCGCGCTAGC GGCCGCGTTC AGCGCATGGG CGGTGTTCGC GATCATCGTT
CGGCATCGCA GCCGTTCGGG CCTGCCGCCG CAGGAAATCA ACGGCGCTCC GAATTACAAT
CTCGGGCCGA TAAAGTTCCT CTCGTTCATG GCGATGTTTT GGGGCATCGC CGGCTTCTCG
GCCGGCCTCT ACATTGCCCT TGAGCTTGCC TATCCGGGCT TGAACATCGC CCAGTGGGTC
AACTTCGGCC GGCTCCGTCC GCTGCACACT TCGGCGGTGA TCTTCGCATT CGGCGGCAAC
GTGCTGCTCG CCACCTCGTT CTACGTGGTG CAGAAGACCA CCCGGGCACG GCTCGCCGGT
GACTTGGCGC CGTGGTTTGT GGCGATCGGC TACAACTTCT TCATCGTCAT CGCCGGCACT
GGCTATCTCC TGGGTGTCAC CCAGTCGAAG GAATACGCCG AGCCGGAATG GTACGCCGAC
CTCTGGCTCA CCATCGTTTG GGTTACCTAC CTGCTGGTGT TCCTGGCGAC GCTGATGAAG
CGCAAGGAAC CCCACATCTA CGTGGCGAAC TGGTTCTATC TCGCGTTCAT CATCACTATC
GCGGTGCTGC ACCTCGGCAA CAACCCGACG CTGCCGGTCA GCTTCCTCGG CTCGAAGTCC
TACATCGCCT GGTCGGGCGT GCAGGACGCG ATGTTCCAGT GGTGGTACGG CCACAACGCG
GTCGGCTTCT TCCTCACCGC CGGCTTCCTC GCCATCATGT ACTACTTCAT CCCGAAGCGG
GCTGAACGGC CGGTCTATTC CTATCGGCTG TCGATCATCC ACTTCTGGGC GCTGATCTTC
CTGTACATCT GGGCTGGTCC GCACCACCTG CACTACACCG CACTGCCCGA CTGGACGCAG
ACCCTCGGCA TGACCTTCTC GATCATGCTG TGGATGCCCT CCTGGGGCGG CATGATCAAC
GGCCTGATGA CGCTGTCGGG CGCCTGGGAC AAGCTCCGCA CCGACCCCGT GCTGCGCATG
ATGGTGGTGT CGGTCGCCTT CTACGGCATG TCGACCTTCG AAGGTCCGAT GATGGCGATT
AAGGCGGTCA ACTCGCTCAG CCACTACACC GACTGGACCA TCGGCCACGT CCACTCCGGC
GCGCTCGGCT GGGTCGGCTT CGTCTCCTTC GGTGCTCTGT ACTGCCTCGT GCCGTGGGTC
TGGGGCCGCA AGCAGCTCTA CAGCATCCGC CTGGTGAACT GGCACTTCTG GATCGCGACC
CTCGGCATCG TCCTCTACAT CTCGGCGATG TGGGTGTCGG GCATCCTGCA AGGTCTGATG
TGGCGTGCCT ACACCTCGCT CGGCTTCCTC GAATACTCGT TCATCGAGTC GGTCGAAGCG
ATGCACCCCT TCTATGCCAT CCGCGCCGCC GGCGGCGGCC TGTTCCTGAT CGGCGCGCTG
ATCATGGCAT ACAACCTCTG GATGACCGTT CGCGTCGGCG AACGGTCCGA GGCTCAGCCT
CGCGTCGCCC TGCAGCCTGC CGAGTAA
 
Protein sequence
MNQALNSKRI TLGEGGLTVV FALTTILSIY AAIYAKDAPF AFHAALAAAF SAWAVFAIIV 
RHRSRSGLPP QEINGAPNYN LGPIKFLSFM AMFWGIAGFS AGLYIALELA YPGLNIAQWV
NFGRLRPLHT SAVIFAFGGN VLLATSFYVV QKTTRARLAG DLAPWFVAIG YNFFIVIAGT
GYLLGVTQSK EYAEPEWYAD LWLTIVWVTY LLVFLATLMK RKEPHIYVAN WFYLAFIITI
AVLHLGNNPT LPVSFLGSKS YIAWSGVQDA MFQWWYGHNA VGFFLTAGFL AIMYYFIPKR
AERPVYSYRL SIIHFWALIF LYIWAGPHHL HYTALPDWTQ TLGMTFSIML WMPSWGGMIN
GLMTLSGAWD KLRTDPVLRM MVVSVAFYGM STFEGPMMAI KAVNSLSHYT DWTIGHVHSG
ALGWVGFVSF GALYCLVPWV WGRKQLYSIR LVNWHFWIAT LGIVLYISAM WVSGILQGLM
WRAYTSLGFL EYSFIESVEA MHPFYAIRAA GGGLFLIGAL IMAYNLWMTV RVGERSEAQP
RVALQPAE