Gene RPB_3370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3370 
Symbol 
ID3911172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3853080 
End bp3855155 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content67% 
IMG OID637885273 
Productcytochrome c, class I 
Protein accessionYP_486977 
Protein GI86750481 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants
[COG3258] Cytochrome c 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.384758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0138407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGA ACAAGCGCAG CCTCGGCAAA CGCCTCGGTC TGATCGCGGC CTTTGCTGTC 
GTCGCGATCG GCGGCGCCGC GGCCTGGATC GTGCAACCGC CGCCGATCGA GCGCAACGAC
CCGGGTTTCG GCGCCAATGT CGATCCCGCG CTGATCGCGC GCGGTGAATA TGTCGCGCGG
CTGGGCGACT GCGTCGCCTG CCACACCGCC GAGGGCGGAC CTCTGATGGC TGGCGGCCGG
GCGCTCGAGA CGCCGTTCGG CAAGGTGTAC TCGACCAATG TGACGCCCGA TCCGAAAACC
GGCATCGGGC AATGGTCGTT CGGCGCGTTC GATCGCGCGA TGCGCAAGGG CGTCTCGGCG
GACGGTCACA ATCTCTATCC GGCGATGCCC TACCCGTCTT ACGCCAAGAT GACGGACGAC
GACATGAAGG CGCTGTGGGC CTATCTCCTC AAGGGCCTCG CGCCGGTCGA GAAGGCCAAT
CTCCCGCTCG AGATGCAGTT TCCGTTCAAC ATGCGCATCG CGCTCGCCGC CTGGAATTTC
GCCTTCCTGG ATGCGACGCC GTTCAAGGCC GAGCCTGCGA AGGATGCTGT GTGGAATCGC
GGCGCGTATC TCGTCCAGGG TCTCGGCCAT TGCGGCGCCT GTCACACGCC GCGCGGCATC
GGTTTCCAGG AAAAGGCGAT GAGCGACGCC GGACCGGCGG GCCATTTCTA TCTCGCCGGC
GCCAAGGTCG AAAACTGGAA TGCGATCGAG CTACGCGATT TGTGGACCGT CGAGGACACC
GTCCTGTTGC TGAAGACCGG CCAGAACCGC TTCGCCACCG CGTCCGGCAG CATGACCGAT
GTCATCCTGC ATTCGACGCA GGGACTCTCC GACCAGGACC TGACGGCGAT CGCGACCTAT
CTCAAGGCGC TGCCGTCCGA CCGGCCGAAG GCCGAGCCAC GGATCGCATC GACCGAGGCG
CCTGCGGCCA CCTTCACCAC CCGCGGCGGG CTCGGCTACG CACAGTTCTG CACCGACTGT
CACCGGGCCG ACGGCGCCGG CGTGAAGGGC ATCTTCCCGC CGCTCGCGGG CAATCCCACC
GTCACTTCGA AAGACCCGGC GACGCTGGTG CATATCGCGC TGACCGGCTG GAAGACCACC
GCGACCGCCG CCCATCCGAG GGTCTGGACG ATGCCGGCCT TCGCCCGCCT CGCCGATCGC
GAGATCGCCG AGATCCTGTC CTTCGTGCGC GAGAGCTGGG GCGAAGGCGC CCGCGCCGTC
AGCGAGGCCG AGGTCGCCGC GGCGCGTGCC GCGCTCGATC CGAAGATCGA CAAGTCGCTG
TTCGAGACGC CACGTCTCGC CGATCTGCTG GCGCAGCCCA ACGCGCCGCA ACTGGTCCGC
GGCATGCGGC TCAACGCCGA AACCCGCACG CTGTTGCCGA ACCATGTCGG CGATGAACTC
AACTGCGCCT CGTGCCATCT CAACGCCGGA ACGGTCGCGG ACGGCAGCCC CTATGTCGGC
GTCTCCGCGT TCTTCCCCAG TTACGCGCCG CGCGCCGGTC GCGAGATCAC GCTGGAGGAC
CGCATCAACG GCTGCTTCCT GCGCTCGATG AACGGCAAGC CGCTGCCGGT CGACGGCCCC
GACATGCAGG CGATGGTCGC CTATTTCAAC TGGATGAAAG GCGCGACCAA GCCGTCCGAC
AAAGTGGCCG GACGCGGCGT CGGCAAGGTC GACACGACGT TGAAGCCCGA TCCGGACAAC
GGCAAGGCGA TCTACGTGGC GCAATGCGTG GCCTGCCACG GCCAGAACGG CGAAGGCCTG
AAAGACGCAG CCGGCCGGCT GGTCTATCCC CCGCTGTGGG GCGAGCACTC GTTCAACATC
GGCGCCGGCA TGGCGCGCAC CTACACCGCC GCCGCCTTCG TCAAGCGCAA CATGCCGATC
GGCACCCACG AAAAATTCCC GCTGGGACAA GGCAGTCTGA CCGATCAGGA GGCGATCGAC
GTCGCGGAGT ACTTCACCCA CATGGAGCGG CCGGACTTCG CTCCAAAGGT CAAGGACTGG
CCGAAGGGCA ACAAGCCGAA GGACGCCCGC TACTGA
 
Protein sequence
MTSNKRSLGK RLGLIAAFAV VAIGGAAAWI VQPPPIERND PGFGANVDPA LIARGEYVAR 
LGDCVACHTA EGGPLMAGGR ALETPFGKVY STNVTPDPKT GIGQWSFGAF DRAMRKGVSA
DGHNLYPAMP YPSYAKMTDD DMKALWAYLL KGLAPVEKAN LPLEMQFPFN MRIALAAWNF
AFLDATPFKA EPAKDAVWNR GAYLVQGLGH CGACHTPRGI GFQEKAMSDA GPAGHFYLAG
AKVENWNAIE LRDLWTVEDT VLLLKTGQNR FATASGSMTD VILHSTQGLS DQDLTAIATY
LKALPSDRPK AEPRIASTEA PAATFTTRGG LGYAQFCTDC HRADGAGVKG IFPPLAGNPT
VTSKDPATLV HIALTGWKTT ATAAHPRVWT MPAFARLADR EIAEILSFVR ESWGEGARAV
SEAEVAAARA ALDPKIDKSL FETPRLADLL AQPNAPQLVR GMRLNAETRT LLPNHVGDEL
NCASCHLNAG TVADGSPYVG VSAFFPSYAP RAGREITLED RINGCFLRSM NGKPLPVDGP
DMQAMVAYFN WMKGATKPSD KVAGRGVGKV DTTLKPDPDN GKAIYVAQCV ACHGQNGEGL
KDAAGRLVYP PLWGEHSFNI GAGMARTYTA AAFVKRNMPI GTHEKFPLGQ GSLTDQEAID
VAEYFTHMER PDFAPKVKDW PKGNKPKDAR Y