Gene RPB_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1959 
Symbol 
ID3908039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2225850 
End bp2227340 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content71% 
IMG OID637883853 
Producthypothetical protein 
Protein accessionYP_485578 
Protein GI86749082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.521354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCC TCACCGCCGA TTCGAGCGAC CTCGCCGGCG CACGGCACAG CGCCGCGCTG 
CCGCTGTGGA TCGGGGTCGC GGTCTACGCC TTCCTGCTGG TCTCCGGCGA CCGGCTGCTG
AACGATCCCG ATACGCTGTG GCAGGTCACG CTGGGGCAGT GGATCATCGA CCATCGCGCG
GTGCCGCAGG TCGACGTCTA TTCCTTCACC ATGGCCGGGC AGCCGTGGAT CTCGACGCAA
TGGCTGTCGC AGGTGCTACT GGCGGCGAGC TACCAACTCG CCGGCTGGGC CGGGCCGGTG
GTGCTGTCGG CCGCGGCGCT GGCGGCGACC TTCATGCTTT ATGCGCGCTG GCTCGGCCAG
CGGCTGCGCG GCAGTACCGC GCTGGTGTTC GTCGCCGCGG CGGTGGCGCT GATGGCGCCG
CATATGCTGG CGCGGCCGCA CGTGCTGGCG ATGCCGGTGA TGGTGGCATG GGTCGGCGCG
CTGATGGCGG CCGCCGATCG CCGCGCCGCG CCGTCGTTCT GGCTGCTGCC GCTGATGGTG
CTGTGGGCCA ATCTGCACGG CGGCTTCGTG CTCGGCATCG CGCTGGTGGC GCCGATCGCG
CTCGACGCGG TGCTGGCGGC GGCGCCGTCG CGCCGCGTTT CGCTGCTGCT GCGCTGGGGT
GCGTTCGGCG TCGCCGCGCT GATCGTGAGC TGCGCCAATC CGTACGGCTG GAACGCGATC
CTGGCGTCGC AGCGCATCCT GTCGCTCGGC GCGGCGCTGG CGACGATCGG CGAATGGGCG
CCGGTGAATT TCTCGCGGGT CGGGGCGTTC GAGGTTGCGC TGCTGGGAGG TCTCGGCCTC
GCTTTGTGGC GCGGCGTGAC GCTGCCGCCG GTGCGCATCC TGCTGCTGCT CGGCCTCGTG
CACATGGCGC TGAGCGCCAA CCGCAACATC GAAGTGCTGG CGCTGCTCGC GCCGCTGATC
ATCGCGGCGC CGCTGGCCCG GCAGATCGGC GGCGCCGAAG CGCGCGGCGA GGCGCCGGCG
CGCCTCTCGG TCGGCGCGGT GGCCGCGGCG ATCGCGCTCA CGCTCGGCGG CACGCTGGCG
TTCGCCTCGC TGCATCGCTT CGCGCCGCAT CCGGCGAATT CGCCGGCGGC GGCGGTGGCC
GAGCTGAAGA CGCTCGGCGT GCAGCGGGTG TTCAACGATT ACGATTTCGG CGGCTACCTG
ATCGCCAACG GCGTCGCGCC GTTCATCGAC GGCCGCACCG AACTGTACGG CGAGGCCTTC
GTGGTCGAAC AGAATTCCGC GGTGCGGCTG AAGCCGCCGG AGAAACTGTT CCGACTGCTC
GCGGACTACG ACATCGACGC CACGCTGCTC CGCACCGAGG ACGCGGCCAC CCATCTGCTC
GATCACATCG ACGGCTGGCA GAAGGTGTAT TCCGACGACG TCGCGACCAT CCATTTGCGC
CGGCCGGGCG CGCTGCACGG CGCCGCGCCG AAGGTCACGC CCGCGAACTG A
 
Protein sequence
MTILTADSSD LAGARHSAAL PLWIGVAVYA FLLVSGDRLL NDPDTLWQVT LGQWIIDHRA 
VPQVDVYSFT MAGQPWISTQ WLSQVLLAAS YQLAGWAGPV VLSAAALAAT FMLYARWLGQ
RLRGSTALVF VAAAVALMAP HMLARPHVLA MPVMVAWVGA LMAAADRRAA PSFWLLPLMV
LWANLHGGFV LGIALVAPIA LDAVLAAAPS RRVSLLLRWG AFGVAALIVS CANPYGWNAI
LASQRILSLG AALATIGEWA PVNFSRVGAF EVALLGGLGL ALWRGVTLPP VRILLLLGLV
HMALSANRNI EVLALLAPLI IAAPLARQIG GAEARGEAPA RLSVGAVAAA IALTLGGTLA
FASLHRFAPH PANSPAAAVA ELKTLGVQRV FNDYDFGGYL IANGVAPFID GRTELYGEAF
VVEQNSAVRL KPPEKLFRLL ADYDIDATLL RTEDAATHLL DHIDGWQKVY SDDVATIHLR
RPGALHGAAP KVTPAN