Gene RPB_3675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3675 
Symbol 
ID3911477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4214988 
End bp4216205 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID637885577 
ProductVWA containing CoxE-like 
Protein accessionYP_487281 
Protein GI86750785 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGTA ACCCGATGAC CGCGATGGAT CACCTCAACC CGCCGACCGG CAAGATGGCC 
GACAACATCG TCGGCTTCGC CCGCGCGCTG CGCGCGGCCG GCCTGCCGGT CGGGCCGGGC
GCGGTGATCG ATGCGCTGGA GGCGTTGCAG CTCATCGACA TCGGCAACCG CGCCGATCTC
TACGCCACGC TGGAGGCGAT CTTCGTCAAG CGTCGCGAGC ACGCGCTGAT TTTCGCGCAG
GCGTTCGCAC TGTTCTTCCG CGCCGCCGAA GAATGGCAGC ACATGCTGGA TTCGATCCCG
CTGCCCGATC ACGCCAAGAA GAAGCCGCCG CCCGCCTCGC GCCGCGTGCA GGAAGCGATG
GCGCCGTCCA CCACGCGCGA TTTCCCCGCC GCCGAAGAGC AGGAAGTGCG ACTCGCGGTG
TCCGACAAGG AGATCCTGCA GAAGAAGGAC TTCGCCCAGA TGAGCGCGGC GGAGATCGCC
GAGGTGACGC GGTCGATCGC GCGGATGCGC CTGCCGCAGG CGGAATTGCG CACCCGCCGC
GTCCGCCCGG ACAAGCGCGG CCTCAAGCTC GATCTGCGCC GCACGCTGCG CGCGTCGCTC
CGCACCGGTG GCGACATCGT CGATATTCGC AAGCTCGGGC TGATCGACAA GCCGGCGCCG
ATCGTGGCGC TCCTGGATAT CTCCGGCTCG ATGAGCGAGT ACACGCGGCT GTTCCTGCAT
TTCCTCCACG CCATCACCGA CGACCGCAAG CGCGTCTCGA CCTTCCTGTT CGGCACGCGG
CTGACCAACG TCACCCGCGC GCTGCGGGCG CGCGATCCGG ATGAGGCGCT GGCGAGCTGC
ACCTCGTCGG TCGAGGACTG GGCCGGCGGG ACGCGGATCG CGACCTCGCT GCACGGCTTC
AACAAGCTGT GGGCGCGCCG CGTGCTCGGG CAGGGCGCGA TCGTGCTGCT GATTTCGGAC
GGGCTCGAGC GCGAGGCGGA CTCCAAGCTG GCGTTCGAGA TGGACCGGTT GCATCGCTCC
TGCCGGCGGC TGATCTGGCT CAATCCTCTG CTGCGGTTCG GCGGCTTCGA ACCGCGCGCG
CAGGGCATTA AAATGATGCT GCCGCACGTT GACGAATTCC GCCCGGTGCA TAACTTGACC
TCGATGCAGG GGCTGATCGA GGCGCTGTCG TCGGCGCCGC CGCCGCACCA TTTCAGCGCG
ATCCGCTCCG CCGCCTGA
 
Protein sequence
MQRNPMTAMD HLNPPTGKMA DNIVGFARAL RAAGLPVGPG AVIDALEALQ LIDIGNRADL 
YATLEAIFVK RREHALIFAQ AFALFFRAAE EWQHMLDSIP LPDHAKKKPP PASRRVQEAM
APSTTRDFPA AEEQEVRLAV SDKEILQKKD FAQMSAAEIA EVTRSIARMR LPQAELRTRR
VRPDKRGLKL DLRRTLRASL RTGGDIVDIR KLGLIDKPAP IVALLDISGS MSEYTRLFLH
FLHAITDDRK RVSTFLFGTR LTNVTRALRA RDPDEALASC TSSVEDWAGG TRIATSLHGF
NKLWARRVLG QGAIVLLISD GLEREADSKL AFEMDRLHRS CRRLIWLNPL LRFGGFEPRA
QGIKMMLPHV DEFRPVHNLT SMQGLIEALS SAPPPHHFSA IRSAA