Gene RPB_3221 details

Gene Information

Locus tag: RPB_3221
Symbol:
ID: 3911022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3683141
End bp: 3684328
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 68%
IMG OID: 637885123
Product: hypothetical protein
Protein accession: YP_486828
Protein GI: 86750332
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
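
The coordinates and lengths in this record agree with one another, which a short sketch can confirm. It assumes (the page does not say so) that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3683141
end_bp = 3684328

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1188 395
```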


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.978135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.653443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGCA ATGCCGGCGC GGCGGATGCC GACGGCGAGC GTCTGGAAAT CGTCCGGACC 
GCCGAGCGGC TGCGCGAGAT CGGCCCGGCC TGGGAAGCGC TGTGGCACGA CGCCGGCGCG
CTGGTGTTCC AGAGTCATGC CTGGACCGCC GCGTGGTGGA ACGCGGTGCC CGACCGCCCG
CGCCGCGGAT TGTTCATCGT ACTGGCGTGG CGGCACGACA CGCTGGTGGC GGTGCTGCCG
CTGGCGACCT GCCGCTGGTA CGGCGTCCGC GTGCTGGAAT GGGCCGCCAA GGACTACTCC
GACTATTGCG ACGCGCTGCT GCGCCCGGGC ATCGGCCCGG CTGTGGTGCA GCGGATGTGG
GCCCATGCCG ATGTGCAGGG AGGTTTCGAC GCCGCCTATC TCGGCCATGT GCTGCCGACC
GCGATCGTGA ACACGCTGAC CGACGGAACG CGCGGCCGCG GCGTCGTGCT GCGTCCCCAC
TTCCGGCAGG CCACGAGCCT GCGCGTGGTC GGCCCCTGGA GCAACAGCCA GGCTTGGTTC
GACTCGCATT CCGGCAACGC GCGGCGCAAC TATCGCCGCG GTCTCAAGAC CCTTTCAGAC
AACGCCAAGG TCGAATTCCG GCTGATGGCA CCGGACGAGC CGCTCGGGCC CGCCTTGCAG
CGATGCGCCG AGCTGAAGCG CGCCTGGTGC GCCCGCAACG GCCTGGTGGC GCCGCTGTTC
GATGCCGGTT CGCCGATGCT GGAAGCGCTG GTGCAGGTGC TCGCCGACAA CAAGCTGCTG
CATGTGTTCG TGCTCGAGCG CGACGGCGTG ATCGTCGCCA TGACGGTCAA CCTGATGCAG
CACGCCACCA TGATGGCCTA TGTCACCACT TACGATTCCA GTTTCGAACG CAGTTCGCCC
GGCAACATCC TGCTGTTCGA CTACATCCGA TGGTCGATCG ATCACGGCGC GACGACCGTC
GATTTCCTGT GCGGCGACGA GGACTACAAA TATCGCTTCA GCAACCAGCA GGTCACCCTG
AACTCGTTCG CGGGGGGCCG CACGCTGCTG GGCAAGGCGG CGATCCTGGC GGACAAGGCG
CTGCACGCCG TCAACGCCTT CCGCGCGCGA TCGCTGAACC GCCCGTCGAA GTCCGCGGCG
AAGCCGGACG ATCGTGGCGC CCTCGGTGCG CCTGTCGGCG AACCCTAG
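
The GC content listed in the Gene Information section can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; gc_fraction is a hypothetical helper written for this illustration, not a function from any library.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    # Join the space- and newline-separated blocks into one uppercase string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Short made-up example, not the RPB_3221 sequence: 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # prints: 0.75
```

Passing the full gene sequence text to the same helper and multiplying by 100 gives a percentage that can be compared with the reported GC content.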
 
Protein sequence
MIGNAGAADA DGERLEIVRT AERLREIGPA WEALWHDAGA LVFQSHAWTA AWWNAVPDRP 
RRGLFIVLAW RHDTLVAVLP LATCRWYGVR VLEWAAKDYS DYCDALLRPG IGPAVVQRMW
AHADVQGGFD AAYLGHVLPT AIVNTLTDGT RGRGVVLRPH FRQATSLRVV GPWSNSQAWF
DSHSGNARRN YRRGLKTLSD NAKVEFRLMA PDEPLGPALQ RCAELKRAWC ARNGLVAPLF
DAGSPMLEAL VQVLADNKLL HVFVLERDGV IVAMTVNLMQ HATMMAYVTT YDSSFERSSP
GNILLFDYIR WSIDHGATTV DFLCGDEDYK YRFSNQQVTL NSFAGGRTLL GKAAILADKA
LHAVNAFRAR SLNRPSKSAA KPDDRGALGA PVGEP