Gene RPB_3091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3091 
Symbol 
ID3910892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3523359 
End bp3524447 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content64% 
IMG OID637884995 
Producthypothetical protein 
Protein accessionYP_486700 
Protein GI86750204 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.972235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCT CGAAGTCCGT GAAATCACCA TCAACTGGGT TGTTTGGGGA GGGAGCCGTG 
CCCGCTAGTT CCGGATCGAA CGCGCGCCTG TCACGCCGCA CACTGATCGG CGGCGCTCTC
GCCGCTCCGT TTGTGCTGCG CAGCGGACGG GCGTTGGCCG ACGAACCCCT GTCGGTTCGC
GTCGACTTCG CACCCTGGGG GGTGCACTCC GGTTTGCATC TGTCGAAAGC GAAGGGATGG
TTCAAGGAAG ACGGCCTAAA CGTCGACCTG CAGGACGGAA CCGGCACGCT CAACACCATC
AATCTGGTCG CCGCCGGCAA TGTCGATGTC GGACTGGTTC AGCTCGGGAT GCTGGCGATC
GCGCGGTCGC AGGGACTGCC CGTCACGTCG TTCGCCGGCT TCCTGCGCAA GGGCGATCTC
GCGACCTTGG TTGACGCCAA GGCCGGGCCG AAGACCCCGC AGGACCTCGC CGGCAAGAAG
ATCGTCTGTT TCGCCAACAG CCCCTGGGCG CCGTTCGTCG ACGTGTACTT GAAGCGCATC
GGCCTTTCGC GCGGCGAAGG ACCCGACAAG GTCAATGTCG TCATGGTGTC GCCGGCGGCG
ATGGTTTCGA CCTATGCGTC GGGCGCGGCG GACGGCTTCA TGTCGCTCAA GGAATTCGGC
GAGCCTTATG TCGAACAGGC CCGGCCTGCT CGCTCGCTGC TGGCGGCCGA TGTCGGCATC
GCGTTTCCGA GCTACGGTCT GATCGCCACC GATGCGACGC TCGCGAAACG CAAGGATCTG
CTCGCCAAGC TCGTCGCCAA TCAGCGTCGG GCCTGGGACT ACATCTTCGC GGACCCGTCC
CACATCGACG AAGGCGTGCG CGCCATCATC GCCAACCGTC CGGACAAGCA GCTCAACTTC
GACATCCTCA AGGGGCAGAC CGCACTCTGC AAGGAGTTCG TCGACACCGA AAACACCAAG
GGCAAGCCGC TCGGCTGGCA GTCGCCTGCC GATTGGAAGG CCACGATCGC GATGATGGCG
GAAGCCGGTC AGGCCAAGGC GGACGCCGAC GTCTCCGGAT TCTTCACCAA CGATCTGGTC
GGGGCATGA
 
Protein sequence
MTISKSVKSP STGLFGEGAV PASSGSNARL SRRTLIGGAL AAPFVLRSGR ALADEPLSVR 
VDFAPWGVHS GLHLSKAKGW FKEDGLNVDL QDGTGTLNTI NLVAAGNVDV GLVQLGMLAI
ARSQGLPVTS FAGFLRKGDL ATLVDAKAGP KTPQDLAGKK IVCFANSPWA PFVDVYLKRI
GLSRGEGPDK VNVVMVSPAA MVSTYASGAA DGFMSLKEFG EPYVEQARPA RSLLAADVGI
AFPSYGLIAT DATLAKRKDL LAKLVANQRR AWDYIFADPS HIDEGVRAII ANRPDKQLNF
DILKGQTALC KEFVDTENTK GKPLGWQSPA DWKATIAMMA EAGQAKADAD VSGFFTNDLV
GA